Human Pose
Human pose estimation, the task of determining the 3D configuration of a human body from images or sensor data, aims to accurately and efficiently capture human movement and posture. Current research focuses on improving robustness to challenges like occlusions, variations in viewpoint and lighting, and data scarcity, often employing diffusion models, transformers, and graph convolutional networks to achieve this. These advancements are driving progress in diverse applications, including human-computer interaction, animation, robotics, healthcare (e.g., gait analysis), and activity recognition, by enabling more accurate and nuanced understanding of human motion. The field is also actively addressing issues of data quality and bias in training datasets to enhance the reliability and generalizability of pose estimation models.
Papers
MotionMap: Representing Multimodality in Human Pose Forecasting
Reyhaneh Hosseininejad, Megh Shukla, Saeed Saadatnejad, Mathieu Salzmann, Alexandre Alahi
DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images
Enbo Huang, Yuan Zhang, Faliang Huang, Guangyu Zhang, Yang Liu
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Sherwin Bahmani, Ivan Skorokhodov, Guocheng Qian, Aliaksandr Siarohin, Willi Menapace, Andrea Tagliasacchi, David B. Lindell, Sergey Tulyakov
G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
Tianxing Chen, Yao Mu, Zhixuan Liang, Zanxin Chen, Shijia Peng, Qiangyu Chen, Mingkun Xu, Ruizhen Hu, Hongyuan Zhang, Xuelong Li, Ping Luo
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Yiheng Li, Ruibing Hou, Hong Chang, Shiguang Shan, Xilin Chen
UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
Xingyu Liu, Gu Wang, Ruida Zhang, Chenyangguang Zhang, Federico Tombari, Xiangyang Ji