Human Pose
Human pose estimation, the task of determining the 3D configuration of a human body from images or sensor data, aims to accurately and efficiently capture human movement and posture. Current research focuses on improving robustness to challenges like occlusions, variations in viewpoint and lighting, and data scarcity, often employing diffusion models, transformers, and graph convolutional networks to achieve this. These advancements are driving progress in diverse applications, including human-computer interaction, animation, robotics, healthcare (e.g., gait analysis), and activity recognition, by enabling more accurate and nuanced understanding of human motion. The field is also actively addressing issues of data quality and bias in training datasets to enhance the reliability and generalizability of pose estimation models.
Papers
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Yiheng Li, Ruibing Hou, Hong Chang, Shiguang Shan, Xilin Chen
UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
Xingyu Liu, Gu Wang, Ruida Zhang, Chenyangguang Zhang, Federico Tombari, Xiangyang Ji
From Text to Pose to Image: Improving Diffusion Model Control and Quality
Clément Bonnett, Ariel N. Lee, Franck Wertel, Antoine Tamano, Tanguy Cizain, Pablo Ducru
VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference
Seong Jong Yoo, Snehesh Shrestha, Irina Muresanu, Cornelia Fermüller
Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images
Ammar Qammaz, Nikolaos Vasilikopoulos, Iason Oikonomidis, Antonis A. Argyros
Try-On-Adapter: A Simple and Flexible Try-On Paradigm
Hanzhong Guo, Jianfeng Zhang, Cheng Zou, Jun Li, Meng Wang, Ruxue Wen, Pingzhong Tang, Jingdong Chen, Ming Yang
SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems
Kanghyun Kim, Min Jun Kim