Multi Person Pose Estimation

Multi-person pose estimation (MPPE) aims to accurately locate the key body joints of multiple individuals within an image or video, a crucial task in computer vision with applications ranging from human-robot interaction to sports analysis. Recent research emphasizes developing efficient and accurate single-stage methods, often employing transformer networks or novel convolutional architectures designed for real-time performance on resource-constrained devices like smartphones. These advancements focus on improving robustness to occlusions and crowded scenes, often through innovative approaches to modeling inter- and intra-person relationships and incorporating instance-aware keypoint estimation. The resulting improvements in speed and accuracy are driving significant progress in various fields requiring real-time human understanding.

Papers