Motion Disentanglement

Motion disentanglement aims to separate different components of motion within video data, such as object movement, camera movement, and character actions, to enable more precise control and understanding of visual dynamics. Current research focuses on developing models that leverage techniques like spatial and temporal disentanglement, often employing diffusion models, GANs, or specialized encoder-decoder architectures with losses designed to promote separation of motion features. This research is significant for advancing video generation, manipulation, and analysis, with applications ranging from realistic 3D human body reconstruction to improved human pose estimation and unsupervised depth prediction in dynamic scenes.

Papers