Head Synthesis
Head synthesis aims to generate realistic, animatable 3D head models from inputs such as video and audio, targeting high fidelity and real-time performance. Current research relies heavily on neural radiance fields (NeRFs), Gaussian splatting, and diffusion models, often combined with 3D morphable models and hierarchical architectures to improve efficiency, controllability, and temporal consistency. The field matters for virtual and augmented reality, film production, and healthcare, where advances improve avatar creation and animation and may eventually support robotic procedures.
Papers
LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Tianqi Li, Ruobing Zheng, Bonan Li, Zicheng Zhang, Meng Wang, Jingdong Chen, Ming Yang
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
Tianqi Li, Ruobing Zheng, Minghui Yang, Jingdong Chen, Ming Yang