Talking Face Video
Talking face video generation aims to synthesize realistic videos of a person speaking, driven by audio input. Current research focuses on improving lip synchronization and visual fidelity using various approaches, including diffusion models, neural radiance fields (NeRFs), and transformer-based architectures, often incorporating optical flow for smoother transitions and attention mechanisms for enhanced feature extraction. These advancements have significant implications for applications such as virtual avatars, video conferencing, and film production, while also raising concerns about deepfake detection and the ethical implications of realistic video manipulation.
Papers
November 6, 2024
August 10, 2024
July 22, 2024
May 23, 2024
March 29, 2024
August 18, 2023
July 8, 2023
May 23, 2023
March 21, 2023
December 9, 2022
May 13, 2022
April 13, 2022
April 6, 2022
March 10, 2022
January 16, 2022