Text to Talking Avatar

Text-to-talking avatar research aims to generate realistic, animatable 3D avatars that speak from textual input. Current work focuses on improving avatar quality and animation fidelity, often using progressive generation (building an avatar step by step), disentangled representations (modeling body and clothing separately for finer control), and neural rendering. This work is driven by demand for high-quality, personalized digital humans in applications such as virtual assistants, video conferencing, and entertainment, and it affects fields such as computer graphics, natural language processing, and speech synthesis.

Papers