Speech Generation Task

Speech generation research focuses on creating high-quality, natural-sounding speech from various inputs, such as text or lip movements. Current efforts concentrate on improving model efficiency through techniques like prompting and distillation, alongside developing robust and diverse datasets for training and evaluation, including those addressing singing voice and multilingual speech. These advancements are crucial for enhancing applications like text-to-speech synthesis, voice conversion, and speech enhancement, ultimately leading to more realistic and accessible speech technologies.

Papers