Speech Generation Task
Speech generation research focuses on creating high-quality, natural-sounding speech from various inputs, such as text or lip movements. Current efforts concentrate on improving model efficiency through techniques like prompting and distillation, alongside developing robust and diverse datasets for training and evaluation, including those addressing singing voice and multilingual speech. These advancements are crucial for enhancing applications like text-to-speech synthesis, voice conversion, and speech enhancement, ultimately leading to more realistic and accessible speech technologies.
Papers
October 27, 2024
August 23, 2024
August 12, 2024
June 16, 2024
June 13, 2024
March 26, 2024
December 25, 2023
July 23, 2023
December 8, 2022