Text to Speech
Text-to-speech (TTS) research aims to synthesize natural-sounding human speech from textual input, focusing on improving speech quality, speaker similarity, and efficiency. Current efforts concentrate on developing advanced architectures like diffusion models and transformers, often incorporating techniques such as flow matching and semantic communication to enhance both the naturalness and expressiveness of generated speech. This field is crucial for applications ranging from assistive technologies and accessibility tools to combating deepfakes and creating more realistic synthetic datasets for training other AI models.
Papers
March 17, 2024
March 13, 2024
March 9, 2024
March 7, 2024
March 5, 2024
February 29, 2024
February 26, 2024
February 22, 2024
February 19, 2024
February 12, 2024
February 9, 2024
February 8, 2024
February 5, 2024
February 1, 2024
January 28, 2024
January 25, 2024
January 24, 2024
January 23, 2024