Text-to-Speech Synthesis
Text-to-speech (TTS) synthesis converts written text into natural-sounding speech, and research in the area focuses on improving both the quality of the generated audio and the efficiency of generating it. Current work emphasizes faster, more lightweight models, often built on diffusion models, autoregressive methods, and transformer architectures, and explores techniques such as post-training quantization to reduce computational demands. These advances matter for expanding access to speech technologies across diverse languages and resource-constrained environments, with applications ranging from accessibility tools to personalized communication systems.
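To make the post-training quantization idea concrete, here is a minimal sketch, assuming symmetric per-tensor int8 quantization applied to a plain list of already-trained weights. The function names (`quantize_int8`, `dequantize`) and the sample weights are hypothetical illustrations and are not taken from any particular TTS system or library; real models typically quantize per layer or per channel and may also quantize activations.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127].

    Illustrative helper (hypothetical name), not a library API.
    """
    max_abs = max((abs(w) for w in weights), default=0.0)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the quantized integers."""
    return [v * scale for v in q]


# Hypothetical weights standing in for one layer of a trained model.
weights = [0.5, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# The reconstruction error per weight is bounded by about half a step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Storing each weight as an 8-bit integer plus one shared scale cuts memory roughly fourfold relative to 32-bit floats, at the cost of a rounding error of at most about half a quantization step per weight. Whether that error is audible in synthesized speech depends on the model and has to be measured.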