Low Resource Text to Speech
Low-resource text-to-speech (TTS) research focuses on generating high-quality synthetic speech from limited training data, addressing the significant data scarcity problem for many languages. Current efforts explore techniques like transfer learning across languages using various input representations (phonetic features, articulatory features), data augmentation strategies (noise addition, pitch shifting, voice conversion), and semi-supervised learning methods to improve model performance with minimal data. These advancements are crucial for expanding access to speech technologies for under-resourced languages and communities, impacting fields like accessibility, language preservation, and personalized voice assistants.
Papers
October 23, 2024
September 27, 2024
October 25, 2023
June 21, 2023
June 16, 2023
June 1, 2023
October 27, 2022
October 25, 2022
July 13, 2022
May 24, 2022
April 21, 2022
March 29, 2022
March 7, 2022
February 16, 2022