Mandarin Speech
Research on Mandarin speech focuses on improving automatic speech recognition (ASR) and text-to-speech (TTS) systems, particularly for challenging scenarios like children's speech, diverse dialects (e.g., Hakka), and noisy environments. Current efforts leverage advanced models such as Conformers, HuBERT, and large language models (LLMs), often incorporating techniques like multi-modal and multi-granularity approaches to enhance accuracy and robustness. These advancements are crucial for developing applications in education, healthcare (e.g., personalized TTS for the speech impaired), and human-computer interaction, while also contributing significantly to language preservation and revitalization efforts for under-resourced dialects.
Papers
September 27, 2024
September 20, 2024
September 3, 2024
August 18, 2024
July 15, 2024
June 7, 2024
May 24, 2024
May 6, 2024
March 25, 2024
October 7, 2023
September 18, 2023
September 15, 2023
August 27, 2023
April 13, 2023
December 11, 2022
December 10, 2022
October 24, 2022
October 11, 2022
October 4, 2022