Source Speech
Source speech analysis focuses on extracting meaningful information from spoken language, encompassing tasks like transcription correction, speaker identification, emotion recognition, and topic segmentation. Current research heavily utilizes large language models (LLMs) and transformer-based architectures, often incorporating techniques like self-supervised learning, multi-task learning, and multilingual training to improve performance and robustness across diverse languages and speaking styles. These advancements are driving progress in various applications, including improved speech-to-speech translation, real-time voice conversion, and enhanced accessibility for low-resource languages.
Papers
November 21, 2024
November 11, 2024
October 3, 2024
September 15, 2024
September 10, 2024
September 8, 2024
August 25, 2024
June 29, 2024
May 28, 2024
March 25, 2024
January 19, 2024
January 11, 2024
January 5, 2024
November 29, 2023
October 17, 2023
October 1, 2023
September 25, 2023
September 15, 2023
September 6, 2023
September 3, 2023