Speech Processing
Speech processing research aims to enable computers to understand, interpret, and generate human speech, focusing on tasks like speech recognition, synthesis, and enhancement. Current efforts concentrate on improving model efficiency (e.g., using linear-complexity attention mechanisms) and robustness across diverse languages and acoustic conditions, often leveraging large language models and self-supervised learning techniques. These advancements are crucial for broader accessibility of speech technology, impacting fields ranging from healthcare (e.g., depression screening) to assistive technologies and improving human-computer interaction.
Papers
June 20, 2024
June 14, 2024
June 11, 2024
June 10, 2024
June 5, 2024
May 21, 2024
May 7, 2024
April 29, 2024
April 26, 2024
April 17, 2024
April 9, 2024
March 6, 2024
February 26, 2024
February 20, 2024
January 22, 2024
January 14, 2024
January 8, 2024
January 4, 2024
December 6, 2023