Speech Driven
Speech-driven research focuses on developing computational models that effectively process and understand spoken language, encompassing tasks like speech recognition, speaker identification, and emotion detection. Current research emphasizes multi-task learning frameworks, often employing transformer-based architectures and diffusion models, to improve the robustness and efficiency of these models across diverse scenarios and languages. This field is crucial for advancing human-computer interaction, improving accessibility for individuals with communication challenges, and enabling more sophisticated applications in areas like personalized healthcare and virtual assistants.
Papers
February 23, 2023
February 20, 2023
February 18, 2023
January 29, 2023
December 13, 2022
November 9, 2022
October 31, 2022
October 19, 2022
October 15, 2022
June 26, 2022
April 4, 2022
March 17, 2022
December 23, 2021
December 10, 2021
November 12, 2021