Speech Classification Task
Speech classification is the task of automatically assigning spoken audio to predefined classes, such as emotions, languages, or speaker identities. Current research focuses on improving accuracy and robustness, particularly in low-resource settings, through semi-supervised learning, multimodal approaches that combine audio and text features (often with transformer-based models such as wav2vec 2.0 for audio and BERT for text), and prompt tuning. These advances matter for applications ranging from voice assistants and healthcare diagnostics to environmental monitoring and wildlife research, all of which depend on accurate and reliable audio analysis.
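As a rough illustration of the multimodal idea mentioned above, the sketch below concatenates an audio embedding and a text embedding and passes the result through a linear layer with a softmax (a simple form of late fusion). Everything in it is an assumption made for illustration: the embeddings are random placeholders standing in for the outputs of an audio encoder and a text encoder, the weights are untrained, and the label set, dimensions, and function names are hypothetical. It shows one possible fusion scheme, not the method of any particular paper.

```python
import math
import random


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_and_classify(audio_emb, text_emb, weights, bias):
    """Late fusion: concatenate both embeddings, then apply one linear layer."""
    fused = audio_emb + text_emb  # list concatenation, not element-wise addition
    logits = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(logits)


random.seed(0)

# Hypothetical sizes and labels; real encoder outputs are much larger.
AUDIO_DIM, TEXT_DIM = 8, 8
LABELS = ["angry", "happy", "neutral"]

# Placeholder embeddings standing in for audio and text encoder outputs.
audio_emb = [random.gauss(0.0, 1.0) for _ in range(AUDIO_DIM)]
text_emb = [random.gauss(0.0, 1.0) for _ in range(TEXT_DIM)]

# Untrained classifier parameters: one weight row per class.
weights = [
    [random.gauss(0.0, 0.1) for _ in range(AUDIO_DIM + TEXT_DIM)]
    for _ in LABELS
]
bias = [0.0] * len(LABELS)

probs = fuse_and_classify(audio_emb, text_emb, weights, bias)
predicted = LABELS[probs.index(max(probs))]
print(predicted, probs)
```

Because the weights are random, the predicted label carries no meaning here; in practice the linear layer (and possibly the encoders) would be trained on labeled speech.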
Papers
September 25, 2024
July 23, 2024
July 10, 2024
June 26, 2024
January 15, 2024
October 19, 2023
September 28, 2023
September 19, 2023
June 7, 2023
March 1, 2023
November 23, 2022
October 28, 2022
October 26, 2022