Language Recognition
Spoken language recognition (SLR) aims to automatically identify the language spoken in an audio recording, a crucial task for various applications like multilingual human-computer interaction and data analysis. Current research emphasizes improving model calibration to enhance reliability, addressing challenges posed by mixed-language audio and cross-domain variations using techniques like energy-based models, optimal transport, and multi-agent data generation frameworks. These advancements, along with explorations of efficient model architectures and the development of new datasets for under-resourced languages, are driving improvements in accuracy and robustness, particularly for real-world scenarios with noisy or limited data.
Papers
November 11, 2024
June 26, 2024
October 27, 2023
October 26, 2023
October 20, 2023
June 7, 2023
June 2, 2023
February 28, 2023
February 27, 2023
November 30, 2022
May 14, 2022
April 13, 2022
March 28, 2022
January 5, 2022