ASR Model
Automatic speech recognition (ASR) models aim to accurately transcribe spoken language into text, a task crucial for numerous applications. Current research emphasizes improving model robustness across diverse accents, languages, and noisy environments, often leveraging transformer-based architectures like Wav2Vec 2.0 and Conformer, and incorporating visual information for improved accuracy. Significant efforts focus on addressing biases in ASR models, enhancing efficiency through knowledge distillation and self-supervised learning, and developing methods for low-resource languages. These advancements are driving progress in various fields, including accessibility technologies, human-computer interaction, and language documentation.
Papers
November 14, 2024
November 10, 2024
November 6, 2024
October 16, 2024
September 19, 2024
August 22, 2024
June 14, 2024
May 11, 2024
April 2, 2024
July 20, 2023
July 13, 2023
June 28, 2023
June 11, 2023
June 1, 2023
May 21, 2023
May 18, 2023
May 5, 2023
April 20, 2023
March 25, 2023