Automatic Speech Recognition
Automatic Speech Recognition (ASR) aims to accurately transcribe spoken language into text, driving research into robust and efficient models. Current efforts focus on improving accuracy and robustness through techniques like consistency regularization in Connectionist Temporal Classification (CTC), leveraging pre-trained multilingual models for low-resource languages, and integrating Large Language Models (LLMs) for enhanced contextual understanding and improved handling of diverse accents and speech disorders. These advancements have significant implications for accessibility, enabling applications in diverse fields such as healthcare, education, and human-computer interaction.
Papers
October 7, 2022
October 6, 2022
October 3, 2022
September 30, 2022
September 27, 2022
September 26, 2022
September 24, 2022
September 17, 2022
September 16, 2022
September 15, 2022
September 13, 2022
September 12, 2022
September 8, 2022
September 7, 2022
September 6, 2022
September 5, 2022