Speech Recognition Accuracy
Automatic speech recognition (ASR) aims to accurately convert spoken language into text, a crucial task with broad applications. Current research focuses on improving accuracy, particularly for challenging scenarios like low-resource languages, accented speech, and noisy environments, often employing techniques like retrieval-augmented generation and contextual awareness within transformer-based models (e.g., Conformers) and large language models (LLMs). These advancements are vital for enhancing the accessibility and reliability of speech technologies across diverse populations and applications, including healthcare, assistive technologies, and human-computer interaction.
Papers
October 19, 2024
September 18, 2024
September 13, 2024
August 12, 2024
July 19, 2024
June 28, 2024
June 13, 2024
May 15, 2024
April 10, 2024
April 4, 2024
March 16, 2024
February 2, 2024
January 21, 2024
January 12, 2024
December 15, 2023
September 27, 2023
September 25, 2023
May 30, 2023
May 12, 2023