Automatic Speech Recognition Error
Automatic Speech Recognition (ASR) errors, stemming from inaccuracies in transcribing spoken language to text, significantly hinder the performance of downstream natural language processing tasks. Current research focuses on mitigating these errors through techniques like incorporating ASR confidence scores and phoneme sequences into models, developing error detection and correction mechanisms using sequence-to-sequence models and large language models (LLMs), and employing multimodal fusion to leverage audio information alongside text. Addressing ASR errors is crucial for improving the reliability and accuracy of various applications, including voice assistants, medical transcription, and spoken language understanding systems.
Papers
August 26, 2024
January 24, 2024
January 12, 2024
October 17, 2023
August 15, 2023
June 12, 2023
June 11, 2023
June 9, 2023
May 22, 2023
November 8, 2022
July 22, 2022
March 22, 2022
March 14, 2022
March 1, 2022
February 2, 2022