Automatic Speech Recognition Hypothesis
Automatic speech recognition (ASR) hypothesis research focuses on improving the accuracy and robustness of speech-to-text transcriptions, primarily by addressing errors in recognizing infrequent words or noisy audio. Current efforts leverage large language models (LLMs) for tasks like rescoring N-best ASR hypotheses, correcting errors using retrieval-augmented generation or conservative data filtering, and improving confidence estimation. These advancements are significant because they enhance the reliability of ASR systems across various applications, from voice assistants and speech emotion recognition to spoken language understanding, ultimately leading to more natural and effective human-computer interaction.
Papers
September 9, 2024
August 30, 2024
July 18, 2024
June 29, 2024
June 27, 2024
January 24, 2024
January 20, 2024
January 8, 2024
December 22, 2023
December 20, 2023
October 17, 2023
September 29, 2023
September 18, 2023
July 22, 2023
June 23, 2023
June 21, 2023
May 23, 2023
April 21, 2023
March 27, 2023