Language Model Rescoring
Language model rescoring enhances automatic speech recognition (ASR) by using large language models (LLMs) to re-evaluate the initial ASR output, improving accuracy. Current research focuses on leveraging various LLMs, including instruction-tuned and multi-modal models, often employing techniques like lattice rescoring or n-best list reranking to refine transcriptions. This approach yields significant word error rate reductions across diverse ASR tasks and datasets, impacting the development of more accurate and robust speech technologies for applications like virtual assistants and voice search.
Papers
September 25, 2024
August 30, 2024
June 27, 2024
June 12, 2024
March 28, 2024
March 16, 2024
December 20, 2023
November 2, 2023
October 14, 2023
September 26, 2023
September 19, 2023
July 13, 2023
June 13, 2023
March 27, 2023
June 26, 2022
April 1, 2022