Generative Error Correction
Generative error correction (GEC) uses large language models (LLMs) to improve the accuracy of automatic speech recognition (ASR) systems by refining their initial transcriptions. Current research focuses on improving GEC performance through multi-pass processing of ASR hypotheses, the incorporation of multimodal information (e.g., lip movements, acoustic features), and better prompt engineering for LLMs. The aim is ASR that remains robust and accurate across diverse languages and challenging acoustic conditions, which in turn supports more effective speech-based interfaces and applications.
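As a rough illustration of the text-only case described above, the sketch below formats a list of ASR hypotheses into a correction prompt and passes it to a language model. It is a minimal sketch under stated assumptions, not any particular paper's method: the function names `build_gec_prompt` and `correct_transcription`, the prompt wording, and the `llm` callable (any function that maps a prompt string to a completion string) are all hypothetical.

```python
from typing import Callable, Sequence


def build_gec_prompt(hypotheses: Sequence[str]) -> str:
    """Format ASR hypotheses into a correction prompt.

    The prompt wording is illustrative only (an assumption of this sketch).
    """
    numbered = "\n".join(
        f"{i}. {text}" for i, text in enumerate(hypotheses, start=1)
    )
    return (
        "The following are candidate transcriptions of the same utterance "
        "produced by a speech recognizer. "
        "Write the most likely correct transcription.\n"
        f"{numbered}\n"
        "Corrected transcription:"
    )


def correct_transcription(
    hypotheses: Sequence[str],
    llm: Callable[[str], str],
) -> str:
    """Ask the model for a corrected transcription of the hypotheses.

    `llm` is a hypothetical stand-in for whatever model client is in use.
    """
    if not hypotheses:
        raise ValueError("at least one hypothesis is required")
    prompt = build_gec_prompt(hypotheses)
    corrected = llm(prompt).strip()
    # Fall back to the first hypothesis if the model returns nothing usable.
    return corrected or hypotheses[0]
```

Keeping the model behind a plain callable means the same prompt-building logic can be reused across different LLM backends, and it can be exercised with a stub function in place of a real model.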
Papers
October 17, 2024
September 20, 2024
September 15, 2024
August 29, 2024
July 23, 2024
June 6, 2024
May 16, 2024
May 6, 2024
February 8, 2024
October 17, 2023
October 10, 2023
September 27, 2023