Inverse Text Normalization
Inverse text normalization (ITN) converts spoken-form text, often produced by automatic speech recognition (ASR) systems, into its written-form equivalent. Current research focuses on improving ITN accuracy and robustness, particularly for low-resource languages and ASR-generated text, employing neural models like transformers and leveraging techniques such as data augmentation and semi-supervised learning to address data scarcity and out-of-domain issues. These advancements are crucial for enhancing the usability and downstream processing of ASR outputs, impacting various applications including improved user experience in voice assistants and more effective natural language processing pipelines.
Papers
August 1, 2024
September 12, 2023
January 20, 2023
November 7, 2022
October 26, 2022
July 29, 2022
July 20, 2022
March 31, 2022