Text Normalization
Text normalization standardizes text by converting non-standard forms (such as numerals, abbreviations, and informal spellings) into consistent, canonical representations. Current research focuses on improving normalization accuracy for low-resource languages and infrequent terms, using techniques such as weakly supervised learning, transformer-based language models, and rule-guided neural architectures. These advances improve the performance of downstream natural language processing tasks, including speech recognition, machine translation, and information retrieval, particularly in domains with diverse or historically influenced writing styles.
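To make the idea of mapping non-standard forms to canonical ones concrete, here is a minimal rule-based sketch in Python. It is not the method of any particular paper: the lookup tables, the function names (`spell_number`, `normalize_token`, `normalize`), the lowercasing step, and the restriction to integers below 100 are all assumptions made for illustration. The learned approaches mentioned above would replace or augment hand-written rules like these.

```python
import re

# Hypothetical lookup tables, chosen only for illustration.
ABBREVIATIONS = {"dr.": "doctor", "st.": "street", "approx.": "approximately"}
INFORMAL = {"gonna": "going to", "u": "you", "thx": "thanks"}

ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]


def spell_number(n: int) -> str:
    """Spell out an integer in the range 0..99 (assumed limit of this sketch)."""
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] if ones == 0 else f"{TENS[tens]}-{ONES[ones]}"


def normalize_token(token: str) -> str:
    """Map one whitespace-delimited token to its canonical form."""
    lower = token.lower()  # assumption: the canonical form is lowercase
    if lower in ABBREVIATIONS:
        return ABBREVIATIONS[lower]
    if lower in INFORMAL:
        return INFORMAL[lower]
    # Only one- or two-digit ASCII numerals are verbalized here.
    if re.fullmatch(r"[0-9]{1,2}", lower):
        return spell_number(int(lower))
    return lower


def normalize(text: str) -> str:
    """Normalize a sentence token by token."""
    return " ".join(normalize_token(t) for t in text.split())


print(normalize("Dr. Smith is gonna meet u at 42 Main St."))
```

Run as written, this prints `doctor smith is going to meet you at forty-two main street`. The sketch also shows why purely rule-based normalization is brittle: a token with trailing punctuation such as `St.,` would miss the abbreviation table, and an ambiguous form such as `St.` (street or saint) cannot be resolved without context.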