Monolingual Data
Monolingual data, meaning text or speech in a single language, is central to advancing natural language processing (NLP), particularly for low-resource languages that lack extensive parallel corpora. Current research leverages monolingual data through techniques such as back-translation, denoising autoencoders, and self-supervised learning to improve multilingual machine translation, speech-to-speech translation, and other NLP tasks. This work matters because it addresses the data scarcity that hinders progress in many languages, enabling more inclusive and widely applicable NLP technologies.
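To make two of the techniques named above concrete, here is a minimal Python sketch of how monolingual sentences can be turned into training pairs. It is an illustration under stated assumptions, not the method of any particular paper: the function names (`add_noise`, `denoising_pairs`, `back_translation_pairs`), the `<mask>` token, and the `reverse_translate` callable are all hypothetical, and the real target-to-source translation model is replaced here by a stand-in function.

```python
import random
from typing import Callable, List, Tuple


def add_noise(tokens: List[str], rng: random.Random,
              mask_prob: float = 0.15, mask_token: str = "<mask>") -> List[str]:
    """Replace each token with mask_token with probability mask_prob."""
    return [mask_token if rng.random() < mask_prob else tok for tok in tokens]


def denoising_pairs(sentences: List[str], rng: random.Random,
                    mask_prob: float = 0.15) -> List[Tuple[str, str]]:
    """Build (noisy input, clean target) pairs for a denoising objective."""
    pairs = []
    for sentence in sentences:
        noisy = " ".join(add_noise(sentence.split(), rng, mask_prob))
        pairs.append((noisy, sentence))
    return pairs


def back_translation_pairs(target_sentences: List[str],
                           reverse_translate: Callable[[str], str]
                           ) -> List[Tuple[str, str]]:
    """Pair each target-language sentence with a synthetic source sentence.

    reverse_translate is assumed to be a target-to-source translation
    function; in practice it would wrap a trained model.
    """
    return [(reverse_translate(t), t) for t in target_sentences]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the noise is reproducible
    mono = ["the cat sat on the mat", "monolingual text is plentiful"]

    for noisy, clean in denoising_pairs(mono, rng, mask_prob=0.3):
        print(noisy, "->", clean)

    # Stand-in for a real reverse model (assumption): uppercases the input.
    for source, target in back_translation_pairs(mono, lambda t: t.upper()):
        print(source, "->", target)
```

In both cases the clean monolingual sentence serves as the training target, which is why no human-translated parallel data is needed; only the input side is synthetic.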