Monolingual Data
Monolingual data is text or speech in a single language, with no aligned translation in another language. It is central to natural language processing (NLP) for low-resource languages, which lack extensive parallel corpora. Current research uses monolingual data through techniques such as back-translation, denoising autoencoders, and self-supervised learning to improve multilingual machine translation, speech-to-speech translation, and other NLP tasks. This work matters because data scarcity holds back progress in many languages, and reducing the dependence on parallel data makes NLP technology applicable to more of them.
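As a rough illustration of back-translation, the sketch below pairs each monolingual target-language sentence with a synthetic source sentence produced by a target-to-source translator. The function names, the toy lexicon, and the German-to-English example are assumptions made for illustration only. In practice the reverse translator would be a trained model, not a word lookup.

```python
from typing import Callable, Iterable, List, Tuple


def back_translate(
    target_sentences: Iterable[str],
    reverse_translate: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Build synthetic (source, target) pairs from monolingual target text.

    The target side is genuine human-written text; only the source side
    is machine-generated by the reverse (target-to-source) translator.
    """
    pairs = []
    for target in target_sentences:
        synthetic_source = reverse_translate(target)
        pairs.append((synthetic_source, target))
    return pairs


# Hypothetical stand-in for a trained German-to-English model:
# a word-by-word lookup that leaves unknown tokens unchanged.
TOY_LEXICON = {"hallo": "hello", "welt": "world"}


def toy_reverse_translate(sentence: str) -> str:
    tokens = sentence.lower().split()
    return " ".join(TOY_LEXICON.get(tok, tok) for tok in tokens)


# Monolingual German text becomes synthetic English-to-German training data.
synthetic_pairs = back_translate(["Hallo Welt"], toy_reverse_translate)
print(synthetic_pairs)
```

The resulting pairs would typically be mixed with whatever genuine parallel data exists when training the source-to-target model.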