Parallel Text
Parallel text, a collection of texts in two or more languages that are translations of one another, is crucial for machine translation and cross-lingual natural language processing. Current research aims to acquire parallel text more efficiently through smart crawling techniques and data augmentation methods. A second strand of work optimizes the parallel processing of large language models, using techniques such as tensor parallelism and kernel fusion to accelerate training and inference. These advances help bridge language barriers and improve the performance of NLP applications, particularly for low-resource languages, where parallel data is scarce.