Machine Translation
Machine translation (MT) is the automatic translation of text between languages. Current research focuses heavily on large language models (LLMs) and on comparing architectures such as encoder-decoder and decoder-only models. Key areas of investigation include improving translation quality, particularly for low-resource languages and specialized domains such as medicine; mitigating biases such as gender bias; and developing evaluation methods that go beyond simple correlation with human judgments. These advances bear on cross-cultural communication, information access, and the development of more equitable and effective multilingual technologies.
Papers
The Impact of Syntactic and Semantic Proximity on Machine Translation with Back-Translation
Nicolas Guerin, Shane Steinert-Threlkeld, Emmanuel Chemla
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, Linzheng Chai, Liqun Yang, Zhoujun Li
Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction
Masamune Kobayashi, Masato Mita, Mamoru Komachi
Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
Shannon Wotherspoon, William Hartmann, Matthew Snover
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
Kartik Kartik, Sanjana Soni, Anoop Kunchukuttan, Tanmoy Chakraborty, Md Shad Akhtar
Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems
Bo-Han Lu, Yi-Hsuan Lin, En-Shiun Annie Lee, Richard Tzong-Han Tsai
Investigating Markers and Drivers of Gender Bias in Machine Translations
Peter J Barclay, Ashkan Sami
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
Jiaxin Guo, Hao Yang, Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen
Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks
Zhifan Sun, Antonio Valerio Miceli-Barone
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
Jiahuan Li, Shanbo Cheng, Shujian Huang, Jiajun Chen
Did Translation Models Get More Robust Without Anyone Even Noticing?
Ben Peters, André F. T. Martins
gaHealth: An English-Irish Bilingual Corpus of Health Data
Séamus Lankford, Haithem Afli, Órla Ní Loinsigh, Andy Way
BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Carinne Cherf, Yuval Pinter