View Translation
View translation encompasses the automated conversion of information between modalities (e.g., text, speech, images, sign language) and between languages, with the aim of bridging communication gaps across diverse forms of expression. Current research focuses on improving translation accuracy and efficiency with large language models (LLMs), exploring techniques such as contrastive preference optimization, attention mechanism refinements, and multi-source pivoting, often within architectures such as Transformers and Conformers. The field is important for advancing multilingual natural language processing, broadening access to information, and facilitating cross-cultural communication in applications including healthcare, education, and cybersecurity.
Papers
Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning
Jesher Joshua M, Ragav V, Syed Ibrahim S P
Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
Michael Saxon, Yiran Luo, Sharon Levy, Chitta Baral, Yezhou Yang, William Yang Wang