Multilingual Vision
Multilingual vision research aims to develop artificial intelligence systems that can understand and interact with visual information across multiple languages. Current efforts focus on adapting existing vision-language models, like CLIP, to multilingual contexts, often employing techniques such as knowledge distillation, continual learning, and contrastive learning to improve efficiency and cross-lingual generalization. These advancements are crucial for bridging the language gap in multimodal AI, enabling applications such as improved image retrieval, visual question answering, and broader accessibility of AI-powered tools across diverse linguistic communities.
Papers
October 30, 2024
October 2, 2024
April 17, 2024
February 24, 2024
January 30, 2024
October 19, 2023
July 13, 2023
June 29, 2023
May 29, 2023
May 13, 2023
February 26, 2023
September 7, 2022