Language Image
Language-image research focuses on developing models that effectively bridge the gap between visual and textual information, aiming to improve tasks like image captioning, visual question answering, and image retrieval. Current research emphasizes efficient pre-training methods, often employing transformer-based architectures and contrastive learning, to reduce computational costs and improve robustness to noisy or incomplete data. These advancements are significant because they enable more accurate and efficient multimodal applications, impacting fields ranging from media forensics and document understanding to more general visual analytics and cross-lingual information retrieval.
Papers
June 4, 2024
November 2, 2023
September 28, 2023
May 23, 2023
May 8, 2023
April 14, 2023
February 6, 2023
December 14, 2022
September 14, 2022
August 4, 2022
June 6, 2022
May 23, 2022
May 3, 2022
January 19, 2022
November 29, 2021