Image Text
Image-text research focuses on developing models that understand and generate relationships between visual and textual information, aiming to bridge the gap between these modalities. Current research emphasizes improving the robustness and efficiency of vision-language models (VLMs) like CLIP, often through techniques such as prompt engineering, contrastive learning, and specialized datasets for domains like medicine and agriculture. This work is significant because it enables advancements in various applications, including medical image analysis, agricultural monitoring, and improved multimodal large language models (MLLMs), ultimately leading to more accurate and efficient AI systems.
Papers
April 14, 2023
March 23, 2023
March 21, 2023
March 2, 2023
March 1, 2023
January 19, 2023
December 20, 2022
December 13, 2022
November 17, 2022
November 13, 2022
October 18, 2022
September 29, 2022
September 28, 2022
September 12, 2022
August 8, 2022
July 26, 2022
May 4, 2022
April 15, 2022
December 31, 2021