Image Text
Image-text research focuses on developing models that understand and generate relationships between visual and textual information, aiming to bridge the gap between these modalities. Current research emphasizes improving the robustness and efficiency of vision-language models (VLMs) like CLIP, often through techniques such as prompt engineering, contrastive learning, and specialized datasets for domains like medicine and agriculture. This work is significant because it enables advancements in various applications, including medical image analysis, agricultural monitoring, and improved multimodal large language models (MLLMs), ultimately leading to more accurate and efficient AI systems.
Papers
December 20, 2023
December 14, 2023
December 11, 2023
November 30, 2023
November 28, 2023
November 23, 2023
November 1, 2023
October 7, 2023
October 5, 2023
September 27, 2023
August 3, 2023
July 28, 2023
July 22, 2023
July 15, 2023
June 14, 2023
June 3, 2023
June 1, 2023
May 24, 2023
May 12, 2023