Image Text
Image-text research focuses on developing models that understand and generate relationships between visual and textual information, aiming to bridge the gap between these modalities. Current research emphasizes improving the robustness and efficiency of vision-language models (VLMs) like CLIP, often through techniques such as prompt engineering, contrastive learning, and specialized datasets for domains like medicine and agriculture. This work is significant because it enables advancements in various applications, including medical image analysis, agricultural monitoring, and improved multimodal large language models (MLLMs), ultimately leading to more accurate and efficient AI systems.
Papers
May 24, 2023
May 12, 2023
May 9, 2023
April 14, 2023
March 23, 2023
March 21, 2023
March 2, 2023
March 1, 2023
January 19, 2023
December 20, 2022
December 13, 2022
November 17, 2022
November 13, 2022
October 18, 2022
September 29, 2022
September 28, 2022
September 12, 2022
August 8, 2022