Image Text Retrieval
Image-text retrieval (ITR) aims to find the most relevant images for a given text query, and vice versa, bridging the semantic gap between visual and textual data. Current research emphasizes improving the accuracy and efficiency of ITR, focusing on advancements in vision-language models (VLMs) like CLIP and its variants, exploring techniques such as contrastive learning, fine-grained alignment, and efficient model architectures (e.g., dual-stream, lightweight models). The field is significant for its applications in various domains, including multimedia search, medical image analysis, and remote sensing, driving improvements in information retrieval and cross-modal understanding.
Papers
March 16, 2024
March 8, 2024
November 23, 2023
November 3, 2023
October 30, 2023
October 12, 2023
October 9, 2023
October 3, 2023
September 4, 2023
August 27, 2023
June 15, 2023
June 11, 2023
May 28, 2023
May 26, 2023
April 25, 2023
April 21, 2023
April 20, 2023