Zero Shot Composed Image Retrieval
Zero-shot composed image retrieval (ZS-CIR) aims to retrieve images based on a combined query of a reference image and modifying text, without requiring training data specifically labeled for this task. Current research focuses on developing training-free or weakly-supervised methods, often employing techniques like multimodal large language models, projection modules mapping images to text embeddings, and diffusion models, to effectively fuse image and text information for accurate retrieval. This field is significant because it addresses the high cost of creating labeled datasets for traditional composed image retrieval, potentially enabling more efficient and scalable image search applications across various domains.
Papers
October 22, 2024
September 7, 2024
September 6, 2024
June 27, 2024
June 13, 2024
May 1, 2024
March 24, 2024
March 3, 2024
December 14, 2023
December 4, 2023
November 25, 2023
September 30, 2023
June 12, 2023
March 21, 2023