Composed Image Retrieval
Composed image retrieval (CIR) aims to retrieve images matching a multimodal query consisting of a reference image and a text description specifying desired modifications. Current research heavily focuses on zero-shot CIR, developing methods that avoid the need for expensive triplet-labeled datasets, often employing techniques like textual inversion, contrastive learning, and large language models to generate synthetic training data or improve feature representation. This field is significant for its potential to enhance image search capabilities beyond keyword-based systems, impacting applications in digital humanities, remote sensing, and e-commerce by enabling more nuanced and flexible image querying.
Papers
October 31, 2024
October 22, 2024
October 2, 2024
July 8, 2024
June 13, 2024
May 29, 2024
May 24, 2024
May 5, 2024
May 1, 2024
April 29, 2024
April 23, 2024
April 17, 2024
December 19, 2023
December 4, 2023
November 13, 2023
October 13, 2023
October 2, 2023
September 28, 2023