Composed Image Retrieval
Composed image retrieval (CIR) aims to retrieve images matching a multimodal query consisting of a reference image and a text description specifying desired modifications. Current research heavily focuses on zero-shot CIR, developing methods that avoid the need for expensive triplet-labeled datasets, often employing techniques like textual inversion, contrastive learning, and large language models to generate synthetic training data or improve feature representation. This field is significant for its potential to enhance image search capabilities beyond keyword-based systems, impacting applications in digital humanities, remote sensing, and e-commerce by enabling more nuanced and flexible image querying.
Papers
December 15, 2024
December 7, 2024
December 4, 2024
November 24, 2024
October 31, 2024
October 22, 2024
October 2, 2024
July 8, 2024
June 13, 2024
May 29, 2024
May 24, 2024
May 5, 2024
May 1, 2024
April 29, 2024
April 23, 2024
April 17, 2024
December 19, 2023
December 4, 2023