Composed Image Retrieval
Composed image retrieval (CIR) aims to retrieve images matching a multimodal query consisting of a reference image and a text description specifying desired modifications. Current research heavily focuses on zero-shot CIR, developing methods that avoid the need for expensive triplet-labeled datasets, often employing techniques like textual inversion, contrastive learning, and large language models to generate synthetic training data or improve feature representation. This field is significant for its potential to enhance image search capabilities beyond keyword-based systems, impacting applications in digital humanities, remote sensing, and e-commerce by enabling more nuanced and flexible image querying.
Papers
September 5, 2023
August 28, 2023
August 22, 2023
May 25, 2023
May 5, 2023
March 29, 2023
March 27, 2023
March 21, 2023
March 16, 2023
February 6, 2023