Unseen Composition

Unseen composition research aims to enable AI systems to recognize and understand novel combinations of visual concepts, such as an object paired with a state, that never appeared together during training. Current work concentrates on adapting large pre-trained vision-language models, using prompt tuning, attention mechanisms, and contrastive learning to improve generalization to unseen compositions. This matters because a model that handles familiar combinations can still degrade on new pairings of the same concepts; closing that gap would make computer vision systems more robust and adaptable, with applications such as image captioning and object recognition.
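
To make "unseen composition" concrete, here is a minimal sketch of how a state-object label space might be split into seen and unseen pairs. The vocabulary, the `seen` set, and both function names are hypothetical illustrations and are not taken from any paper listed below.

```python
from itertools import product

# Hypothetical toy vocabulary (assumption, for illustration only).
states = ["wet", "dry", "sliced"]
objects = ["apple", "dog", "road"]

# Compositions assumed to appear in the training data.
seen = {("wet", "dog"), ("dry", "road"), ("sliced", "apple"), ("wet", "road")}


def unseen_compositions(states, objects, seen):
    """Return state-object pairs built from known parts but absent from `seen`."""
    return sorted(set(product(states, objects)) - set(seen))


def split_by_novelty(labels, seen):
    """Partition a list of test labels into (seen, unseen) compositions."""
    seen_part = [pair for pair in labels if pair in seen]
    unseen_part = [pair for pair in labels if pair not in seen]
    return seen_part, unseen_part


unseen = unseen_compositions(states, objects, seen)
print(len(unseen), unseen)
```

Every state and every object here occurs in at least one training pair, so the unseen set contains only new pairings of familiar parts, which is the setting the summary above describes.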

Papers

November 25, 2024

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions (Full Thesis)
Tim Hartill
Large Language Model Yes No Question Reasoning System Larger Language Model Smaller Language Model Context Dependent Question Electronic Thesis Free Text Rationale Unseen Composition

July 18, 2024

Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning
Ans Munir, Faisal Z. Qureshi, Muhammad Haris Khan, Mohsen Ali
Human Attention Compositional Zero Shot Learning Simple Primitive Attribute Value Pair Unseen Composition

June 2, 2024

OLIVE: Object Level In-Context Visual Embeddings
Timothy Ossowski, Junjie Hu
Fine Grained Vision Language Model Object Representation Object Embeddings Unseen Composition

December 2, 2023

Prompt Tuning for Zero-shot Compositional Learning
Lingyu Zhang, Ting Hua, Yilin Shen, Hongxia Jin
Prompt Tuning Pre Trained Vision Language Model Compositional Zero Shot Learning Challenge Dataset Unseen Composition

May 23, 2023

Prompting Language-Informed Distribution for Compositional Zero-Shot Learning
Wentao Bao, Lichang Chen, Heng Huang, Yu Kong
Zero Shot Zero Shot Learning Compositional Zero Shot Learning Unseen Composition

October 20, 2022

Learning Attention Propagation for Compositional Zero-Shot Learning
Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal
Zero Shot Learning Compositional Zero Shot Learning Attention Based Network Unseen Composition

September 30, 2022

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva
Image Captioning Retrieval Augmentation Unseen Composition

June 29, 2022

Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
Xiangyu Li, Xu Yang, Kun Wei, Cheng Deng, Muli Yang
Compositional Zero Shot Learning Unseen Composition Contrastive Siamese

December 12, 2021

Contextualized Scene Imagination for Generative Commonsense Reasoning
PeiFeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren
Implicit Knowledge Semantic Scene Graph Generative CommonSense Reasoning Unseen Composition Story Generation Task