State Object Composition

State object composition research focuses on enabling artificial intelligence systems to understand and reason about scenes by decomposing them into constituent objects and their associated states (e.g., a "rusty car"). Current efforts concentrate on developing models that can accurately recognize and generate novel combinations of objects and states, even those unseen during training, often leveraging vision-language models, large language models, or knowledge graphs to improve generalization and handle the vast space of possible compositions. This work is significant for advancing artificial intelligence capabilities in areas such as image recognition, robotic manipulation, and video understanding, ultimately leading to more robust and adaptable AI systems.

Papers