Object Representation
Object representation in artificial intelligence focuses on developing computational models that can effectively encode and utilize information about objects within scenes, mirroring human-like object understanding. Current research emphasizes learning compositional and disentangled representations using architectures like transformers, graph neural networks, and variational autoencoders, often incorporating self-supervised or contrastive learning methods to improve generalization. These advancements are crucial for improving performance in various applications, including robotic manipulation, visual reasoning, and scene understanding, by enabling more robust and flexible interaction with the visual world.
Papers
Category-Agnostic 6D Pose Estimation with Conditional Neural Processes
Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann
Object Scene Representation Transformer
Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas Kipf
Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
Gautam Singh, Yi-Fu Wu, Sungjin Ahn
Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks
Paolo Muratore, Sina Tafazoli, Eugenio Piasini, Alessandro Laio, Davide Zoccolan