Object Level

Object-level understanding in computer vision aims to represent and reason about individual objects within scenes, moving beyond simple object detection to encompass their properties, relationships, and interactions. Current research heavily utilizes transformer-based architectures, often incorporating multi-modal learning (combining visual and textual data) and leveraging techniques like knowledge distillation and contrastive learning to improve model performance and generalization. This focus on object-centric representation is crucial for advancing applications such as autonomous driving, robotics, and image understanding, enabling more robust and context-aware systems.

Papers

May 26, 2022

Learning What and Where: Disentangling Location and Identity Tracking Without Supervision
Manuel Traub, Sebastian Otte, Tobias Menge, Matthias Karlbauer, Jannik Thümmel, Martin V. Butz
LeArning Abstract Vision Based Object Representation Predictive Coding Object Level Location Information Supervised Localization

May 23, 2022

Meta-Learning Regrasping Strategies for Physical-Agnostic Objects
Ning Gao, Jingyu Zhang, Ruijie Chen, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann
Meta Learning Object Level Accurate Grasping Physic Embedded

May 11, 2022

Identifying concept libraries from language about object structure
Catherine Wong, William P. McCarthy, Gabriel Grand, Yoni Friedman, Joshua B. Tenenbaum, Jacob Andreas, Robert D. Hawkins, Judith E. Fan
Human Language Natural Language Description High Impact Concept Different PaRT Object Level Program Representation Naturalistic Word Order

May 9, 2022

Multi-Fingered In-Hand Manipulation with Various Object Properties Using Graph Convolutional Networks and Distributed Tactile Sensors
Satoshi Funabashi, Tomoki Isobe, Fei Hongyi, Atsumu Hiramoto, Alexander Schmitz, Shigeki Sugano, Tetsuya Ogata
Tactile Sensor Dexterous Manipulation Object Level Tactile Data Robot Hand Multi Fingered Hand

January 17, 2022

Disentangled Latent Transformer for Interpretable Monocular Height Estimation
Zhitong Xiong, Sining Chen, Yilei Shi, Xiao Xiang Zhu
Semantic Segmentation Object Level Height Estimation Joint Semantic Disentangled Transformer

December 28, 2021

Robotic Perception of Object Properties using Tactile Sensing
Jiaqi Jiang, Shan Luo
Robotic Grasping Tactile Sensor Robot Perception Tactile Sensing Object Level Tactile Perception

December 1, 2021

Pose2Room: Understanding 3D Scenes from Human Activities
Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner
3D Scene Gaussian Mixture Stable Pose Object Level Trajectory Pattern Probabilistic 3D