Arbitrary Object
Arbitrary object processing in computer vision aims to develop algorithms capable of understanding, manipulating, and reasoning about objects of any type, regardless of prior knowledge or training data. Current research focuses on developing robust models, often leveraging transformer architectures and diffusion models, to achieve accurate object detection, segmentation, pose estimation, and manipulation in diverse and complex scenes, including those with occlusions and interactions between multiple objects. These advancements are crucial for progress in robotics, autonomous systems, and augmented/virtual reality applications, enabling more flexible and adaptable interactions with the physical world. Furthermore, the development of efficient and generalizable methods for arbitrary object processing is driving innovation in self-supervised learning and knowledge distillation techniques.
Papers
Totems: Physical Objects for Verifying Visual Integrity
Jingwei Ma, Lucy Chai, Minyoung Huh, Tongzhou Wang, Ser-Nam Lim, Phillip Isola, Antonio Torralba
Impact-Friendly Object Catching at Non-Zero Velocity Based on Combined Optimization and Learning
Jianzhuang Zhao, Gustavo J. G. Lahr, Francesco Tassi, Alessandro Santopaolo, Elena De Momi, Arash Ajoudani
InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction
Yinghao Huang, Omid Tehari, Michael J. Black, Dimitrios Tzionas
OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM
Matthieu Zins, Gilles Simon, Marie-Odile Berger
Fast, Accurate and Object Boundary-Aware Surface Normal Estimation from Depth Maps
Saed Moradi, Alireza Memarmoghadam, Denis Laurendeau
Understanding the Impact of Image Quality and Distance of Objects to Object Detection Performance
Yu Hao, Haoyang Pei, Yixuan Lyu, Zhongzheng Yuan, John-Ross Rizzo, Yao Wang, Yi Fang