Arbitrary Object
Arbitrary object processing in computer vision aims to develop algorithms capable of understanding, manipulating, and reasoning about objects of any type, regardless of prior knowledge or training data. Current research focuses on developing robust models, often leveraging transformer architectures and diffusion models, to achieve accurate object detection, segmentation, pose estimation, and manipulation in diverse and complex scenes, including those with occlusions and interactions between multiple objects. These advancements are crucial for progress in robotics, autonomous systems, and augmented/virtual reality applications, enabling more flexible and adaptable interactions with the physical world. Furthermore, the development of efficient and generalizable methods for arbitrary object processing is driving innovation in self-supervised learning and knowledge distillation techniques.
Papers
Self-Supervised Learning of Object Parts for Semantic Segmentation
Adrian Ziegler, Yuki M. Asano
Can deep learning match the efficiency of human visual long-term memory in storing object details?
A. Emin Orhan
Binding Actions to Objects in World Models
Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong, Thomas Kipf
VAE-iForest: Auto-encoding Reconstruction and Isolation-based Anomalies Detecting Fallen Objects on Road Surface
Takato Yasuno, Junichiro Fujii, Riku Ogata, Masahiro Okano
3D object reconstruction and 6D-pose estimation from 2D shape for robotic grasping of objects
Marcell Wolnitza, Osman Kaya, Tomas Kulvicius, Florentin Wörgötter, Babette Dellen