Arbitrary Object
Arbitrary object processing in computer vision aims to develop algorithms capable of understanding, manipulating, and reasoning about objects of any type, regardless of prior knowledge or training data. Current research focuses on developing robust models, often leveraging transformer architectures and diffusion models, to achieve accurate object detection, segmentation, pose estimation, and manipulation in diverse and complex scenes, including those with occlusions and interactions between multiple objects. These advancements are crucial for progress in robotics, autonomous systems, and augmented/virtual reality applications, enabling more flexible and adaptable interactions with the physical world. Furthermore, the development of efficient and generalizable methods for arbitrary object processing is driving innovation in self-supervised learning and knowledge distillation techniques.
Papers
Generative Design of Physical Objects using Modular Framework
Nikita O. Starodubcev, Nikolay O. Nikitin, Konstantin G. Gavaza, Elizaveta A. Andronova, Denis O. Sidorenko, Anna V. Kalyuzhnaya
FCSN: Global Context Aware Segmentation by Learning the Fourier Coefficients of Objects in Medical Images
Young Seok Jeon, Hongfei Yang, Mengling Feng
Progress and limitations of deep networks to recognize objects in unusual poses
Amro Abbas, Stéphane Deny
You Should Look at All Objects
Zhenchao Jin, Dongdong Yu, Luchuan Song, Zehuan Yuan, Lequan Yu
TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance
Hongtao Wen, Jianhang Yan, Wanli Peng, Yi Sun
Mechanical Search on Shelves with Efficient Stacking and Destacking of Objects
Huang Huang, Letian Fu, Michael Danielczuk, Chung Min Kim, Zachary Tam, Jeffrey Ichnowski, Anelia Angelova, Brian Ichter, Ken Goldberg
3D Part Assembly Generation with Instance Encoded Transformer
Rufeng Zhang, Tao Kong, Weihao Wang, Xuan Han, Mingyu You