RGB D Data
RGB-D data, combining color (RGB) and depth information, fuels advancements in 3D scene understanding and object manipulation. Current research emphasizes developing robust and efficient algorithms, often employing deep learning architectures like convolutional neural networks and transformers, to perform tasks such as 3D reconstruction, object pose estimation, and semantic segmentation from this rich data source. These improvements are driving progress in robotics, augmented reality, and medical applications, particularly in areas requiring precise 3D perception and interaction in complex environments. The development of new, diverse datasets is also a key focus, enabling the training and evaluation of more generalizable models.
Papers
Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data
Patrick Ruhkamp, Daoyi Gao, HyunJun Jung, Nassir Navab, Benjamin Busam
In-Rack Test Tube Pose Estimation Using RGB-D Data
Hao Chen, Weiwei Wan, Masaki Matsushita, Takeyuki Kotaka, Kensuke Harada