Open Vocabulary 3D
Open-vocabulary 3D object detection (OV-3DDet) aims to enable computers to identify and locate 3D objects, even those not seen during training, using diverse data sources like RGB-D images and point clouds. Current research focuses on leveraging pre-trained vision-language models and multi-modal learning techniques, often incorporating strategies like cross-modal alignment and novel object discovery to overcome data scarcity. These advancements are significant for applications in robotics, autonomous navigation, and augmented reality, where robust and adaptable object recognition is crucial.
Papers
October 31, 2024
August 25, 2024
July 7, 2024
June 4, 2024
June 2, 2024
March 28, 2024
March 20, 2024
December 22, 2023
October 4, 2023
September 18, 2023
April 3, 2023
July 5, 2022