3D Object Detection
3D object detection aims to accurately identify and locate objects within three-dimensional space, primarily using sensor data like LiDAR and cameras. Current research emphasizes improving accuracy and efficiency through advanced model architectures such as PointPillars, transformers, and Gaussian splatting, often incorporating multimodal fusion techniques and active learning strategies to reduce annotation costs. This field is crucial for autonomous driving, robotics, and augmented reality, with ongoing efforts focused on enhancing robustness, generalization across diverse datasets, and reducing computational demands for real-time applications.
Papers
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
Chenfeng Xu, Huan Ling, Sanja Fidler, Or Litany
mmFUSION: Multimodal Fusion for 3D Objects Detection
Javed Ahmad, Alessio Del Bue
3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion
Xinhao Xiang, Simon Dräger, Jiawei Zhang
FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision Transformer Fusion
Xinhao Xiang, Jiawei Zhang
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, Qibo Qiu, Binbin Lin, Xiaofei He, Wanli Ouyang
GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection
Ziying Song, Haiyue Wei, Lin Bai, Lei Yang, Caiyan Jia