3D Understanding
3D understanding focuses on enabling computers to perceive and interpret three-dimensional scenes and objects, mirroring human spatial reasoning. Current research emphasizes developing robust models that integrate multiple data modalities (point clouds, images, text, even audio) using techniques like multi-modal mixing, contrastive learning, and large language models (LLMs) to improve accuracy and efficiency. This field is crucial for advancements in robotics, autonomous driving, augmented reality, and other applications requiring sophisticated scene understanding, with recent work highlighting the importance of data efficiency and explainability in model development.
Papers
September 1, 2023
July 28, 2023
May 14, 2023
April 12, 2023
December 27, 2022
December 22, 2022
December 10, 2022
September 24, 2022