3D Space

Research on 3D space focuses on developing methods for robustly representing, understanding, and interacting with three-dimensional environments using various data modalities like point clouds, neural radiance fields (NeRFs), and images. Current efforts concentrate on improving the integration of 3D data with large language models (LLMs) for tasks such as scene understanding, question answering, and object manipulation, often employing novel architectures like hypernetworks and transformers to enhance spatial reasoning and alignment between visual and textual information. This work is significant for advancing fields like autonomous driving, robotics, and augmented reality, enabling more sophisticated and context-aware interactions with the physical world.

Papers