3D Scene Graph Generation

3D scene graph generation aims to represent 3D scenes as structured graphs, encoding both the objects present and their relationships, enabling richer scene understanding than simple object detection. Current research focuses on improving accuracy and robustness, particularly for complex scenes and fine-grained object distinctions, employing methods like graph neural networks, transformers, and ray tracing to model object interactions and leverage both visual and linguistic information. These advancements are significant for robotics, autonomous navigation, and virtual/augmented reality applications, providing more comprehensive scene interpretation for improved decision-making and interaction.

Papers