3D Parsing
3D parsing aims to decompose a scene into its constituent parts, representing both their semantic labels and geometric properties in three dimensions. Current research focuses on robust 3D shape reconstruction from data sources such as single images and point clouds. These methods typically employ neural networks, including convolutional neural networks (CNNs) and transformers, together with representations and techniques such as signed distance functions (SDFs) and mesh processing. These advances are driving progress in applications ranging from autonomous driving and robotics to medical image analysis and 3D modeling, enabling more sophisticated scene understanding and interaction. The field is also exploring efficient and unsupervised learning approaches to handle large-scale, unannotated datasets.
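As a minimal sketch of the SDF-plus-mesh pipeline mentioned above: an SDF assigns each 3D point its signed distance to a surface (negative inside, positive outside), and the zero level set can be converted into a triangle mesh with marching cubes. The analytic sphere SDF, grid resolution, and bounds below are illustrative assumptions, not taken from any specific method in this section; in learned approaches the analytic function would be replaced by a coordinate network that predicts the signed distance.

```python
import numpy as np
from skimage import measure

def sphere_sdf(points, radius=1.0):
    # Signed distance to a sphere centered at the origin:
    # negative inside the surface, positive outside.
    return np.linalg.norm(points, axis=-1) - radius

# Sample the SDF on a regular 3D grid (resolution and bounds are arbitrary choices).
res = 64
lo, hi = -1.5, 1.5
lin = np.linspace(lo, hi, res)
xs, ys, zs = np.meshgrid(lin, lin, lin, indexing="ij")
grid = np.stack([xs, ys, zs], axis=-1)      # shape (res, res, res, 3)
sdf_values = sphere_sdf(grid)               # shape (res, res, res)

# Extract the zero level set as a triangle mesh via marching cubes.
spacing = (lin[1] - lin[0],) * 3
verts, faces, normals, _ = measure.marching_cubes(sdf_values, level=0.0, spacing=spacing)
verts += np.array([lo, lo, lo])             # shift vertices back into world coordinates

print(f"Reconstructed mesh: {len(verts)} vertices, {len(faces)} triangles")
```

The same extraction step applies unchanged when the SDF values come from a trained network evaluated on the grid, which is how implicit reconstructions are typically converted into explicit meshes for downstream use.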