Text-to-3D Generation
Text-to-3D generation aims to create three-dimensional models from textual descriptions, bridging the gap between natural language and 3D content creation. Current research relies heavily on diffusion models, often using Score Distillation Sampling (SDS) to optimize a 3D representation such as a neural radiance field, a set of 3D Gaussians (Gaussian splatting), or a mesh, with the goal of producing high-fidelity 3D objects. These advances improve the realism, detail, and efficiency of 3D model generation, and they benefit computer graphics, animation, and virtual/augmented reality by enabling faster and more intuitive content creation pipelines. Ongoing work targets geometric consistency, consistency across views, and efficient generation of complex scenes.
Papers
PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion
Ying-Tian Liu, Yuan-Chen Guo, Guan Luo, Heyi Sun, Wei Yin, Song-Hai Zhang
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
Zexiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang, Wanli Ouyang
ControlDreamer: Blending Geometry and Style in Text-to-3D
Yeongtak Oh, Jooyoung Choi, Yongsung Kim, Minjun Park, Chaehun Shin, Sungroh Yoon
StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D
Pengsheng Guo, Hans Hao, Adam Caccavale, Zhongzheng Ren, Edward Zhang, Qi Shan, Aditya Sankar, Alexander G. Schwing, Alex Colburn, Fangchang Ma