Multi View Image
Multi-view image processing focuses on leveraging information from multiple viewpoints to achieve more robust and complete scene understanding than single-view methods. Current research heavily emphasizes using this data for 3D reconstruction, often employing neural radiance fields (NeRFs), diffusion models, and Gaussian splatting techniques to generate novel views, create 3D models from 2D images, and improve tasks like object detection and image editing. These advancements have significant implications for various fields, including robotics, autonomous driving, and 3D modeling, by enabling more accurate and efficient scene representation and manipulation. The development of robust and generalizable methods for handling occlusions, varying weather conditions, and limited data remains a key challenge.
Papers
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge
Bin Huang, Siyu Wang, Yuanpeng Chen, Yidan Wu, Hui Song, Zifan Ding, Jing Leng, Chengpeng Liang, Peng Xue, Junliang Zhang, Tiankun Zhao
CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval
Xin Wen, Xuening Zhu, Renjiao Yi, Zhifeng Wang, Chenyang Zhu, Kai Xu