Multi View Image
Multi-view image processing focuses on leveraging information from multiple viewpoints to achieve more robust and complete scene understanding than single-view methods. Current research heavily emphasizes using this data for 3D reconstruction, often employing neural radiance fields (NeRFs), diffusion models, and Gaussian splatting techniques to generate novel views, create 3D models from 2D images, and improve tasks like object detection and image editing. These advancements have significant implications for various fields, including robotics, autonomous driving, and 3D modeling, by enabling more accurate and efficient scene representation and manipulation. The development of robust and generalizable methods for handling occlusions, varying weather conditions, and limited data remains a key challenge.
Papers
CRAYM: Neural Field Optimization via Camera RAY Matching
Liqiang Lin, Wenpeng Wu, Chi-Wing Fu, Hao Zhang, Hui Huang
MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
Xiaoguang Han, Yushuang Wu, Luyue Shi, Haolin Liu, Hongjie Liao, Lingteng Qiu, Weihao Yuan, Xiaodong Gu, Zilong Dong, Shuguang Cui
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge
Bin Huang, Siyu Wang, Yuanpeng Chen, Yidan Wu, Hui Song, Zifan Ding, Jing Leng, Chengpeng Liang, Peng Xue, Junliang Zhang, Tiankun Zhao
CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval
Xin Wen, Xuening Zhu, Renjiao Yi, Zhifeng Wang, Chenyang Zhu, Kai Xu