Learning Based Multi View Stereo
Learning-based multi-view stereo (MVS) aims to reconstruct high-fidelity 3D models from multiple 2D images by leveraging deep learning techniques. Current research emphasizes improving accuracy and completeness, particularly in challenging scenarios like textureless regions and large depth variations, through advancements in model architectures such as transformers, ray-based methods, and novel cost volume aggregation strategies. These improvements are significant for applications requiring precise 3D scene understanding, including augmented/virtual reality, autonomous driving, and robotics.
Papers
TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers
Yikang Ding, Wentao Yuan, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao Liu
IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary Decisions
Christian Sormann, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer