Cross View
Cross-view research focuses on bridging the significant visual discrepancies between images captured from different viewpoints, primarily aiming to improve the accuracy and robustness of tasks like geolocalization, scene understanding, and 3D reconstruction. Current research heavily utilizes deep learning models, including transformers, autoencoders, and diffusion models, often incorporating techniques like contrastive learning, bird's-eye-view transformations, and geometric constraints to align and fuse information across views. This field is crucial for advancing autonomous navigation, remote sensing, and human-computer interaction applications by enabling more reliable and efficient processing of multi-perspective data.
Papers
Unpaired Multi-View Graph Clustering with Cross-View Structure Matching
Yi Wen, Siwei Wang, Qing Liao, Weixuan Liang, Ke Liang, Xinhang Wan, Xinwang Liu
Beyond Geo-localization: Fine-grained Orientation of Street-view Images by Cross-view Matching with Satellite Imagery with Supplementary Materials
Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Yifang Yin, Andrei Georgescu, An Tran, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann