Cross View
Cross-view research focuses on bridging the significant visual discrepancies between images captured from different viewpoints, primarily aiming to improve the accuracy and robustness of tasks like geolocalization, scene understanding, and 3D reconstruction. Current research heavily utilizes deep learning models, including transformers, autoencoders, and diffusion models, often incorporating techniques like contrastive learning, bird's-eye-view transformations, and geometric constraints to align and fuse information across views. This field is crucial for advancing autonomous navigation, remote sensing, and human-computer interaction applications by enabling more reliable and efficient processing of multi-perspective data.
Papers
Probing Fine-Grained Action Understanding and Cross-View Generalization of Foundation Models
Thinesh Thiyakesan Ponbagavathi, Kunyu Peng, Alina Roitberg
Spatial-Temporal Cross-View Contrastive Pre-training for Check-in Sequence Representation Learning
Letian Gong, Huaiyu Wan, Shengnan Guo, Xiucheng Li, Yan Lin, Erwen Zheng, Tianyi Wang, Zeyu Zhou, Youfang Lin