Eye View
Bird's-Eye-View (BEV) perception aims to create a top-down representation of a scene from multiple camera images, mimicking a helicopter view, crucial for autonomous driving and robotics. Current research focuses on improving BEV generation accuracy and robustness using various deep learning architectures, including transformers and attention mechanisms, often incorporating sensor fusion (e.g., lidar and camera) and addressing challenges like occlusion and varying camera viewpoints. This work is significant because accurate and reliable BEV representations are essential for safe and efficient navigation in autonomous systems, impacting the development of self-driving cars and other robotic applications.
Papers
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving
Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Xinpeng Ding, Jinahua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li