Aware Spatial Cross Attention

Aware spatial cross-attention mechanisms enhance deep learning models by selectively focusing on relevant spatial information across different feature layers or modalities. Current research emphasizes integrating these mechanisms into transformer architectures and applying them to diverse tasks, including object detection in aerial and medical images, bird's-eye view generation for autonomous driving, and fine-grained visual categorization. This approach improves model performance by mitigating information loss and enhancing the representation of both global context and localized details, leading to more accurate and robust results in various computer vision applications.

Papers

November 27, 2024

DualCast: Disentangling Aperiodic Events from Traffic Series with a Dual-Branch Model
Xinyu Su, Feng Liu, Yanchuan Chang, Egemen Tanin, Majid Sarvi, Jianzhong Qi
Traffic Forecasting Dual Branch Traffic Video Based Event Aware Spatial Cross Attention

July 29, 2024

Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
Zewen Du, Zhenjiang Hu, Guiyu Zhao, Ying Jin, Hongbin Ma
Aerial Image Small Object Detection Feature Pyramid Network Cross Layer Attention Pyramid Cross Fusion Transformer Network Aware Spatial Cross Attention

October 9, 2023

AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention
Xianming Gu, Lihui Wang, Zeyu Deng, Ying Cao, Xingyu Huang, Yue-min Zhu
Image Fusion Multi Modality Cross Attention Fusion Frequency Band Attention Aware Spatial Cross Attention

March 7, 2023

F2BEV: Bird's Eye View Generation from Surround-View Fisheye Camera Images for Automated Driving
Ekta U. Samani, Feng Tao, Harshavardhan R. Dasari, Sihao Ding, Ashis G. Banerjee
Automated Driving Eye View Bird Specie Fisheye Image Fisheye Video Surround View Fisheye Aware Spatial Cross Attention

February 25, 2023

Introducing Depth into Transformer-based 3D Object Detection
Hao Zhang, Hongyang Li, Ailing Zeng, Feng Li, Shilong Liu, Xingyu Liao, Lei Zhang
Cross Attention Large Depth Monocular 3D Detection Depth Aware Transformer Transformer Based 3D Object Aware Spatial Cross Attention

October 17, 2022

Cross-layer Attention Network for Fine-grained Visual Categorization
Ranran Huang, Yu Wang, Huazhong Yang
Fine Grained Fine Grained Visual Cross Layer Attention Aware Spatial Cross Attention

Aware Spatial Cross Attention

Papers

DualCast: Disentangling Aperiodic Events from Traffic Series with a Dual-Branch Model

Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images

AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention

F2BEV: Bird's Eye View Generation from Surround-View Fisheye Camera Images for Automated Driving

Introducing Depth into Transformer-based 3D Object Detection

Cross-layer Attention Network for Fine-grained Visual Categorization