Self-Attention Module

Self-attention modules are a core component of transformer-based models, designed to capture long-range dependencies within data sequences. Current research focuses on improving the efficiency of self-attention, particularly its quadratic complexity in sequence length, through techniques such as FlashAttention and various forms of sparse attention, and on integrating it effectively with other modules, as in grouped residual self-attention or cascade attention blocks. These advances matter because they allow transformer architectures to scale to larger datasets and more complex tasks across diverse fields, including computer vision, natural language processing, and signal processing, while reducing computational cost.
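
As a point of reference for the efficiency discussion above, the sketch below shows plain (dense) scaled dot-product self-attention in PyTorch; the explicitly materialized seq_len × seq_len score matrix is the source of the quadratic cost that FlashAttention and sparse-attention variants aim to avoid or approximate. The class name, dimensions, and single-head simplification are illustrative assumptions, not code from any of the papers listed below.

```python
# Minimal sketch of standard (dense) scaled dot-product self-attention,
# single head, no masking. Illustrative assumptions only, not a reference
# implementation from any specific paper.
import torch
import torch.nn as nn


class SelfAttention(nn.Module):
    def __init__(self, d_model: int):
        super().__init__()
        self.d_model = d_model
        # Learned projections for queries, keys, and values.
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        # The (seq_len x seq_len) attention score matrix below is what makes
        # dense self-attention quadratic in sequence length.
        scores = q @ k.transpose(-2, -1) / (self.d_model ** 0.5)
        weights = torch.softmax(scores, dim=-1)
        return weights @ v  # (batch, seq_len, d_model)


x = torch.randn(2, 128, 64)            # batch of 2 sequences of length 128
out = SelfAttention(d_model=64)(x)     # -> torch.Size([2, 128, 64])
```

Efficient variants keep this same query-key-value formulation but change how the score matrix is computed: FlashAttention tiles the computation so the full matrix is never stored in high-bandwidth memory, while sparse attention restricts each query to a subset of keys.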

Papers