Efficient Attention

Efficient attention mechanisms aim to overcome the quadratic time and memory cost, in sequence length, of standard self-attention in Transformer networks, a major bottleneck for processing long sequences in applications such as natural language processing and image analysis. Current research focuses on faster attention algorithms, such as FlashAttention and its variants, and on architectural modifications, such as token pruning and compression and linear attention via orthogonal memory, that reduce computational cost and memory footprint while maintaining accuracy. These advances are crucial for scaling Transformer models to longer sequences and larger datasets, impacting fields ranging from large language models to medical image analysis and beyond.
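
To make the memory argument concrete, the following minimal NumPy sketch contrasts standard softmax attention, which materializes the full N x N score matrix, with a chunked "online softmax" variant in the spirit of FlashAttention that keeps only running per-row statistics. It is an illustrative CPU sketch, not the actual fused GPU kernel; the function names, shapes, and the chunk_size parameter are assumptions chosen for clarity.

```python
import numpy as np

def standard_attention(Q, K, V):
    """Vanilla softmax attention: builds the full (N, N) score matrix."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)               # (N, N) -- quadratic memory
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

def chunked_attention(Q, K, V, chunk_size=64):
    """Processes K/V in chunks, maintaining a running row-wise max and
    normalizer (online softmax) so only O(N * d) memory is needed."""
    d = Q.shape[-1]
    n = K.shape[0]
    out = np.zeros_like(Q)
    running_max = np.full((Q.shape[0], 1), -np.inf)
    running_sum = np.zeros((Q.shape[0], 1))
    for start in range(0, n, chunk_size):
        Kc = K[start:start + chunk_size]
        Vc = V[start:start + chunk_size]
        s = Q @ Kc.T / np.sqrt(d)               # (N, chunk) block of scores
        block_max = s.max(axis=-1, keepdims=True)
        new_max = np.maximum(running_max, block_max)
        # Rescale previously accumulated output and normalizer to the new max.
        correction = np.exp(running_max - new_max)
        p = np.exp(s - new_max)
        out = out * correction + p @ Vc
        running_sum = running_sum * correction + p.sum(axis=-1, keepdims=True)
        running_max = new_max
    return out / running_sum

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    Q, K, V = (rng.standard_normal((256, 32)) for _ in range(3))
    # The chunked version matches the quadratic-memory baseline numerically.
    assert np.allclose(standard_attention(Q, K, V),
                       chunked_attention(Q, K, V), atol=1e-6)
```

The key design point is that softmax can be computed incrementally: by tracking the running maximum and normalizer per query row, each block of keys and values can be processed and discarded, which is what lets tiled kernels like FlashAttention avoid ever storing the full attention matrix.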

Papers