Attention Flow

Attention flow research seeks to understand and manipulate how individual input elements influence a model's output, particularly in transformer-based architectures. Current work develops efficient methods for computing and visualizing these flows, often casting the stacked attention maps as a flow network to overcome the computational cost of tracing influence through many layers, with applications to tasks such as video generation and scene graph construction. The broader aims are improved interpretability, greater efficiency on long sequences, and more robust, accurate models across domains. Novel attention mechanisms such as flow-attention, which linearizes attention by enforcing flow conservation, mark concrete progress toward these goals.
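
To make the flow-network view concrete, the sketch below scores each input token's influence on one output position by running max-flow through a layered graph built from per-layer attention matrices, in the spirit of attention flow as formulated by Abnar & Zuidema (2020). It is a minimal illustration under stated assumptions, not any specific paper's implementation: the function name, the 0.5 residual mixing, and the toy data are all hypothetical choices made for the example.

```python
import numpy as np
import networkx as nx

def attention_flow(attentions, output_pos):
    """Score each input token's influence on `output_pos` via max-flow.

    attentions: list of (seq_len, seq_len) arrays, one head-averaged
    attention matrix per layer (row i attends to column j).
    Returns an array of max-flow values, one per input token.
    """
    num_layers = len(attentions)
    seq_len = attentions[0].shape[0]

    g = nx.DiGraph()
    for layer, att in enumerate(attentions):
        # Mix in the identity to account for residual connections
        # (0.5 is an illustrative choice), then renormalize rows.
        att = 0.5 * att + 0.5 * np.eye(seq_len)
        att = att / att.sum(axis=-1, keepdims=True)
        for i in range(seq_len):
            for j in range(seq_len):
                # Edge from node (layer+1, i) down to (layer, j);
                # capacity = attention weight.
                g.add_edge((layer + 1, i), (layer, j),
                           capacity=float(att[i, j]))

    # Max-flow from the chosen output node down to each input token.
    source = (num_layers, output_pos)
    flows = np.zeros(seq_len)
    for tok in range(seq_len):
        flows[tok] = nx.maximum_flow_value(g, source, (0, tok))
    return flows

# Toy usage: 2 layers, 4 tokens of random row-stochastic attention.
rng = np.random.default_rng(0)
atts = [rng.dirichlet(np.ones(4), size=4) for _ in range(2)]
print(attention_flow(atts, output_pos=3))
```

Because edge capacities bound how much influence can pass through any intermediate token, max-flow values saturate less quickly than naively multiplying attention matrices, which is one motivation for the flow-network formulation.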

Papers