Shallow Decoder

Shallow decoder architectures aim to make encoder-decoder models more efficient, primarily by reducing computational cost and inference latency without significantly sacrificing performance. Current research focuses on strategies such as dynamic early exiting, simpler decoder structures (e.g., linear transforms), and multiple shallow decoders, each specialized for a subset of tasks or languages. This work addresses a critical bottleneck in deploying complex models, such as those used in machine translation and image compression, and so enables faster and more resource-efficient applications.
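To make the idea of dynamic early exiting concrete, here is a minimal sketch in plain Python. It is not the method of any paper listed on this page: the confidence-threshold exit rule, the shared output head, and every name in it (`decode_step_with_early_exit`, `double`, `identity_head`) are assumptions chosen for illustration. For one decoding step, the decoder layers run one at a time, and the step stops as soon as the prediction from the current layer looks confident enough.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode_step_with_early_exit(
    hidden: Vector,
    layers: Sequence[Callable[[Vector], Vector]],
    output_head: Callable[[Vector], Vector],
    threshold: float = 0.9,
) -> Tuple[int, int]:
    """Run decoder layers in order and stop once the prediction is confident.

    Hypothetical exit rule (an assumption, not taken from the listed papers):
    exit when the largest softmax probability reaches `threshold`.
    Returns (predicted_token_index, number_of_layers_used).
    """
    if not layers:
        raise ValueError("at least one decoder layer is required")
    probs: List[float] = []
    depth = 0
    for depth, layer in enumerate(layers, start=1):
        hidden = layer(hidden)
        # The same output head is applied after every layer (shared exit head).
        probs = softmax(output_head(hidden))
        if max(probs) >= threshold:
            break  # confident enough: skip the remaining layers
    token = max(range(len(probs)), key=probs.__getitem__)
    return token, depth


# Toy stand-ins for real layers: each "layer" doubles the hidden state,
# so the prediction sharpens with depth. The head is the identity map.
def double(h: Vector) -> Vector:
    return [2.0 * x for x in h]


def identity_head(h: Vector) -> Vector:
    return list(h)


toy_layers = [double, double, double, double]
token, used = decode_step_with_early_exit([1.0, 0.0], toy_layers, identity_head)
print(token, used)
```

In this toy setup, a lower threshold exits after fewer layers and a threshold above 1 can never be met, so all layers run. A real system would use trained layers and would also need to handle the attention states of skipped layers, which this sketch ignores.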

Papers

November 15, 2023

DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models
Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha
Early Exit, MEG Decoder, Encoder Decoder, Transformer Model, Shallow Decoder

April 13, 2023

Computationally-Efficient Neural Image Compression with Shallow Decoders
Yibo Yang, Stephan Mandt
Neural Image Compression, Rate Distortion Performance, Recursive Encoding, Shallow Decoder

June 5, 2022

Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
Xiang Kong, Adithya Renduchintala, James Cross, Yuqing Tang, Jiatao Gu, Xian Li
Multilingual Neural Machine Translation, Multilingual Translation, Many to Many, Deep Encoder, Shallow Decoder
