Memory Efficient

Memory-efficient deep learning focuses on developing and optimizing neural network models and training algorithms to minimize memory consumption, enabling deployment on resource-constrained devices and scaling to larger models. Current research emphasizes techniques like neural architecture search (NAS) to design inherently efficient architectures (e.g., optimized convolutional and transformer networks, memory-efficient Graph Neural Networks), low-rank approximations, mixed-precision training, and novel optimization algorithms (e.g., variants of Adam, Shampoo) that reduce memory overhead during both training and inference. These advancements are crucial for expanding the accessibility and applicability of deep learning across diverse hardware platforms and datasets, particularly in edge computing and large language model training.

Papers

September 22, 2022

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training
Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo
Activation Function Deep Learning Framework Automatic Differentiation Memory Efficient Mode Automatic Differentiation

September 19, 2022

Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira, Juan Gómez-Luna, Saugata Ghose, Amirali Boroumand, Onur Mutlu
Extreme Edge Cloud Computing Memory Efficient Neural Network Inference Processing in Memory Random Access Memory

June 9, 2022

A New Frontier of AI: On-Device AI Training and Personalization
Ji Joong Moon, Hyun Suk Lee, Jiho Chu, Donghak Park, Seungbaek Hong, Hyungjun Seo, Donghyeon Jeong, Sungsik Kong, MyungJoo Ham
Machine Learning Artificial Intelligence New Frontier Memory Efficient Device Training

June 1, 2022

Multi-Complexity-Loss DNAS for Energy-Efficient and Memory-Constrained Deep Neural Networks
Matteo Risso, Alessio Burrello, Luca Benini, Enrico Macii, Massimo Poncino, Daniele Jahier Pagliari
Neural Architecture Search Trade Offs Memory Efficient Differentiable Neural Architecture Search Memory Constraint Differentiable Na

May 12, 2022

Mondrian Forest for Data Stream Classification Under Memory Constraints
Martin Khannouz, Tristan Glatard
Memory Efficient Memory Constraint Data Stream Classification Mondrian Forest

May 10, 2022

SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures
Yunjae Lee, Jinha Chung, Minsoo Rhu
Memory Efficient Large Scale GNN

April 22, 2022

MEKER: Memory Efficient Knowledge Embedding Representation for Link Prediction and Question Answering
Viktoriia Chekalina, Anton Razzhigaev, Albert Sayapin, Evgeny Frolov, Alexander Panchenko
Knowledge Graph Question Answering Link Prediction NLP Task Memory Efficient High Bandwidth Memory

April 18, 2022

Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution
Zongcai Du, Ding Liu, Jie Liu, Jie Tang, Gangshan Wu, Lean Fu
Super Resolution Image Super Resolution Memory Efficient

March 31, 2022

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
Feng Cheng, Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Li, Wei Xia
Deep Neural Network Action Recognition Back Propagation Temporal Action Detection Memory Efficient Stochastic Neural Network Video Based Model

February 28, 2022

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training
Joya Chen, Kai Xu, Yuhui Wang, Yifei Cheng, Angela Yao
Convolutional Neural Network Convolutional Layer Memory Efficient Order Tensor

December 21, 2021

iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images
Qin Liu, Zhenlin Xu, Yining Jiao, Marc Niethammer
Neural Network Transformer Megatron Decepticons Application Proficiency Swin Transformer Interactive Segmentation Memory Efficient 3D Knee MR Image

November 18, 2021

COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression
Sian Jin, Chengming Zhang, Xintong Jiang, Yunhe Feng, Hui Guan, Guanpeng Li, Shuaiwen Leon Song, Dingwen Tao
Memory Efficient Chasing COMET Error Bounded Lossy Activation Compression

November 9, 2021

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks
Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele
Neural Network Deep Learning Timely Survey Hardware Accelerator Empirical Evaluation Memory Efficient Parallel Text

Memory Efficient

Papers

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training

Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud

A New Frontier of AI: On-Device AI Training and Personalization

Multi-Complexity-Loss DNAS for Energy-Efficient and Memory-Constrained Deep Neural Networks

Mondrian Forest for Data Stream Classification Under Memory Constraints

SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures

MEKER: Memory Efficient Knowledge Embedding Representation for Link Prediction and Question Answering

Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training

iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images

COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks