Energy-Efficient Inference
Energy-efficient inference focuses on minimizing the computational resources and power consumption required to run deep learning models, particularly at the edge, where resources are limited. Current research emphasizes techniques such as model compression (e.g., pruning, quantization, knowledge distillation), efficient algorithms (e.g., spiking neural networks, dynamic decision trees), and hardware-aware optimization (e.g., mapping DNNs to multi-accelerator SoCs, specialized hardware accelerators); a sketch of two compression techniques follows below. These advances are crucial for deploying AI in resource-constrained environments such as embedded systems and IoT devices, for reducing the environmental impact of AI, and for broadening access to AI applications.
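As a minimal sketch of two of the compression techniques named above, the PyTorch snippet below applies unstructured magnitude pruning followed by post-training dynamic quantization to a toy model. The layer sizes and the 50% sparsity level are illustrative assumptions, not drawn from any particular paper.

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Toy model standing in for a network to be deployed on an edge device.
model = nn.Sequential(
    nn.Linear(128, 64),
    nn.ReLU(),
    nn.Linear(64, 10),
)

# 1. Unstructured magnitude pruning: zero the 50% of weights with the
#    smallest absolute value in the first linear layer, then make the
#    sparsity permanent so the layer holds an ordinary tensor again.
prune.l1_unstructured(model[0], name="weight", amount=0.5)
prune.remove(model[0], "weight")

# 2. Post-training dynamic quantization: weights of the listed module
#    types are stored as int8 and dequantized on the fly, cutting model
#    size and the energy cost of memory traffic at inference time.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

# The compressed model is a drop-in replacement for inference; accuracy
# should be re-validated after compression.
x = torch.randn(1, 128)
print(quantized(x).shape)  # torch.Size([1, 10])
```

Dynamic quantization is shown here because it requires no calibration data; static quantization or quantization-aware training typically recovers more accuracy at the cost of additional effort.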