Edge Pruning

Edge pruning is a neural network compression technique aiming to reduce computational costs and memory usage by removing less important connections or parameters without significant performance degradation. Current research focuses on developing efficient pruning algorithms for various architectures, including convolutional neural networks (CNNs), vision transformers (ViTs), and large language models (LLMs), often incorporating techniques like knowledge distillation and optimization-based methods to improve performance after pruning. This work is significant because it enables the deployment of large, powerful models on resource-constrained devices and improves the energy efficiency of training and inference, impacting both scientific understanding of model redundancy and practical applications across diverse fields.

Papers

September 25, 2023

On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''
Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk
Self Supervised Learning Global Impact Quantization Operator Edge Pruning Speech Recognition System Self Supervised Speech Model Speech Recognition Accuracy Speech Recognition Task Edge Speech Application

September 22, 2023

ThinResNet: A New Baseline for Structured Convolutional Networks Pruning
Hugo Tessier, Ghouti Boukli Hacene, Vincent Gripon
Edge Pruning Structured Pruning Neural Network Pruning New Baseline

September 15, 2023

Frustratingly Simple Memory Efficiency for Pre-trained Language Models via Dynamic Embedding Pruning
Miles Williams, Nikolaos Aletras
Pre Trained Language Model Edge Pruning PLM Based Memory Constraint Memory Efficiency Large Vocabulary Memory Allocation

September 12, 2023

Learning Minimalistic Tsetlin Machine Clauses with Markov Boundary-Guided Pruning
Ole-Christoffer Granmo, Per-Arne Andersen, Lei Jiao, Xuan Zhang, Christian Blakely, Tor Tveit
Edge Pruning Markov Chain Tsetlin Machine Markov Model Markov Blanket Context Specific Independence

August 27, 2023

Pruning the Unlabeled Data to Improve Semi-Supervised Learning
Guy Hacohen, Daphna Weinshall
Deep Learning Model Semi Supervised Learning Edge Pruning Unlabeled Data Image Classification Task Unlabeled Dataset

August 19, 2023

August 17, 2023

How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Gregory Holste, Ziyu Jiang, Ajay Jaiswal, Maria Hanna, Shlomo Minkowitz, Alan C. Legasto, Joanna G. Escalon, Sharon Steinberger, Mark Bittman, Thomas C. Shen, Ying Ding, Ronald M. Summers, George Shih, Yifan Peng, Zhangyang Wang
Edge Pruning Multi Label Image Classification Model Error Pruned Model

August 14, 2023

Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning
Shipeng Bai, Jun Chen, Xintian Shen, Yixuan Qian, Yong Liu
Quantization Operator Edge Pruning Structured Pruning Free Pruning

August 13, 2023

Influence Function Based Second-Order Channel Pruning-Evaluating True Loss Changes For Pruning Is Possible Without Retraining
Hongrong Cheng, Miao Zhang, Javen Qinfeng Shi
Edge Pruning Pruning Method Influence Function Channel Pruning Pruning Mask Aware Loss

August 4, 2023

Pruning a neural network using Bayesian inference
Sunil Mathew, Daniel B. Rowe
Neural Network Bayesian Inference Edge Pruning Neural Network Pruning Iterative Pruning Posterior Probability

July 22, 2023

Sparse then Prune: Toward Efficient Vision Transformers
Yogi Prasetyo, Novanto Yudistira, Agus Wahyu Widodo
Vision Transformer Many Sparse Edge Pruning Model Pruning Vision Transformer Architecture Vision Transformer Model Sparse Regularization

July 20, 2023

July 10, 2023

One-Shot Pruning for Fast-adapting Pre-trained Models on Devices
Haiyan Zhao, Guodong Long
Convolutional Neural Network Pre Trained Model Edge Pruning Model Pruning Large Scale Pre Trained Model Smart Device Shot Pruning

July 6, 2023

Pruning vs Quantization: Which is Better?
Andrey Kuzmin, Markus Nagel, Mart van Baalen, Arash Behboodi, Tijmen Blankevoort
Neural Network Deep Neural Network Quantization Operator Edge Pruning Quantization Technique Neural Network Quantization Layer Wise Pruning

June 22, 2023

Pruning for Better Domain Generalizability
Xinglong Sun
Edge Pruning Pruning Method Domain Generalizability Domain Performance Channel Sparsity

June 21, 2023

June 13, 2023

Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care
Ali Shirali, Alexander Schubert, Ahmed Alaa
Reinforcement Learning Constructive Approach Reward Function Edge Pruning Deep Q Learning Conservative Q Learning Intensive Care Multi Objective Deep Reinforcement Learning

Edge Pruning

Papers

On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild''

ThinResNet: A New Baseline for Structured Convolutional Networks Pruning

Frustratingly Simple Memory Efficiency for Pre-trained Language Models via Dynamic Embedding Pruning

Learning Minimalistic Tsetlin Machine Clauses with Markov Boundary-Guided Pruning

Pruning the Unlabeled Data to Improve Semi-Supervised Learning

HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation

To prune or not to prune : A chaos-causality approach to principled pruning of dense neural networks

How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?

Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning

Influence Function Based Second-Order Channel Pruning-Evaluating True Loss Changes For Pruning Is Possible Without Retraining

Pruning a neural network using Bayesian inference

Sparse then Prune: Toward Efficient Vision Transformers

PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks

Learned Thresholds Token Merging and Pruning for Vision Transformers

One-Shot Pruning for Fast-adapting Pre-trained Models on Devices

Pruning vs Quantization: Which is Better?

Pruning for Better Domain Generalizability

Fantastic Weights and How to Find Them: Where to Prune in Dynamic Sparse Training

Quantifying lottery tickets under label noise: accuracy, calibration, and complexity

Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care