Structural Pruning

Structural pruning aims to improve the efficiency of large neural networks, such as large language models (LLMs) and convolutional neural networks (CNNs), by removing less important parameters without significantly sacrificing performance. Current research focuses on developing novel pruning algorithms, including those incorporating reinforcement learning and optimal transport, and applying these techniques to various architectures, with a particular emphasis on LLMs and vision transformers. These advancements are significant because they enable the deployment of powerful models on resource-constrained devices and reduce the computational cost of training and inference, impacting both scientific research and practical applications.

Papers

October 26, 2023

Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks
Zhaohui Yan, Songlin Yang, Wei Liu, Kewei Tu
Hypergraph Neural Network Joint Entity Structural Pruning Marker Selection High Order Inference

May 19, 2023

LLM-Pruner: On the Structural Pruning of Large Language Models
Xinyin Ma, Gongfan Fang, Xinchao Wang
Large Language Model Training Corpus Structural Pruning LD Pruner

May 18, 2023

Structural Pruning for Diffusion Models
Gongfan Fang, Xinyin Ma, Xinchao Wang
Diffusion Model Generative Modeling Diffusion Probabilistic Model Structural Pruning

May 3, 2023

Bicubic++: Slim, Slimmer, Slimmest -- Designing an Industry-Grade Super-Resolution Network
Bahri Batuhan Bilecen, Mustafa Ayazoglu
Super Resolution Convolutional Layer Structural Pruning

March 17, 2023

Dynamic Structure Pruning for Compressing CNNs
Jun-Hyung Park, Yeachan Kim, Junho Kim, Joon-Young Choi, SangKeun Lee
Neural Network Channel Pruning Structural Pruning

January 30, 2023

DepGraph: Towards Any Structural Pruning
Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
SE SPP DenseNet Structural Pruning Parameter Selection

December 6, 2022

Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances
Ajay Jaiswal, Tianlong Chen, Justin F. Rousseau, Yifan Peng, Ying Ding, Zhangyang Wang
Deep Neural Network Medical Image Deep Learning Classification Structural Pruning Output Imbalance

October 13, 2022

Structural Pruning via Latency-Saliency Knapsack
Maying Shen, Hongxu Yin, Pavlo Molchanov, Lei Mao, Jianna Liu, Jose M. Alvarez
Structural Pruning

September 15, 2022

Graph-to-Text Generation with Dynamic Structure Pruning
Liang Li, Ruiying Geng, Bowen Li, Can Ma, Yinliang Yue, Binhua Li, Yongbin Li
Cross Attention Structural Pruning Input Graph Structure Encoder Feature Graph Graph to Text

December 28, 2021

Speedup deep learning models on GPU by taking advantage of efficient unstructured pruning and bit-width reduction
Marcin Pietroń, Dominik Żurek
Deep Learning Model Single GPU Linear Speedup Low Bit Unstructured Pruning Pruning Performance Structural Pruning Neural Network Library

November 30, 2021

A Unified Pruning Framework for Vision Transformers
Hao Yu, Jianxin Wu
Vision Transformer Supervised ImageNet Model Compression Based Pruning Structural Pruning