Layer Adaptive Weight Pruning

Layer adaptive weight pruning reduces the size and computational cost of deep neural networks by removing less important weights, assigning each layer its own pruning ratio so that model accuracy is preserved as much as possible. Current research focuses on efficient algorithms, often based on dynamic programming or coarse-to-fine search, for determining optimal per-layer pruning ratios across architectures ranging from convolutional neural networks to vision-language models. This work is significant because it enables the deployment of larger, more powerful models on resource-constrained devices while maintaining performance, improving both the efficiency and the accessibility of machine learning.
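As a concrete illustration of how per-layer pruning ratios can emerge adaptively, the sketch below uses one simple criterion: a single global magnitude threshold shared across layers. This is only one of many possible adaptivity criteria (the dynamic-programming and coarse-to-fine methods mentioned above are more sophisticated); the function name and the toy weight values are illustrative assumptions, not from any particular paper.

```python
def layer_adaptive_prune(layers, global_sparsity):
    """Prune weights below a single global magnitude threshold.

    Because the threshold is shared across layers, each layer ends up
    with its own pruning ratio: layers dominated by small-magnitude
    weights are pruned more aggressively than layers with large weights.
    layers: list of flat weight lists (one list per layer).
    Returns (pruned_layers, per_layer_pruning_ratios).
    """
    all_mags = sorted(abs(w) for layer in layers for w in layer)
    k = int(global_sparsity * len(all_mags))
    threshold = all_mags[k]  # k-th smallest magnitude overall
    pruned, ratios = [], []
    for layer in layers:
        kept = [w if abs(w) >= threshold else 0.0 for w in layer]
        pruned.append(kept)
        ratios.append(sum(1 for w in kept if w == 0.0) / len(layer))
    return pruned, ratios

# Toy example: one layer with large weights, one with small weights.
layers = [[1.0, -0.9, 0.8, 0.7], [0.1, -0.05, 0.2, 0.02]]
pruned, ratios = layer_adaptive_prune(layers, global_sparsity=0.5)
# The small-weight layer absorbs nearly all of the pruning,
# while the large-weight layer is left mostly intact.
```

Methods surveyed under this topic differ mainly in how they replace the crude global threshold with a learned or searched per-layer ratio that better predicts the accuracy impact of each removed weight.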

Papers