Pruning Pipeline

Pruning pipelines aim to reduce the size and computational cost of neural networks, particularly for resource-constrained devices, while preserving accuracy. Current research focuses on developing more efficient and generalizable pruning algorithms, encompassing both structured and unstructured methods applied to various architectures like CNNs and transformers, and incorporating techniques like knowledge distillation and cyclical pruning to improve performance. These advancements are crucial for deploying sophisticated AI models in edge computing and other applications where computational resources are limited, enabling wider accessibility and deployment of powerful AI systems.

Papers

August 6, 2024

Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression
Jonas Schmitt, Ruiping Liu, Junwei Zheng, Jiaming Zhang, Rainer Stiefelhagen
Convolutional Neural Network Model Compression Edge Pruning Structured Pruning Based Pruning Hair Strand Pruning Pipeline

January 31, 2024

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs
Ying Su, Jipeng Zhang, Yangqiu Song, Tong Zhang
Knowledge Graph Yes No Question Knowledge Representation Graph Reasoning Answer Prediction Tool Grounding Differentiable Pruning Pruning Pipeline

December 4, 2023

An End-to-End Network Pruning Pipeline with Sparsity Enforcement
Evan Dogariu
Random Sparsification Sparsity Penalty Pruning Pipeline

August 26, 2022

Complexity-Driven CNN Compression for Resource-constrained Edge AI
Muhammad Zawish, Steven Davy, Lizy Abraham
Convolutional Layer Edge Device Edge Intelligence DNN Compression Edge Artificial Intelligence Pruning Pipeline

February 2, 2022

Cyclical Pruning for Sparse Neural Networks
Suraj Srinivas, Andrey Kuzmin, Markus Nagel, Mart van Baalen, Andrii Skliar, Tijmen Blankevoort
Large Scale Sparse Neural Network Neural Network Weight Progressive Pruning Magnitude Based Pruning Pruning Pipeline

Pruning Pipeline

Papers

Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs

An End-to-End Network Pruning Pipeline with Sparsity Enforcement

Complexity-Driven CNN Compression for Resource-constrained Edge AI

Cyclical Pruning for Sparse Neural Networks