Magnitude Based Pruning
Magnitude-based pruning is a neural network compression technique aiming to reduce model size and computational cost while preserving accuracy. Current research focuses on applying this method to various architectures, including vision-language models, graph neural networks, and those used in federated learning and speech recognition, often incorporating iterative pruning strategies and novel metrics to guide the pruning process. This work is significant because it enables deployment of large, powerful models on resource-constrained devices and improves the efficiency of training and inference, impacting both scientific understanding of model sparsity and practical applications across diverse fields.
Papers
November 1, 2024
June 27, 2024
March 12, 2024
August 5, 2023
July 10, 2023
June 21, 2023
May 30, 2023
May 23, 2023
April 15, 2023
December 28, 2022
April 24, 2022
March 31, 2022
March 16, 2022
February 2, 2022