Pruning Strategy
Neural network pruning aims to reduce model size and computational cost without significant accuracy loss, improving efficiency and deployment on resource-constrained devices. Current research focuses on developing sophisticated pruning strategies, including structured and unstructured approaches, often employing techniques like magnitude-based ranking, mutual information analysis, and bi-level optimization, across various architectures such as convolutional neural networks (CNNs), vision transformers (ViTs), and even capsule networks. These advancements are crucial for deploying large-scale deep learning models in mobile and edge computing environments, and for enhancing model interpretability by identifying key features.
Papers
January 5, 2025
December 17, 2024
October 31, 2024
October 9, 2024
August 24, 2024
August 15, 2024
June 3, 2024
March 21, 2024
November 21, 2023
October 19, 2023
August 19, 2023
April 21, 2023
April 15, 2023
October 8, 2022
August 13, 2022
July 16, 2022
June 14, 2022
April 4, 2022
March 15, 2022