Filter Pruning
Filter pruning is a model compression technique aiming to reduce the computational cost and memory footprint of convolutional neural networks (CNNs) without significant accuracy loss. Current research focuses on developing efficient algorithms to identify and remove less important filters, often employing iterative pruning strategies combined with fine-tuning and exploring various importance metrics (e.g., norm-based, similarity-based, attention-based). This work is significant because it enables the deployment of deep learning models on resource-constrained devices like mobile phones and embedded systems, impacting applications ranging from object detection and face recognition to medical image analysis and real-time UAV tracking.
Papers
November 14, 2024
May 4, 2024
May 3, 2024
November 28, 2023
October 10, 2023
August 8, 2023
July 20, 2023
July 1, 2023
May 5, 2023
April 26, 2023
April 19, 2023
April 13, 2023
April 5, 2023
March 16, 2023
March 7, 2023
February 16, 2023
January 12, 2023
October 25, 2022
October 22, 2022