Shelf Model
"Shelf models" refer to the utilization of pre-trained models, developed for one task, and applied directly ("off-the-shelf") to a different task without further training or with minimal fine-tuning. Current research focuses on exploring the effectiveness of various pre-trained architectures, including Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), across diverse applications like image recognition, natural language processing, and robotics. This approach offers significant advantages in terms of efficiency and resource reduction, particularly valuable for resource-constrained environments and tasks with limited labeled data, while also highlighting and mitigating biases present in the original training data.
Papers
November 15, 2024
November 13, 2024
July 28, 2024
July 1, 2024
February 23, 2024
September 28, 2023
September 1, 2023
February 16, 2023
November 3, 2022
July 14, 2022
June 14, 2022
May 11, 2022
March 23, 2022