Learngene Layer
The "learngene" framework aims to improve the efficiency and effectiveness of transferring knowledge from large pre-trained models to smaller, task-specific models. Current research focuses on developing methods to condense and transfer crucial information—the "learngene"—from pre-trained models, often using techniques inspired by singular value decomposition and employing architectures like diffusion models and transformers. This approach promises to significantly reduce training time and computational costs while improving performance on downstream tasks, impacting both resource-constrained applications and the broader field of transfer learning.
Papers
September 28, 2024
August 14, 2024
April 25, 2024
January 16, 2024
December 10, 2023
December 9, 2023
October 16, 2023
June 17, 2023