Model Expansion
Model expansion focuses on efficiently scaling up existing machine learning models, particularly deep neural networks and transformers, to improve performance without requiring complete retraining from scratch. Current research explores various expansion techniques, including graph-based methods, iterative local expansions, and function-preserving transformations, often applied to enhance generative models, continual learning systems, and information retrieval. These advancements aim to reduce the substantial computational costs associated with training increasingly large models, thereby accelerating progress in AI and enabling the development of more powerful and efficient systems for diverse applications.
Papers
August 16, 2024
July 11, 2024
May 24, 2024
December 14, 2023
December 2, 2023
October 12, 2023
September 15, 2023
August 11, 2023