Model Fusion
Model fusion aims to combine the strengths of multiple machine learning models, improving performance and robustness beyond what any single model can achieve. Current research focuses on efficient fusion techniques for large language models (LLMs) and other deep learning architectures, exploring methods like weight averaging, optimal transport, and mixture-of-experts models to address challenges such as parameter interference and computational cost. These advancements are significant for improving the accuracy and reliability of AI systems across diverse applications, from natural language processing and computer vision to personalized medicine and federated learning.
Papers
December 9, 2024
December 4, 2024
December 3, 2024
November 27, 2024
November 17, 2024
November 11, 2024
October 27, 2024
October 16, 2024
October 1, 2024
August 22, 2024
August 19, 2024
July 29, 2024
July 7, 2024
June 14, 2024
May 15, 2024
May 6, 2024
April 17, 2024
March 17, 2024