Model Fusion
Model fusion combines multiple machine learning models so that the merged model inherits their strengths, with the goal of performance and robustness that no single constituent model achieves on its own. Current research focuses on efficient fusion techniques for large language models (LLMs) and other deep learning architectures. It explores methods such as weight averaging, optimal transport, and mixture-of-experts models, and addresses challenges such as parameter interference and computational cost. These advances matter for the accuracy and reliability of AI systems across applications ranging from natural language processing and computer vision to personalized medicine and federated learning.
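Of the methods named above, weight averaging is the simplest to illustrate. The following is a minimal sketch, not a reference implementation: it assumes every model has the same architecture and that its parameters are stored as a plain dictionary mapping a parameter name to a flat list of floats. The function name `average_weights` and the type alias `StateDict` are hypothetical and chosen only for this example.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical representation: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def average_weights(
    models: Sequence[StateDict],
    coeffs: Optional[Sequence[float]] = None,
) -> StateDict:
    """Fuse models by a (weighted) average of their parameters.

    Assumes all models share the same parameter names and shapes.
    Coefficients are normalized so that they sum to 1.
    """
    if not models:
        raise ValueError("need at least one model")
    if coeffs is None:
        coeffs = [1.0] * len(models)  # uniform average by default
    if len(coeffs) != len(models):
        raise ValueError("one coefficient per model is required")
    total = sum(coeffs)
    if total == 0:
        raise ValueError("coefficients must not sum to zero")
    norm = [c / total for c in coeffs]

    names = set(models[0])
    for m in models[1:]:
        if set(m) != names:
            raise ValueError("models have different parameter names")

    fused: StateDict = {}
    for name in models[0]:
        size = len(models[0][name])
        if any(len(m[name]) != size for m in models):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted sum across all models.
        fused[name] = [
            sum(c * m[name][i] for c, m in zip(norm, models))
            for i in range(size)
        ]
    return fused


# Toy usage with two tiny "models".
model_a = {"w": [1.0, 2.0], "b": [0.0]}
model_b = {"w": [3.0, 4.0], "b": [1.0]}
uniform = average_weights([model_a, model_b])
weighted = average_weights([model_a, model_b], coeffs=[3.0, 1.0])
```

Plain averaging of this kind tends to work only when the models are close in parameter space, for example when they are fine-tuned from the same initialization. For models that are not, corresponding units may sit at different positions in each network, and this is one source of the parameter interference mentioned above. Alignment-based approaches such as optimal transport are aimed at that case.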