Online Merging
Online merging focuses on combining multiple trained neural network models into a single, more powerful model, aiming to reduce resource consumption, improve generalization, and streamline model development. Current research emphasizes scaling merging techniques to larger models (e.g., transformers with billions of parameters) and exploring various merging methods, including averaging, least squares optimization, and specialized techniques like Foldable SuperNets. This area is significant because efficient model merging can improve the performance and cost-effectiveness of machine learning systems across diverse applications, from image processing and natural language processing to autonomous driving and program repair.
Papers
September 21, 2023
August 15, 2023
June 29, 2023
June 6, 2023
June 2, 2023
May 31, 2023
March 29, 2023
March 12, 2023
December 1, 2022
November 15, 2022
October 20, 2022
September 30, 2022
July 14, 2022
July 11, 2022
June 11, 2022
March 2, 2022
January 28, 2022