Data Mixture
Data mixture research focuses on optimizing the composition of training datasets to improve the performance and efficiency of machine learning models, particularly large language models and robotics systems. Current research emphasizes automated methods for determining optimal data mixtures, employing techniques like distributionally robust optimization, bilevel optimization, and regression models to predict performance based on mixture composition. These advancements are significant because carefully curated data mixtures can substantially improve model generalization, reduce training time, and enhance performance on diverse downstream tasks, impacting various fields from natural language processing to robotics.
Papers
December 23, 2024
December 18, 2024
November 8, 2024
October 21, 2024
October 15, 2024
October 8, 2024
August 26, 2024
July 29, 2024
July 23, 2024
July 1, 2024
June 17, 2024
June 3, 2024
May 23, 2024
April 1, 2024
March 25, 2024
March 13, 2024
December 25, 2023
September 19, 2023
September 13, 2023