Effective Training Data
Effective training data is crucial for the success of machine learning models, driving research into optimizing its quantity, quality, and characteristics for various tasks. Current efforts focus on developing methodologies for creating and evaluating datasets, including exploring the impact of dataset size and composition on model performance across diverse domains like natural language processing, graph classification, and medical imaging, often employing techniques like contrastive learning and transfer learning. These investigations aim to improve model robustness, generalization, and efficiency, ultimately leading to more reliable and impactful machine learning applications.
Papers
November 13, 2024
October 1, 2024
July 6, 2024
May 22, 2024
April 24, 2024
March 7, 2024
October 17, 2023
July 24, 2023
April 27, 2023
March 22, 2023
February 14, 2022