Model Generalization
Model generalization, the ability of a machine learning model to perform well on unseen data, is a central challenge in the field. Current research focuses on improving generalization through techniques like sharpness-aware minimization (finding flatter minima in the loss landscape), data augmentation (especially learnable augmentation to address bias), and coreset selection (using influence functions to identify the most informative training data). These efforts, often applied to various architectures including large language models and convolutional neural networks, aim to enhance model robustness and reliability across diverse datasets and real-world applications, ultimately leading to more trustworthy and effective AI systems.
Papers
November 18, 2024
November 12, 2024
November 2, 2024
October 29, 2024
October 28, 2024
October 22, 2024
October 21, 2024
October 10, 2024
August 15, 2024
August 9, 2024
August 7, 2024
July 29, 2024
July 25, 2024
July 22, 2024
June 25, 2024
June 18, 2024
June 13, 2024
June 11, 2024
June 10, 2024