Model Generalization
Model generalization is the ability of a machine learning model to perform well on data it did not see during training, and it remains a central challenge in the field. Current research aims to improve generalization through techniques such as sharpness-aware minimization (which seeks flatter minima in the loss landscape), data augmentation (including learnable augmentation intended to address bias), and coreset selection (which uses influence functions to identify the most informative training data). These techniques are applied to a range of architectures, including large language models and convolutional neural networks, with the goal of making models more robust and reliable across diverse datasets and real-world applications, and therefore more trustworthy in practice.
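To make the idea of sharpness-aware minimization more concrete, here is a minimal sketch in plain Python. It is not taken from any of the papers listed on this page. The toy quadratic `loss`, its analytic `grad`, and the names `sam_step`, `run`, `lr`, and `rho` are all assumptions chosen for illustration. The sketch follows the commonly described two-stage update: first move the weights a small distance `rho` along the normalized gradient, toward a nearby higher-loss point, then apply the gradient computed at that perturbed point to the original weights.

```python
import math


def loss(w):
    # Toy quadratic loss (hypothetical); stands in for a real training loss.
    return 0.5 * (3.0 * w[0] ** 2 + 0.5 * w[1] ** 2)


def grad(w):
    # Analytic gradient of the toy loss above.
    return [3.0 * w[0], 0.5 * w[1]]


def sam_step(w, lr=0.1, rho=0.05):
    """One sharpness-aware update on the toy loss (illustrative only)."""
    g = grad(w)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        # Zero gradient: no ascent direction is defined, so leave w unchanged.
        return list(w)
    # Stage 1: perturb the weights by rho along the normalized gradient.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    # Stage 2: descend from the original weights using the perturbed gradient.
    g_adv = grad(w_adv)
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]


def run(steps=100):
    # Hypothetical starting point for the demonstration.
    w = [1.0, -2.0]
    for _ in range(steps):
        w = sam_step(w)
    return w
```

Calling `run()` repeatedly applies `sam_step` from a fixed starting point, and comparing `loss(run())` with the initial loss shows that the update still reduces the loss. In a real setting the gradient would come from automatic differentiation over mini-batches, and `rho` would be tuned per task.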