Model Generalization
Model generalization, the ability of a machine learning model to perform well on unseen data, is a central challenge in the field. Current research focuses on improving generalization through techniques like sharpness-aware minimization (finding flatter minima in the loss landscape), data augmentation (especially learnable augmentation to address bias), and coreset selection (using influence functions to identify the most informative training data). These efforts, often applied to various architectures including large language models and convolutional neural networks, aim to enhance model robustness and reliability across diverse datasets and real-world applications, ultimately leading to more trustworthy and effective AI systems.
Papers
February 3, 2023
January 24, 2023
December 8, 2022
November 7, 2022
November 4, 2022
November 1, 2022
October 25, 2022
October 24, 2022
October 22, 2022
October 13, 2022
October 11, 2022
October 10, 2022
September 30, 2022
September 16, 2022
September 13, 2022
August 21, 2022
August 14, 2022
July 2, 2022