Generalization Property
Generalization, a machine learning model's ability to perform well on unseen data, is a central research focus, aiming to understand why and how models generalize beyond their training data. Current research investigates this through various lenses, including analyzing the impact of training schedules, loss landscape sharpness (e.g., using SAM), and model architectures like ResNets and GFlowNets, as well as exploring the role of data variability and the effects of quantization. Improved understanding of generalization properties is crucial for building more reliable and robust machine learning systems across diverse applications, from scientific computing to medical diagnosis.
Papers
Algorithmic Stability of Heavy-Tailed Stochastic Gradient Descent on Least Squares
Anant Raj, Melih Barsbey, Mert Gürbüzbalaban, Lingjiong Zhu, Umut Şimşekli
Score-Based Generative Models Detect Manifolds
Jakiw Pidstrigach
Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules
Yuhan Helena Liu, Arna Ghosh, Blake A. Richards, Eric Shea-Brown, Guillaume Lajoie