Sharpness Aware Minimization
Sharpness-Aware Minimization (SAM) is an optimization technique aiming to improve the generalization ability of machine learning models by finding "flatter" minima in the loss landscape, reducing overfitting and enhancing robustness. Current research focuses on refining SAM's algorithm, including adaptive radius selection, bilateral sharpness estimation, and integrating it with other techniques like federated learning and uncertainty quantification, often applied to models such as transformers and convolutional neural networks. This approach holds significant promise for improving the performance and reliability of machine learning models across diverse applications, particularly in areas like medical image analysis and time series forecasting where generalization is crucial.
Papers
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong, Liang Ding, Li Shen, Peng Mi, Juhua Liu, Bo Du, Dacheng Tao
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Peng Mi, Li Shen, Tianhe Ren, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji, Dacheng Tao