Strong Generalization
Strong generalization, the ability of machine learning models to perform well on unseen data, is a central objective in current research. Active areas of investigation include improving the robustness of self-supervised learning, understanding the optimization dynamics of transformers and other architectures (including CNNs and RNNs), and developing methods to enhance generalization through data augmentation, regularization techniques (e.g., logical regularization, consistency regularization), and improved training strategies (e.g., few-shot learning, meta-learning). These advancements are crucial for building reliable and adaptable AI systems across diverse applications, from image classification and natural language processing to healthcare and robotics.
Papers
Unknown Domain Inconsistency Minimization for Domain Generalization
Seungjae Shin, HeeSun Bae, Byeonghu Na, Yoon-Yeong Kim, Il-Chul Moon
How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance
Hongkang Li, Shuai Zhang, Yihua Zhang, Meng Wang, Sijia Liu, Pin-Yu Chen
Near-Interpolators: Rapid Norm Growth and the Trade-Off between Interpolation and Generalization
Yutong Wang, Rishi Sonthalia, Wei Hu
Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets
Lorenzo Brigato, Stavroula Mougiakakou
Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation
Yijiang Li, Sucheng Ren, Weipeng Deng, Yuzhi Xu, Ying Gao, Edith Ngai, Haohan Wang
PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Zewen Chen, Haina Qin, Juan Wang, Chunfeng Yuan, Bing Li, Weiming Hu, Liang Wang
Neural Redshift: Random Networks are not Random Functions
Damien Teney, Armand Nicolicioiu, Valentin Hartmann, Ehsan Abbasnejad
A Survey on Evaluation of Out-of-Distribution Generalization
Han Yu, Jiashuo Liu, Xingxuan Zhang, Jiayun Wu, Peng Cui
Improving generalisation via anchor multivariate analysis
Homer Durand, Gherardo Varando, Nathan Mankovich, Gustau Camps-Valls
Improving out-of-distribution generalization in graphs via hierarchical semantic environments
Yinhua Piao, Sangseon Lee, Yijingxiu Lu, Sun Kim
On Robustness and Generalization of ML-Based Congestion Predictors to Valid and Imperceptible Perturbations
Chester Holtz, Yucheng Wang, Chung-Kuan Cheng, Bill Lin
Improving Group Connectivity for Generalization of Federated Deep Learning
Zexi Li, Jie Lin, Zhiqi Li, Didi Zhu, Rui Ye, Tao Shen, Tao Lin, Chao Wu
Generalizability Under Sensor Failure: Tokenization + Transformers Enable More Robust Latent Spaces
Geeling Chau, Yujin An, Ahamed Raffey Iqbal, Soon-Jo Chung, Yisong Yue, Sabera Talukder
From Generalization to Precision: Exploring SAM for Tool Segmentation in Surgical Environments
Kanyifeechukwu J. Oguine, Roger D. Soberanis-Mukul, Nathan Drenkow, Mathias Unberath