Strong Generalization
Strong generalization, the ability of machine learning models to perform well on unseen data, is a central objective in current research. Active areas of investigation include improving the robustness of self-supervised learning, understanding the optimization dynamics of transformers and other architectures (including CNNs and RNNs), and developing methods to enhance generalization through data augmentation, regularization techniques (e.g., logical regularization, consistency regularization), and improved training strategies (e.g., few-shot learning, meta-learning). These advancements are crucial for building reliable and adaptable AI systems across diverse applications, from image classification and natural language processing to healthcare and robotics.
Papers
A Statistical Model for Predicting Generalization in Few-Shot Classification
Yassir Bendou, Vincent Gripon, Bastien Pasdeloup, Lukas Mauch, Stefan Uhlich, Fabien Cardinaux, Ghouthi Boukli Hacene, Javier Alonso Garcia
Improving generalization in reinforcement learning through forked agents
Olivier Moulin, Vincent Francois-Lavet, Mark Hoogendoorn
How Does Independence Help Generalization? Sample Complexity of ERM on Product Distributions
Tao Lin
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Aref Jafari, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart, Ali Ghodsi
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Peng Lu, Ivan Kobyzev, Mehdi Rezagholizadeh, Ahmad Rashid, Ali Ghodsi, Philippe Langlais
On Generalization and Regularization via Wasserstein Distributionally Robust Optimization
Qinyu Wu, Jonathan Yu-Meng Li, Tiantian Mao
Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior
Gabriel B Margolis, Pulkit Agrawal
Causal Inference via Style Transfer for Out-of-distribution Generalisation
Toan Nguyen, Kien Do, Duc Thanh Nguyen, Bao Duong, Thin Nguyen
Tackling Data Heterogeneity in Federated Learning with Class Prototypes
Yutong Dai, Zeyuan Chen, Junnan Li, Shelby Heinecke, Lichao Sun, Ran Xu