Strong Generalization
Strong generalization, the ability of machine learning models to perform well on unseen data, is a central objective in current research. Active areas of investigation include improving the robustness of self-supervised learning, understanding the optimization dynamics of transformers and other architectures (including CNNs and RNNs), and developing methods to enhance generalization through data augmentation, regularization techniques (e.g., logical regularization, consistency regularization), and improved training strategies (e.g., few-shot learning, meta-learning). These advancements are crucial for building reliable and adaptable AI systems across diverse applications, from image classification and natural language processing to healthcare and robotics.
Papers
Self-training of Machine Learning Models for Liver Histopathology: Generalization under Clinical Shifts
Jin Li, Deepta Rajan, Chintan Shah, Dinkar Juyal, Shreya Chakraborty, Chandan Akiti, Filip Kos, Janani Iyer, Anand Sampat, Ali Behrooz
Towards Generalization on Real Domain for Single Image Dehazing via Meta-Learning
Wenqi Ren, Qiyu Sun, Chaoqiang Zhao, Yang Tang
Where Do We Go From Here? Guidelines For Offline Recommender Evaluation
Tobias Schnabel
Instance-Dependent Generalization Bounds via Optimal Transport
Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Andreas Krause, Jonas Rothfuss
An Information-Theoretic Approach for Estimating Scenario Generalization in Crowd Motion Prediction
Gang Qiao, Kaidong Hu, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic
What is my math transformer doing? -- Three results on interpretability and generalization
François Charton
Rethinking Generalization: The Impact of Annotation Style on Medical Image Segmentation
Brennan Nichyporuk, Jillian Cardinell, Justin Szeto, Raghav Mehta, Jean-Pierre R. Falet, Douglas L. Arnold, Sotirios A. Tsaftaris, Tal Arbel
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang, Yujie Lu, William Wang, Amy Zhang
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair, Brian Zhu, Gokul Narayanan, Eugen Solowjow, Sergey Levine