Strong Generalization
Strong generalization, the ability of a machine learning model to perform well on unseen data, is a central objective of current research. Active areas of investigation include improving the robustness of self-supervised learning; understanding the optimization dynamics of transformers and other architectures, including CNNs and RNNs; and enhancing generalization through data augmentation, regularization techniques (e.g., logical regularization, consistency regularization), and improved training strategies (e.g., few-shot learning and meta-learning). These advances are crucial for building reliable, adaptable AI systems across diverse applications, from image classification and natural language processing to healthcare and robotics.
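One of the techniques mentioned above, consistency regularization, can be illustrated with a minimal sketch: the model is shown two random augmentations of the same input, and an auxiliary loss penalizes disagreement between the two predictions, encouraging invariance to the augmentation. The linear "model" and Gaussian-noise "augmentation" below are hypothetical stand-ins, not from any of the listed papers.

```python
import numpy as np

def consistency_loss(model, x, augment, rng):
    """Mean-squared disagreement between predictions on two random augmentations."""
    p1 = model(augment(x, rng))
    p2 = model(augment(x, rng))
    return float(np.mean((p1 - p2) ** 2))

# Toy setup (hypothetical): a fixed linear model and additive Gaussian-noise augmentation.
rng = np.random.default_rng(0)
w = np.array([0.5, -1.0])
model = lambda x: x @ w
augment = lambda x, rng: x + rng.normal(scale=0.1, size=x.shape)

x = np.array([[1.0, 2.0], [3.0, 4.0]])
loss = consistency_loss(model, x, augment, rng)
# Minimizing this term (alongside the usual supervised loss) pushes the model
# toward augmentation-invariant predictions, one route to stronger generalization.
```

In practice this auxiliary term is added to the task loss with a weighting coefficient, and the augmentation is task-specific (crops and color jitter for images, token dropout for text, and so on).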
Papers
Representations as Language: An Information-Theoretic Framework for Interpretability
On the Limitations of Fractal Dimension as a Measure of Generalization
DNCs Require More Planning Steps
Verifying the Generalization of Deep Learning to Out-of-Distribution Domains
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning