Early Stage Convergence
Early stage convergence in machine learning focuses on understanding and improving the initial phases of training algorithms, aiming to accelerate convergence speed and enhance generalization performance. Current research investigates this through the lens of various optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies leverage techniques from dynamical systems theory and optimal transport to establish convergence guarantees and bounds, ultimately contributing to more efficient and robust machine learning systems across diverse applications.
Papers
A new approach to generalisation error of machine learning algorithms: Estimates and convergence
Michail Loulakis, Charalambos G. Makridakis
Synthetic data shuffling accelerates the convergence of federated learning under data heterogeneity
Bo Li, Yasin Esfandiari, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich
The Behavior and Convergence of Local Bayesian Optimization
Kaiwen Wu, Kyurae Kim, Roman Garnett, Jacob R. Gardner
Non-Parametric Learning of Stochastic Differential Equations with Non-asymptotic Fast Rates of Convergence
Riccardo Bonalli, Alessandro Rudi
On the Convergence of Black-Box Variational Inference
Kyurae Kim, Jisu Oh, Kaiwen Wu, Yi-An Ma, Jacob R. Gardner