Early Stage Convergence

Research on early stage convergence in machine learning studies the initial phases of training, with the aim of speeding up convergence and improving generalization. Current work approaches the question through optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies draw on dynamical systems theory and optimal transport to establish convergence guarantees and bounds, supporting more efficient and robust machine learning systems across diverse applications.

Papers

October 17, 2022

On the convergence of policy gradient methods to Nash equilibria in general stochastic games
Angeliki Giannou, Kyriakos Lotidis, Panayotis Mertikopoulos, Emmanouil-Vasileios Vlatakis-Gkaragkounis
Policy Gradient Early Stage Convergence Nash Equilibrium Stochastic Game Deterministic Policy Nash Equilibrium Policy

October 11, 2022

Divergence Results and Convergence of a Variance Reduced Version of ADAM
Ruiqi Wang, Diego Klabjan
Deep Neural Network Early Stage Convergence Stochastic Optimization Inverse Divergence Gradient Correction Variance Aware Adam Algorithm

October 8, 2022

Convergence of the Backward Deep BSDE Method with Applications to Optimal Stopping Problems
Chengfan Gao, Siping Gao, Ruimeng Hu, Zimu Zhu
Financial Application Early Stage Convergence Backward Stochastic Differential Equation Optimal Stopping Problem

September 30, 2022

September 29, 2022

September 7, 2022

September 6, 2022

Rates of Convergence for Regression with the Graph Poly-Laplacian
Nicolás García Trillos, Ryan Murray, Matthew Thorpe
Early Stage Convergence Novel Regression Graph Laplacian B Spline Universal Rate Laplacian Regularization

September 3, 2022

Suppressing Noise from Built Environment Datasets to Reduce Communication Rounds for Convergence of Federated Learning
Rahul Mishra, Hari Prabhat Gupta, Tanima Dutta, Sajal K. Das
Data Driven Early Stage Convergence Privacy Preserving Federated Learning Approach Building Dataset Noise Suppression Intelligent Sensing Communication Round

August 23, 2022

Naive Penalized Spline Estimators of Derivatives Achieve Optimal Rates of Convergence
Bright Antwi Boasiako, John Staudenmayer
Early Stage Convergence Optimal Rate Derivative Process

August 16, 2022

A Review of the Convergence of 5G/6G Architecture and Deep Learning
Olusola T. Odeyomi, Olubiyi O. Akintade, Temitayo O. Olowu, Gergely Zaruba
Deep Learning Artificial Intelligence Early Stage Convergence Deep Learning Technology

August 10, 2022

Convergence of denoising diffusion models under the manifold hypothesis
Valentin De Bortoli
Diffusion Model Generative Model Early Stage Convergence Audio Synthesis Manifold Hypothesis

July 28, 2022

Regret Minimization and Convergence to Equilibria in General-sum Markov Games
Liad Erez, Tal Lancewicki, Uri Sherman, Tomer Koren, Yishay Mansour
Early Stage Convergence Regret Minimization Sublinear Regret Markov Game Efficient Equilibrium General Sum Markov Game

July 21, 2022

Metropolis Monte Carlo sampling: convergence, localization transition and optimality
Alexei D. Chepelianskii, Satya N. Majumdar, Hendrik Schawe, Emmanuel Trizac
Large Scale Early Stage Convergence Random Walk Near Optimality Monte Carlo Sampling

July 3, 2022

On Convergence of Gradient Descent Ascent: A Tight Local Analysis
Haochuan Li, Farzan Farnia, Subhro Das, Ali Jadbabaie
Generative Adversarial Network Early Stage Convergence Convergence Rate Gradient Descent Ascent GAN Algorithm Stochastic Gradient Descent Ascent Tight Analysis

Convergence of weak-SINDy Surrogate Models

On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs

On the Convergence of AdaGrad(Norm) on $\mathbb{R}^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

Convergence of the mini-batch SIHT algorithm

Targeted Separation and Convergence with Kernel Discrepancies

$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games

Convergence of score-based generative modeling for general data distributions

On the Convergence of the ELBO to Entropy Sums

On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs