Early Stage Convergence

Research on early stage convergence in machine learning studies the initial phases of training, with the aim of speeding up convergence and improving generalization. Current work approaches the problem through optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies draw on techniques from dynamical systems theory and optimal transport to establish convergence guarantees and bounds, contributing to more efficient and robust machine learning systems.
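As a rough illustration of what "early stage" behavior means in practice, the sketch below records the loss over the first few iterations of plain gradient descent and of Adam on a small, poorly conditioned quadratic. Everything here is an assumption made for illustration and is not taken from any of the listed papers: the objective, the curvature values in `a`, the step size, and the helper names `run_gd` and `run_adam` are all hypothetical.

```python
import math

def loss(x, a):
    # Quadratic objective f(x) = 0.5 * sum_i a_i * x_i^2 (assumed toy problem).
    return 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))

def grad(x, a):
    # Gradient of the quadratic: df/dx_i = a_i * x_i.
    return [ai * xi for ai, xi in zip(a, x)]

def run_gd(x0, a, lr, steps):
    # Plain (full-batch) gradient descent; returns the loss after each step.
    x = list(x0)
    history = [loss(x, a)]
    for _ in range(steps):
        g = grad(x, a)
        x = [xi - lr * gi for xi, gi in zip(x, g)]
        history.append(loss(x, a))
    return history

def run_adam(x0, a, lr, steps, beta1=0.9, beta2=0.999, eps=1e-8):
    # Adam with bias-corrected first and second moment estimates.
    x = list(x0)
    m = [0.0] * len(x)
    v = [0.0] * len(x)
    history = [loss(x, a)]
    for t in range(1, steps + 1):
        g = grad(x, a)
        m = [beta1 * mi + (1 - beta1) * gi for mi, gi in zip(m, g)]
        v = [beta2 * vi + (1 - beta2) * gi * gi for vi, gi in zip(v, g)]
        m_hat = [mi / (1 - beta1 ** t) for mi in m]
        v_hat = [vi / (1 - beta2 ** t) for vi in v]
        x = [xi - lr * mh / (math.sqrt(vh) + eps)
             for xi, mh, vh in zip(x, m_hat, v_hat)]
        history.append(loss(x, a))
    return history

if __name__ == "__main__":
    x0 = [1.0, 1.0]
    a = [1.0, 10.0]  # assumed curvatures; condition number 10
    gd_hist = run_gd(x0, a, lr=0.05, steps=20)
    adam_hist = run_adam(x0, a, lr=0.05, steps=20)
    for k in range(0, 21, 5):
        print(f"step {k:2d}  gd={gd_hist[k]:.4f}  adam={adam_hist[k]:.4f}")
```

Comparing the two loss histories over the first few steps gives a concrete, if simplistic, picture of the kind of transient behavior that convergence analyses of the initial training phase try to bound. A toy quadratic says nothing about non-convex or federated settings.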

Papers

February 8, 2022

Convergence of a New Learning Algorithm
Feng Lin
Neural Network, Early Stage Convergence, Back Propagation, Learning Algorithm

February 7, 2022

Finite-Sum Optimization: A New Perspective for Convergence to a Global Solution
Lam M. Nguyen, Trang H. Tran, Marten van Dijk
Neural Network, Deep Neural Network, Early Stage Convergence, New Perspective, Minimization Problem, Finite Sum Optimization, Global Solution, Gradient Evaluation

February 2, 2022

Temporal Heterogeneity Improves Speed and Convergence in Genetic Algorithms
Yoshio Martinez, Katya Rodriguez, Carlos Gershenson
Practical Algorithm, Early Stage Convergence, Genetic Algorithm, Speed Effect, Unobserved Heterogeneity, Crossover Probability

January 27, 2022

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning
Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding
Early Stage Convergence, Convergence Analysis, Heterogeneous Federated Learning, Local Heterogeneous Model

January 26, 2022

On the Convergence of mSGD and AdaGrad for Stochastic Optimization
Ruinan Jin, Yu Xing, Xingkang He
Gradient Descent, Stochastic Gradient Descent, Early Stage Convergence, Stochastic Optimization, Adaptive Gradient

January 25, 2022

January 18, 2022

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch
Policy Gradient, Early Stage Convergence, Gradient Flow, Mean Field, Neural Network Approximation, Entropy Regularized, Markov Decision Process

January 4, 2022

Survey on the Convergence of Machine Learning and Blockchain
Shengwen Ding, Chenhui Hu
Machine Learning, Timely Survey, Machine Learning Model, Early Stage Convergence, Blockchain Based Platform, Collaborative Machine Learning

December 21, 2021

A Theoretical View of Linear Backpropagation and Its Convergence
Ziang Li, Yiwen Guo, Haodi Liu, Changshui Zhang
Adversarial Attack, Adversarial Example, Early Stage Convergence, Back Propagation

December 20, 2021

Strong Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Markov Jump Linear Systems
Borna Sayedana, Mohammad Afshari, Peter E. Caines, Aditya Mahajan
Early Stage Convergence, Strong Consistency, Least Square, System Identification, Universal Rate, Autonomous Dynamical System, Markov Jump

December 15, 2021

On the Convergence and Robustness of Adversarial Training
Yisen Wang, Xingjun Ma, James Bailey, Jinfeng Yi, Bowen Zhou, Quanquan Gu
Native Robustness, Adversarial Example, Adversarial Training, Early Stage Convergence, Better Robustness, Inner Optimization

December 11, 2021

Convergence of Generalized Belief Propagation Algorithm on Graphs with Motifs
Yitao Chen, Deepanshu Vasal
Graph Drawing, Early Stage Convergence, Message Passing, Belief Propagation, Jamdani Motif

December 9, 2021

On Convergence of Federated Averaging Langevin Dynamics
Wei Deng, Qian Zhang, Yi-An Ma, Zhao Song, Guang Lin
Early Stage Convergence, Langevin Dynamic, Correlated Noise, Gradient Noise

December 5, 2021

On the Convergence of Shallow Neural Network Training with Randomly Masked Neurons
Fangshuo Liao, Anastasios Kyrillidis
Neural Network, Early Stage Convergence, Shallow Neural Network, Random Masking

December 3, 2021

Regularized Newton Method with Global $O(1/k^2)$ Convergence
Konstantin Mishchenko
Early Stage Convergence, World Event, Hessian Matrix, Convex Objective, Levenberg Marquardt, Cubic Regularization, Newton Type Method

November 30, 2021

A Comprehensive Survey on the Convergence of Vehicular Social Networks and Fog Computing
Farimasadat Miri, Richard Pazzi
Comprehensive Survey, Early Stage Convergence, Vehicular Network, Fog Computing