Early Stage Convergence

Early stage convergence in machine learning focuses on understanding and improving the initial phases of training algorithms, aiming to accelerate convergence speed and enhance generalization performance. Current research investigates this through the lens of various optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies leverage techniques from dynamical systems theory and optimal transport to establish convergence guarantees and bounds, ultimately contributing to more efficient and robust machine learning systems across diverse applications.

Papers

August 11, 2024

On the Convergence of a Federated Expectation-Maximization Algorithm
Zhixu Tao, Rajita Chandak, Sanjeev Kulkarni
Early Stage Convergence Convergence Rate Expectation Maximization Federated Learning Algorithm Federated Automatic Differentiation

August 9, 2024

Interventional Causal Structure Discovery over Graphical Models with Convergence and Optimality Guarantees
Qiu Chengbo, Yang Kai
Early Stage Convergence Causal Structure Graphical Model Causal Structure Learning Optimality Guarantee Causal Structure Discovery

August 3, 2024

Can LLMs predict the convergence of Stochastic Gradient Descent?
Oussama Zekri, Abdelhakim Benechehab, Ievgen Redko
Large Language Model Medical LLM Stochastic Gradient Descent Early Stage Convergence Markov Chain Non Convex Optimization Large Deep Learning Model

July 20, 2024

Universally Harmonizing Differential Privacy Mechanisms for Federated Learning: Boosting Accuracy and Convergence
Shuya Feng, Meisam Mohammady, Hanbin Hong, Shenao Yan, Ashish Kundu, Binghui Wang, Yuan Hong
Differential Privacy Early Stage Convergence Privacy Guarantee Inference Attack Differential Privacy Mechanism FL Framework

July 4, 2024

C$^3$DG: Conditional Domain Generalization for Hyperspectral Imagery Classification with Convergence and Constrained-risk Theories
Zhe Gao, Bin Pan, Zhenwei Shi
Hyperspectral Image Image Restoration Early Stage Convergence Hyperspectral Image Classification Single Pixel Infrared Hyperspectral Imaging Constrained Risk

July 3, 2024

Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks
Xianliang Xu, Ting Du, Wang Kong, Ye Li, Zhongyi Huang
Neural Network Physic Informed Neural Network Early Stage Convergence Two Layer Neural Network Gradient Sharing Smooth Activation Function

June 29, 2024

Weighted mesh algorithms for general Markov decision processes: Convergence and tractability
Denis Belomestny, John Schoenmakers
Markov Decision Process Early Stage Convergence State Space Discrete Time Finite Horizon Infinite Horizon Moral Tractability Adaptive Meshing

June 26, 2024

Innovating for Tomorrow: The Convergence of SE and Green AI
Luís Cruz, Xavier Franch Gutierrez, Silverio Martínez-Fernández
Artificial Intelligence Early Stage Convergence Software Engineering Software System Green AI

June 18, 2024

On the Convergence of T\^atonnement for Linear Fisher Markets
Tianlong Nan, Yuan Gao, Christian Kroer
Early Stage Convergence Price Prediction Equilibrium Price Fisher Market

June 16, 2024

On Convergence Analysis of Policy Iteration Algorithms for Entropy-Regularized Stochastic Control Problems
Jin Ma, Gaozhan Wang, Jianfeng Zhang
Early Stage Convergence Policy Improvement Universal Rate Iterative Method Generalized Policy Improvement Diffusion Control

June 2, 2024

Augmenting the FedProx Algorithm by Minimizing Convergence
Anomitra Sarkar, Lavanya Vajpayee
Early Stage Convergence IOT Technology Proximity Search FedProx Algorithm

June 1, 2024

May 31, 2024

Improving Generalization and Convergence by Enhancing Implicit Regularization
Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu
Strong Generalization Early Stage Convergence Generalization Performance Sharpness Aware Minimization Implicit Regularization Regularization Property Sharpness Reduction

May 29, 2024

MGDA Converges under Generalized Smoothness, Provably
Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi Ji
Early Stage Convergence Multi Objective Optimization Multi Objective Gradient Norm Smooth Loss Function Generalized Smoothness

May 27, 2024

Convergence of SGD with momentum in the nonconvex case: A time window-based analysis
Junwen Qiu, Bohao Ma, Andre Milzarek
Stochastic Gradient Descent Early Stage Convergence Convergence Analysis Residual Momentum General Nonconvex Local Convergence Convergence Behavior

May 21, 2024

On Convergence of the Alternating Directions SGHMC Algorithm
Soumyadip Ghosh, Yingdong Lu, Tomasz Nowicki
Early Stage Convergence Convergence Rate Markov Chain Hamiltonian Monte Carlo Hamiltonian Dynamic Novel Single Integrator

May 13, 2024

Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent
Michael Kohler, Adam Krzyzak, Benjamin Walter
Neural Network Deep Learning General Analysis Gradient Descent Image Classification Early Stage Convergence Universal Rate Global Pooling

May 12, 2024

A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights
Davide Legacci, Panayotis Mertikopoulos, Bary Pradelski
Early Stage Convergence Non Cooperative Game Temporal Difference Type Recurrence Game Dynamic Game Tree Exponential Weight

May 3, 2024

Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence
Yancheng Huang, Kai Yang, Zelin Zhu, Leian Chen
Early Stage Convergence Near Optimality Online Change Detection Synchronous Algorithm

Early Stage Convergence

Papers

On the Convergence of a Federated Expectation-Maximization Algorithm

Interventional Causal Structure Discovery over Graphical Models with Convergence and Optimality Guarantees

Can LLMs predict the convergence of Stochastic Gradient Descent?

Universally Harmonizing Differential Privacy Mechanisms for Federated Learning: Boosting Accuracy and Convergence

C$^3$DG: Conditional Domain Generalization for Hyperspectral Imagery Classification with Convergence and Constrained-risk Theories

Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks

Weighted mesh algorithms for general Markov decision processes: Convergence and tractability

Innovating for Tomorrow: The Convergence of SE and Green AI

On the Convergence of T\^atonnement for Linear Fisher Markets

On Convergence Analysis of Policy Iteration Algorithms for Entropy-Regularized Stochastic Control Problems

Augmenting the FedProx Algorithm by Minimizing Convergence

Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Understanding the Convergence in Balanced Resonate-and-Fire Neurons

Improving Generalization and Convergence by Enhancing Implicit Regularization

MGDA Converges under Generalized Smoothness, Provably

Convergence of SGD with momentum in the nonconvex case: A time window-based analysis

On Convergence of the Alternating Directions SGHMC Algorithm

Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent

A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights

Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence