Early Stage Convergence

Early stage convergence in machine learning focuses on understanding and improving the initial phases of training algorithms, aiming to accelerate convergence speed and enhance generalization performance. Current research investigates this through the lens of various optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies leverage techniques from dynamical systems theory and optimal transport to establish convergence guarantees and bounds, ultimately contributing to more efficient and robust machine learning systems across diverse applications.

Papers

September 23, 2023

RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration
Guotian Zeng, Bi Zeng, Hong Zhang, Jianqi Liu, Qingmao Wei
Early Stage Convergence Single Object Tracking State of the Art Tracker Visual Object

September 21, 2023

Convergence and Recovery Guarantees of Unsupervised Neural Networks for Inverse Problems
Nathan Buskulic, Jalal Fadili, Yvain Quéau
Neural Network Inverse Problem Early Stage Convergence Overparametrization Bound Recovery Guarantee Network Prior

September 15, 2023

A Theoretical and Empirical Study on the Convergence of Adam with an "Exact" Constant Step Size in Non-Convex Settings
Alokendu Mazumder, Rishabh Sabharwal, Manan Tayal, Bhartendu Kumar, Punit Rathore
Empirical Study Early Stage Convergence Objective Function Gradient Norm Convex Function Non Convex Optimization Step Size Adaptive Gradient

September 14, 2023

Rates of Convergence in Certain Native Spaces of Approximations used in Reinforcement Learning
Ali Bouland, Shengyuan Niu, Sai Tej Paruchuri, Andrew Kurdila, John Burns, Eugenio Schuster
Reinforcement Learning Early Stage Convergence Optimal Control Average Approximation Value Function Kernel Hilbert Space Universal Rate Convergence Rate Analysis

September 12, 2023

Convergence of Gradient-based MAML in LQR
Negin Musavi, Geir E. Dullerud
Reinforcement Learning Early Stage Convergence Convergence Guarantee Local Convergence Model Agnostic Meta Learning Quadratic Control

September 11, 2023

Generalized Graphon Process: Convergence of Graph Frequencies in Stretched Cut Distance
Xingchao Jian, Feng Ji, Wee Peng Tay
Early Stage Convergence Sparse Graph Random Graph Dense Graph Graphon Estimation Graph Frequency Match Cut

September 3, 2023

Modified Step Size for Enhanced Stochastic Gradient Descent: Convergence and Experiments
M. Soheil Shamaee, S. Fathi Hafshejani
Gradient Descent Stochastic Gradient Descent Early Stage Convergence Optical Experiment Step Size

September 1, 2023

How Does Forecasting Affect the Convergence of DRL Techniques in O-RAN Slicing?
Ahmad M. Nagib, Hatem Abou-Zeid, Hossam S. Hassanein
Deep Reinforcement Learning Early Stage Convergence State of the Art Forecasting Virtual Reality Deep Learning Based Slice by Slice RAN Architecture Radio Access Network Slicing

August 16, 2023

Convergence of Two-Layer Regression with Nonlinear Units
Yichuan Deng, Zhao Song, Shenghao Xie
Large Language Model Loss Function Early Stage Convergence Softmax Function Attention Computation Layer Regression

August 7, 2023

Implicit Graph Neural Diffusion Networks: Convergence, Generalization, and Over-Smoothing
Guoji Fu, Mohammed Haroon Dupty, Yanfei Dong, Lee Wee Sun
Strong Generalization Early Stage Convergence Graph Neural Graph Laplacian Implicit Neural Network Undisciplined Over Smoothing Implicit Graph Neural Network

July 30, 2023

On Neural Network approximation of ideal adversarial attack and convergence of adversarial training
Rajdeep Haldar, Qifan Song
Adversarial Attack Adversarial Training Early Stage Convergence Adversarial Loss Learned Function Neural Network Approximation Optimal Attack

July 26, 2023

Stability of Multi-Agent Learning: Convergence in Network Games with Many Players
Aamal Hussain, Dan Leonte, Francesco Belardinelli, Georgios Piliouras
Early Stage Convergence Core Stability Multi Agent Learning Zero Sum Game Two Player Stable Learning Convergence Behavior

July 23, 2023

DyPP: Dynamic Parameter Prediction to Accelerate Convergence of Variational Quantum Algorithms
Satwik Kundu, Debarshi Kundu, Swaroop Ghosh
Early Stage Convergence Quantum Neural Network Variational Quantum Algorithm Variational Quantum Quantum Approximate Optimization Algorithm Quantum Simulator

July 21, 2023

July 20, 2023

July 10, 2023

Invex Programs: First Order Algorithms and Their Convergence
Adarsh Barik, Suvrit Sra, Jean Honorio
Early Stage Convergence Gradient Method First Order Algorithm First Order Gradient

July 5, 2023

Convergence of Communications, Control, and Machine Learning for Secure and Autonomous Vehicle Navigation
Tengchan Zeng, Aidin Ferdowsi, Omid Semiari, Walid Saad, Choong Seon Hong
Machine Learning Autonomous Vehicle External Control Early Stage Convergence Autonomous Navigation Timely Communication Autonomous Navigation System Adaptive Controller CAV Network Connected and Automated Vehicle

June 23, 2023

A new approach to generalisation error of machine learning algorithms: Estimates and convergence
Michail Loulakis, Charalambos G. Makridakis
Neural Network Deep Learning Estimation Task Early Stage Convergence Novel Approach Machine Learning Algorithm Generalization Error Learned Function Neural Interpolation

Early Stage Convergence

Papers

RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration

Convergence and Recovery Guarantees of Unsupervised Neural Networks for Inverse Problems

A Theoretical and Empirical Study on the Convergence of Adam with an "Exact" Constant Step Size in Non-Convex Settings

Rates of Convergence in Certain Native Spaces of Approximations used in Reinforcement Learning

Convergence of Gradient-based MAML in LQR

Generalized Graphon Process: Convergence of Graph Frequencies in Stretched Cut Distance

Modified Step Size for Enhanced Stochastic Gradient Descent: Convergence and Experiments

How Does Forecasting Affect the Convergence of DRL Techniques in O-RAN Slicing?

Convergence of Two-Layer Regression with Nonlinear Units

Implicit Graph Neural Diffusion Networks: Convergence, Generalization, and Over-Smoothing

On Neural Network approximation of ideal adversarial attack and convergence of adversarial training

Stability of Multi-Agent Learning: Convergence in Network Games with Many Players

DyPP: Dynamic Parameter Prediction to Accelerate Convergence of Variational Quantum Algorithms

Convergence of SGD for Training Neural Networks with Sliced Wasserstein Losses

Beyond Convergence: Identifiability of Machine Learning and Deep Learning Models

On the Convergence of Bounded Agents

Convergence of Adam for Non-convex Objectives: Relaxed Hyperparameters and Non-ergodic Case

Invex Programs: First Order Algorithms and Their Convergence

Convergence of Communications, Control, and Machine Learning for Secure and Autonomous Vehicle Navigation

A new approach to generalisation error of machine learning algorithms: Estimates and convergence