Stochastic Approximation
Stochastic approximation (SA) is an iterative method for finding the root of an operator from noisy observations; it is central to optimization problems where exact gradients are unavailable. Current research emphasizes improving SA's efficiency and robustness, with a focus on variance-reduction techniques, adaptive step sizes, and handling Markovian noise and delays in settings such as distributed and federated learning, reinforcement learning, and temporal-difference learning. These advances yield algorithms with stronger convergence guarantees and broader applicability to large-scale, real-world optimization problems.
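The basic idea can be sketched with the classical Robbins–Monro iteration: move the iterate against a noisy evaluation of the operator, with a diminishing step size. This is a minimal illustration, not any specific algorithm from the papers below; the target function, noise model, and step-size constant are all assumed for the example.

```python
import random

def robbins_monro(noisy_f, x0, n_iters=5000, a=1.0):
    """Find x* with E[noisy_f(x*)] = 0 using only noisy evaluations.

    Step sizes a_n = a / (n + 1) satisfy the classical conditions
    sum(a_n) = inf and sum(a_n**2) < inf required for convergence.
    """
    x = x0
    for n in range(n_iters):
        step = a / (n + 1)
        x = x - step * noisy_f(x)  # move against the noisy operator value
    return x

# Hypothetical example: f(x) = x - 2 (root at 2), observed through
# additive Gaussian noise; the seed is fixed for reproducibility.
random.seed(0)
noisy_f = lambda x: (x - 2.0) + random.gauss(0.0, 0.5)

estimate = robbins_monro(noisy_f, x0=0.0)
```

With this step-size schedule the iterate is effectively an average of the noisy observations, so the noise washes out and the estimate concentrates around the true root as the iteration count grows.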
Papers
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Shangtong Zhang, Remi Tachet, Romain Laroche
Model-Free Risk-Sensitive Reinforcement Learning
Grégoire Delétang, Jordi Grau-Moya, Markus Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega