Paper ID: 2409.07401

Convergence of continuous-time stochastic gradient descent with applications to linear deep neural networks

Gábor Lugosi, Eulàlia Nualart

We study a continuous-time approximation of the stochastic gradient descent process for minimizing the expected loss in learning problems. The main results establish general sufficient conditions for convergence, extending those of Chatterjee (2022), which were established for (nonstochastic) gradient descent. We show how the main result applies to the training of overparametrized linear neural networks.
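As an illustrative sketch (not the paper's construction), the continuous-time SGD process can be thought of as a stochastic differential equation dθ_t = -∇L(θ_t) dt + σ dW_t, which one can simulate with an Euler–Maruyama discretization. The example below does this for a two-layer overparametrized linear network f(x) = W2 W1 x trained against a hypothetical linear teacher A_star under the population squared loss; the step size dt, the diffusion scale sigma, and all dimensions are illustrative choices, and none of the paper's actual convergence conditions are implemented here.

import numpy as np

# Minimal sketch: continuous-time SGD as the SDE
#   d theta_t = -grad L(theta_t) dt + sigma dW_t,
# simulated by Euler--Maruyama for an overparametrized
# two-layer *linear* network f(x) = W2 @ W1 @ x.
# All parameter values below are illustrative assumptions.

rng = np.random.default_rng(0)

d_in, d_hidden, d_out = 5, 20, 3         # overparametrized: d_hidden > d_in, d_out
A_star = rng.normal(size=(d_out, d_in))  # hypothetical linear teacher

W1 = 0.1 * rng.normal(size=(d_hidden, d_in))
W2 = 0.1 * rng.normal(size=(d_out, d_hidden))

dt = 1e-3      # Euler--Maruyama step size (discretizes continuous time)
sigma = 0.05   # diffusion scale standing in for gradient noise
T = 20.0       # time horizon

def loss(W1, W2):
    # Population squared loss E ||(W2 W1 - A_star) x||^2 with x ~ N(0, I),
    # which equals the squared Frobenius norm of the residual map.
    R = W2 @ W1 - A_star
    return np.sum(R ** 2)

n_steps = int(T / dt)
for step in range(n_steps):
    R = W2 @ W1 - A_star       # residual of the end-to-end map
    grad_W1 = 2 * W2.T @ R     # dL/dW1
    grad_W2 = 2 * R @ W1.T     # dL/dW2
    # drift (negative gradient) plus a Brownian increment of variance dt
    W1 += -grad_W1 * dt + sigma * np.sqrt(dt) * rng.normal(size=W1.shape)
    W2 += -grad_W2 * dt + sigma * np.sqrt(dt) * rng.normal(size=W2.shape)

print(f"final loss at T={T}: {loss(W1, W2):.4f}")

The noise term scales with sqrt(dt), so as dt shrinks the iterates track the SDE rather than the deterministic gradient flow; setting sigma = 0 recovers (discretized) nonstochastic gradient descent, the setting of Chatterjee (2022) that the paper extends.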

Submitted: Sep 11, 2024