Heavy Ball

The heavy ball method, a momentum-based optimization algorithm, aims to accelerate convergence in solving large-scale optimization problems, particularly prevalent in machine learning. Current research focuses on analyzing its convergence rates under various conditions, including stochasticity, data heterogeneity, and different step-size and momentum parameter selection strategies, often within the context of stochastic gradient descent and its variants applied to neural networks. These investigations are crucial for improving the efficiency and robustness of training complex models, impacting fields like deep learning and federated learning where computational cost and data distribution are significant challenges. Understanding the implicit regularization properties of heavy ball methods is also a key area of ongoing research.

Papers