Global Convergence Rate

Global convergence rate research focuses on determining how quickly optimization algorithms reach a global optimum, a crucial aspect for the efficiency and scalability of machine learning models. Current investigations analyze this rate across various architectures, including deep equilibrium models and over-parameterized neural networks with ReLU activations, and across algorithms such as Sharpness-Aware Minimization and Frank-Wolfe, often examining the impact of factors like perturbation size, activation functions, and data dimensionality. Understanding these rates is vital for improving both the practical performance and the theoretical grounding of numerous machine learning applications, particularly in high-dimensional settings and distributed environments like federated learning. The field is actively refining convergence bounds and exploring techniques to accelerate convergence, especially in challenging non-convex optimization landscapes.
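
To make the notion of a global convergence rate concrete, the following minimal sketch (not drawn from any of the surveyed papers; the toy problem and dimensions are illustrative assumptions) runs Frank-Wolfe on a smooth convex least-squares objective over the probability simplex, where the classical global rate f(x_t) - f* = O(1/t) applies.

```python
import numpy as np

# Toy problem: minimize 0.5 * ||A x - b||^2 over the probability simplex.
# The optimum is placed inside the simplex by construction, so f* = 0 and
# the printed objective value is the suboptimality gap, which shrinks at
# the classical O(1/t) Frank-Wolfe rate.
rng = np.random.default_rng(0)
A = rng.standard_normal((50, 20))
b = A @ (np.ones(20) / 20)

def f(x):
    r = A @ x - b
    return 0.5 * r @ r

def grad(x):
    return A.T @ (A @ x - b)

x = np.zeros(20)
x[0] = 1.0                      # start at a vertex of the simplex
for t in range(1, 201):
    g = grad(x)
    s = np.zeros_like(x)
    s[np.argmin(g)] = 1.0       # linear minimization oracle over the simplex
    gamma = 2.0 / (t + 2)       # standard open-loop step size
    x = (1 - gamma) * x + gamma * s
    if t % 50 == 0:
        print(f"iter {t:3d}  suboptimality ~ {f(x):.6f}")
```

Tracking the printed gap against 1/t is a simple way to see a theoretical global rate reflected empirically; analyses of the algorithms mentioned above follow the same pattern with problem-specific constants and assumptions.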

Papers