Early Stage Convergence
Research on early-stage convergence in machine learning studies the initial phases of training, with the aim of speeding up convergence and improving generalization. Current work examines this through various optimization algorithms (e.g., Adam, SGD, FedProx), model architectures (e.g., transformers, diffusion models), and specific problem domains (e.g., federated learning, collaborative filtering). These studies draw on techniques from dynamical systems theory and optimal transport to establish convergence guarantees and bounds, contributing to more efficient and robust machine learning systems across diverse applications.
Papers
December 20, 2021
December 15, 2021
December 11, 2021
December 9, 2021
December 5, 2021
December 3, 2021
December 1, 2021
November 30, 2021
November 29, 2021
November 28, 2021
November 25, 2021
November 24, 2021
November 22, 2021
November 18, 2021