Model Training
Research on model training develops methods for producing accurate and robust machine learning models at reasonable cost. Current work emphasizes training efficiency through low-precision computation, memory optimization (e.g., recomputing activations instead of storing them, and memory-aware scheduling), and communication-efficient strategies for distributed and federated learning. These advances are crucial for scaling training to larger datasets and more complex architectures, with applications ranging from computer vision and natural language processing to healthcare and industry.
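To make the recomputation idea concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not any particular paper's method: the toy network is a chain of scalar layers `tanh(w * x)`, and the names `layer`, `layer_grad`, `grad_full_storage`, `grad_with_recompute`, and the `segment` parameter are hypothetical. The forward pass keeps only the input of every `segment`-th layer; the backward pass recomputes the inputs inside each segment when it needs them, trading extra compute for fewer stored activations.

```python
import math


def layer(w, x):
    """Toy layer: a scalar tanh unit (assumed for illustration)."""
    return math.tanh(w * x)


def layer_grad(w, x):
    """Derivative of tanh(w * x) with respect to x."""
    y = math.tanh(w * x)
    return w * (1.0 - y * y)


def grad_full_storage(weights, x0):
    """Baseline: store every layer input during the forward pass."""
    inputs = []
    x = x0
    for w in weights:
        inputs.append(x)
        x = layer(w, x)
    grad = 1.0
    # Chain rule, applied from the last layer back to the first.
    for w, x_in in zip(reversed(weights), reversed(inputs)):
        grad *= layer_grad(w, x_in)
    return x, grad, len(inputs)


def grad_with_recompute(weights, x0, segment=4):
    """Store only segment-start inputs; recompute the rest on the way back."""
    checkpoints = {}
    x = x0
    for i, w in enumerate(weights):
        if i % segment == 0:
            checkpoints[i] = x  # keep one input per segment
        x = layer(w, x)
    out = x

    grad = 1.0
    peak = len(checkpoints)  # rough count of values held at once
    for start in sorted(checkpoints, reverse=True):
        end = min(start + segment, len(weights))
        # Recompute the inputs of this segment from its checkpoint.
        seg_inputs = []
        x = checkpoints[start]
        for i in range(start, end):
            seg_inputs.append(x)
            x = layer(weights[i], x)
        peak = max(peak, len(checkpoints) + len(seg_inputs))
        # Backpropagate through the segment in reverse order.
        for i in range(end - 1, start - 1, -1):
            grad *= layer_grad(weights[i], seg_inputs[i - start])
    return out, grad, peak


if __name__ == "__main__":
    weights = [0.5 + 0.1 * i for i in range(16)]
    print(grad_full_storage(weights, 0.3))
    print(grad_with_recompute(weights, 0.3, segment=4))
```

Both functions return the same output and gradient; they differ only in how many intermediate values are held at once. The third return value is a rough count of stored inputs, not a measurement of real memory use.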