Model Training
Model training focuses on developing efficient and effective methods for creating accurate and robust machine learning models. Current research emphasizes improving training efficiency through techniques like low-precision computation, optimized memory management (e.g., using recomputation and memory-aware scheduling), and efficient communication strategies in distributed and federated learning settings. These advancements are crucial for scaling model training to larger datasets and more complex architectures, impacting various fields from computer vision and natural language processing to healthcare and industrial applications.
Papers
August 9, 2023
July 29, 2023
July 4, 2023
July 2, 2023
June 29, 2023
June 21, 2023
June 16, 2023
June 3, 2023
May 24, 2023
May 19, 2023
May 16, 2023
May 4, 2023
April 21, 2023
April 8, 2023
March 24, 2023
March 21, 2023
February 27, 2023
February 19, 2023
February 16, 2023