Model Training
Research on model training aims to develop efficient and effective methods for producing accurate, robust machine learning models. Current work emphasizes training efficiency through low-precision computation, memory optimization (for example, recomputation and memory-aware scheduling), and communication-efficient strategies for distributed and federated learning. These advances are needed to scale training to larger datasets and more complex architectures, with applications ranging from computer vision and natural language processing to healthcare and industry.
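To make the low-precision point concrete, here is a minimal sketch of one commonly cited difficulty: very small gradient values can underflow to zero when stored in 16-bit floating point. The sketch uses loss scaling, a widely used mitigation that this page does not itself describe, and is not taken from any particular paper listed here. The helper names `to_fp16` and `fp16_gradient`, the example gradient value, and the scale factor of 2^16 are all illustrative assumptions.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value.

    Uses the standard library's 'e' (binary16) struct format, so no
    numerical library is required for this illustration.
    """
    return struct.unpack("<e", struct.pack("<e", x))[0]


def fp16_gradient(grad: float, loss_scale: float = 1.0) -> float:
    """Simulate storing a gradient in fp16 (hypothetical helper).

    The gradient is multiplied by loss_scale before being rounded to
    half precision, then divided by the same factor in full precision.
    """
    stored = to_fp16(grad * loss_scale)
    return stored / loss_scale


# Illustrative value, chosen to be smaller than fp16 can represent.
tiny_grad = 1e-8

# Without scaling, the value is rounded to zero in half precision.
unscaled = fp16_gradient(tiny_grad)

# With an assumed scale of 2**16, the value lands in fp16's normal range
# and is recovered approximately after unscaling.
scaled = fp16_gradient(tiny_grad, loss_scale=2.0 ** 16)

print("without loss scaling:", unscaled)
print("with loss scaling:   ", scaled)
```

In practice, mixed-precision training frameworks typically choose and adjust the scale factor automatically. The fixed value here only keeps the sketch short.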