Latency Training
Latency training focuses on reducing the time it takes to train machine learning models, particularly in resource-constrained or distributed settings such as federated learning. Current research emphasizes techniques such as model splitting, layer-wise updates, and dynamic latency adjustment to cut training time without sacrificing accuracy, often using transformer-based architectures or modifications to existing models. These advances matter for real-time applications such as speech recognition and recommendation systems, where low latency is essential to a good user experience.