Workload Aware
Workload-aware systems aim to optimize resource allocation and performance by dynamically adapting to the characteristics of incoming tasks. Current research focuses on applying machine learning, particularly reinforcement learning, to develop adaptive scheduling policies and resource allocation strategies across diverse applications, from cloud computing and large language model inference to key-value stores and edge computing. This approach promises significant improvements in efficiency, latency, and energy consumption in various computing environments, impacting both the design of future systems and the performance of existing ones.
Papers
November 18, 2024
June 3, 2024
April 25, 2024
March 20, 2024
March 9, 2024
December 27, 2023
October 11, 2023
August 14, 2023
June 1, 2023
February 28, 2023
October 24, 2022
February 21, 2022
January 19, 2022
December 14, 2021
November 22, 2021