Cloud Platform
Cloud platforms are evolving to efficiently manage and allocate diverse computational resources, primarily driven by the increasing demands of machine learning and large-scale data analysis. Current research focuses on optimizing resource utilization through techniques like NPU virtualization, machine learning-based scheduling algorithms (including deep learning and genetic algorithms), and intelligent resource allocation strategies across heterogeneous hardware (e.g., CPUs, GPUs, NPUs). These advancements are crucial for improving the cost-effectiveness, performance, and reliability of cloud services across various applications, from AI model training and inference to scientific computing and IoT deployments.
Papers
October 24, 2024
August 7, 2024
February 27, 2024
February 16, 2024
January 13, 2024
November 9, 2023
September 12, 2023
July 4, 2023
April 10, 2023
March 12, 2023
October 12, 2022
October 1, 2022
August 2, 2022
July 20, 2022
March 27, 2022