Cloud Platform

Cloud platforms are evolving to efficiently manage and allocate diverse computational resources, primarily driven by the increasing demands of machine learning and large-scale data analysis. Current research focuses on optimizing resource utilization through techniques like NPU virtualization, machine learning-based scheduling algorithms (including deep learning and genetic algorithms), and intelligent resource allocation strategies across heterogeneous hardware (e.g., CPUs, GPUs, NPUs). These advancements are crucial for improving the cost-effectiveness, performance, and reliability of cloud services across various applications, from AI model training and inference to scientific computing and IoT deployments.

Papers