GPU Cluster
GPU clusters are high-performance computing systems that accelerate computationally intensive tasks, particularly deep learning. Current research focuses on improving resource utilization within these clusters: scheduling diverse workloads efficiently (including large language models and graph neural networks), minimizing communication overhead, and managing heterogeneous hardware configurations. Better utilization enables faster training and inference of increasingly complex models, which benefits both scientific discovery and practical applications in AI and other computationally demanding fields.