GPU Cluster
GPU clusters are high-performance computing systems that pool many GPUs to accelerate computationally intensive tasks, particularly in deep learning. Current research focuses on improving resource utilization within these clusters. Key challenges include scheduling diverse workloads efficiently (including large language models and graph neural networks), minimizing communication overhead, and managing heterogeneous hardware configurations. Progress on these problems enables faster training and inference for increasingly complex models, which benefits both scientific discovery and practical applications in AI and other computationally demanding fields.