GPU Implementation
GPU implementation focuses on optimizing algorithms and model architectures for efficient execution on Graphics Processing Units, aiming to accelerate computationally intensive tasks. Current research emphasizes improving the speed and memory efficiency of various models, including Graph Neural Networks (GNNs), large language models (LLMs), and probabilistic circuits, often through techniques like low-rank approximations, optimized kernels, and data reuse strategies. These advancements significantly impact diverse fields, from accelerating scientific simulations and AI model training to enabling real-time processing in applications like autonomous navigation and image generation on resource-constrained devices.
Papers
September 23, 2024
September 19, 2024
June 4, 2024
June 2, 2024
February 21, 2024
February 13, 2024
December 12, 2023
December 2, 2023
September 28, 2023
August 29, 2023
April 21, 2023
December 15, 2022
July 15, 2022
June 28, 2022
April 26, 2022
February 16, 2022