GPU Implementation

GPU implementation focuses on optimizing algorithms and model architectures for efficient execution on Graphics Processing Units, aiming to accelerate computationally intensive tasks. Current research emphasizes improving the speed and memory efficiency of various models, including Graph Neural Networks (GNNs), large language models (LLMs), and probabilistic circuits, often through techniques like low-rank approximations, optimized kernels, and data reuse strategies. These advancements significantly impact diverse fields, from accelerating scientific simulations and AI model training to enabling real-time processing in applications like autonomous navigation and image generation on resource-constrained devices.

Papers