Tensor Program
Tensor program optimization focuses on automatically generating efficient code for executing deep learning models on diverse hardware platforms, aiming to maximize performance and minimize development time. Current research emphasizes developing novel compiler techniques, including advanced auto-tuning strategies (like reinforcement learning and probabilistic programming), and efficient cost models to predict performance across different hardware and model architectures (e.g., transformers, ResNets). These advancements significantly impact the deployment of large-scale machine learning models by accelerating inference and training, ultimately enabling broader access to powerful AI applications.
Papers
October 7, 2024
July 31, 2024
June 13, 2024
June 5, 2024
May 9, 2024
February 4, 2024
November 16, 2023
November 1, 2023
October 3, 2023
September 16, 2023
August 3, 2023
April 11, 2023
February 16, 2023
November 21, 2022
November 7, 2022
October 18, 2022
September 10, 2022
August 2, 2022
July 9, 2022