Tensor Program

Tensor program optimization focuses on automatically generating efficient code for executing deep learning models on diverse hardware platforms, aiming to maximize performance and minimize development time. Current research emphasizes developing novel compiler techniques, including advanced auto-tuning strategies (like reinforcement learning and probabilistic programming), and efficient cost models to predict performance across different hardware and model architectures (e.g., transformers, ResNets). These advancements significantly impact the deployment of large-scale machine learning models by accelerating inference and training, ultimately enabling broader access to powerful AI applications.

Papers

March 7, 2022

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao
Neural Network Pytorch Model New Hyperparameter Tensor Program

March 3, 2022

Query Processing on Tensor Computation Runtimes
Dong He, Supun Nakandala, Dalitso Banda, Rathijit Sen, Karla Saur, Kwanghyun Park, Carlo Curino, Jesús Camacho-Rodríguez, Konstantinos Karanasos, Matteo Interlandi
Tensor Data Tensor Program Tensor Compiler Query Processing Runtime Elastic Tensor Selection

January 15, 2022

Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization
Zhihe Zhao, Xian Shuai, Yang Bai, Neiwen Ling, Nan Guan, Zhenyu Yan, Guoliang Xing
Domain Adaptation Hardware Accelerator Tensor Program Benchmarking Deep Learning

January 14, 2022

Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program Code Generation
Perry Gibson, José Cano
Task Scheduling Tensor Program Pre Trained Deep Neural Network Adaptive Scheduling

Tensor Program

Papers

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Query Processing on Tensor Computation Runtimes

Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization

Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program Code Generation