Runtime Elastic Tensor Selection

Runtime elastic tensor selection focuses on optimizing the efficiency of tensor computations, particularly within machine learning models, by dynamically choosing which parts of a model are actively used during training or inference. Current research explores techniques like reinforcement learning-based auto-schedulers and asynchronous multi-model approaches to achieve this dynamic selection, aiming to improve training speed, reduce energy consumption, and enhance inference performance. This research is significant because it addresses the computational bottlenecks inherent in large-scale machine learning, impacting both the development of more efficient algorithms and the deployment of AI applications on resource-constrained devices.

Papers

December 21, 2023

ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang, Boyuan Yang, Wei Gao
Device Training Elastic Net Trainable Layer Offline Training Runtime Elastic Tensor Selection

May 22, 2023

Asynchronous Multi-Model Dynamic Federated Learning over Wireless Networks: Theory, Modeling, and Optimization
Zhan-Lun Chang, Seyyedali Hosseinalipour, Mung Chiang, Christopher G. Brinton
Machine Learning Optimization Purpose Theoretical Understanding Wireless Network Convergence Guarantee Asynchronous Federated Learning Asynchronous Update Runtime Elastic Tensor Selection

November 21, 2022

HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
Zining Zhang, Bingsheng He, Zhenjie Zhang
Reinforcement Learning Neural Network Tensor Program Heterogeneous Multi Agent Reinforcement Learning Runtime Elastic Tensor Selection

September 10, 2022

Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem
Yuki Asada, Victor Fu, Apurva Gandhi, Advitya Gemawat, Lihao Zhang, Dong He, Vivek Gupta, Ehi Nosakhare, Dalitso Banda, Rathijit Sen, Matteo Interlandi
Single GPU New Database Tensor Program Query Processing Runtime Elastic Tensor Selection

March 3, 2022

Query Processing on Tensor Computation Runtimes
Dong He, Supun Nakandala, Dalitso Banda, Rathijit Sen, Karla Saur, Kwanghyun Park, Carlo Curino, Jesús Camacho-Rodríguez, Konstantinos Karanasos, Matteo Interlandi
Tensor Data Tensor Program Tensor Compiler Query Processing Runtime Elastic Tensor Selection

Runtime Elastic Tensor Selection

Papers

ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection

Asynchronous Multi-Model Dynamic Federated Learning over Wireless Networks: Theory, Modeling, and Optimization

HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks

Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem

Query Processing on Tensor Computation Runtimes