Uncertainty Aware Deployment
Uncertainty-aware deployment focuses on reliably deploying machine learning models, particularly deep learning models like transformers and large language models (LLMs), in real-world applications where conditions may deviate from training data. Current research emphasizes techniques like mixed-precision quantization, contrastive learning, and Bayesian methods to improve model efficiency and robustness while quantifying and managing uncertainty. This work is crucial for bridging the gap between successful model training and dependable real-world performance across diverse hardware platforms and application domains, including robotics, edge computing, and wireless communication.
Papers
November 26, 2024
October 21, 2024
October 4, 2024
September 5, 2024
August 7, 2024
July 25, 2024
May 30, 2024
March 29, 2024
March 27, 2024
November 6, 2023
May 20, 2023
February 3, 2023
December 5, 2022
December 1, 2022
December 10, 2021