Conservative Value Estimation
Conservative value estimation aims to mitigate overestimation in machine learning models by producing lower-bound estimates of values or likelihoods. It is studied mainly in reinforcement learning, with related work in areas such as automatic speech recognition. Current research emphasizes techniques such as conservative Q-learning, density estimation, and data filtering to handle out-of-distribution data and improve robustness, often using neural networks and Bayesian methods. This work matters for the reliability and safety of AI systems in applications ranging from robotics and autonomous driving to natural language processing, where overconfidence can lead to undesirable or even dangerous outcomes.
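To make the idea of a conservative Q-learning penalty concrete, here is a minimal sketch, not taken from any of the papers listed on this page. It assumes a discrete action space and the commonly used log-sum-exp form of the regularizer. The function names (`logsumexp`, `cql_penalty`, `conservative_loss`) and the weight `alpha` are hypothetical, chosen only for illustration.

```python
import math

def logsumexp(xs):
    # Numerically stable log(sum(exp(x))) over one state's action values.
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))

def cql_penalty(q_rows, data_actions):
    # q_rows[i] holds Q(s_i, a) for every action a (illustrative layout).
    # data_actions[i] is the action actually taken in the dataset at s_i.
    # The penalty pushes down Q-values over all actions (soft maximum)
    # while pushing up the Q-value of the action seen in the data.
    gaps = [logsumexp(q) - q[a] for q, a in zip(q_rows, data_actions)]
    return sum(gaps) / len(gaps)

def conservative_loss(q_rows, data_actions, td_targets, alpha=1.0):
    # Standard squared TD error on dataset actions plus the weighted penalty.
    td = sum(
        0.5 * (q[a] - y) ** 2
        for q, a, y in zip(q_rows, data_actions, td_targets)
    ) / len(td_targets)
    return td + alpha * cql_penalty(q_rows, data_actions)
```

Because the log-sum-exp is never smaller than any single Q-value, the penalty is non-negative and grows when actions absent from the data receive high estimates, which is the overestimation the summary above describes. A larger `alpha` gives a more pessimistic estimate.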
Papers
July 18, 2024
June 6, 2024
January 16, 2024
September 22, 2023
August 7, 2023
July 20, 2023
April 21, 2023
February 26, 2023
February 14, 2023
December 8, 2022
October 28, 2022
October 7, 2022
September 27, 2022
June 6, 2022