Optimistic Exploration
Optimistic exploration in reinforcement learning aims to discover high-reward actions efficiently in complex environments by prioritizing states and actions whose value is uncertain but potentially high. Current research focuses on improving sample efficiency through techniques such as Thompson sampling, scaling model capacity with regularization, and decoupling exploration from exploitation by using separate optimistic and pessimistic actors. These advances target faster learning in challenging settings, such as continuous control tasks with sparse rewards or safety constraints, and report improved performance in robotics and other applications.
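To make the idea concrete, here is a minimal sketch on a Bernoulli multi-armed bandit, a much simpler setting than the continuous control tasks mentioned above. It contrasts an explicit optimism bonus (the classic UCB1 rule) with Thompson sampling, which plays the arm whose sampled posterior mean is highest. The function names, the arm probabilities, and the Beta(1, 1) prior are illustrative assumptions and are not taken from any of the listed papers.

```python
import math
import random


def ucb1_select(counts, means, t, c=2.0):
    """Pick the arm with the highest optimistic estimate: mean + uncertainty bonus."""
    # An untried arm has unbounded uncertainty, so play each arm once first.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    # The bonus shrinks as an arm is pulled more often (UCB1 uses c = 2).
    scores = [m + math.sqrt(c * math.log(t) / n) for m, n in zip(means, counts)]
    return max(range(len(scores)), key=scores.__getitem__)


def thompson_select(successes, failures, rng):
    """Sample a plausible mean per arm from its Beta posterior; play the best sample."""
    # Assumed prior: Beta(1, 1), i.e. uniform over each arm's success probability.
    samples = [rng.betavariate(1 + s, 1 + f) for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=samples.__getitem__)


def run_bandit(true_probs, steps, strategy, seed=0):
    """Simulate a Bernoulli bandit and return how often each arm was pulled."""
    rng = random.Random(seed)
    k = len(true_probs)
    counts = [0] * k      # pulls per arm
    successes = [0] * k   # rewards of 1 per arm
    for t in range(1, steps + 1):
        if strategy == "ucb1":
            means = [s / n if n else 0.0 for s, n in zip(successes, counts)]
            arm = ucb1_select(counts, means, t)
        elif strategy == "thompson":
            failures = [n - s for n, s in zip(counts, successes)]
            arm = thompson_select(successes, failures, rng)
        else:
            raise ValueError(f"unknown strategy: {strategy}")
        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        successes[arm] += reward
    return counts


if __name__ == "__main__":
    # Hypothetical arm probabilities chosen only for illustration.
    for name in ("ucb1", "thompson"):
        print(name, run_bandit([0.1, 0.5, 0.9], steps=2000, strategy=name))
```

Both rules concentrate pulls on the arm that looks best while still revisiting arms whose estimates remain uncertain. In deep reinforcement learning the same principle is typically applied to learned value estimates rather than per-arm counts, so this sketch illustrates the principle only and is not one of the methods in the papers.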
Papers
October 7, 2024
May 25, 2024
December 26, 2023
December 19, 2023
October 25, 2023
October 11, 2023
June 13, 2023
March 16, 2023
March 3, 2023
November 16, 2022
July 29, 2022
May 16, 2022
April 5, 2022
December 1, 2021