Offline to Online Reinforcement Learning
Offline-to-online reinforcement learning (RL) aims to improve the efficiency and safety of RL by leveraging pre-trained policies from offline datasets for online fine-tuning. Current research focuses on addressing challenges like distribution shift between offline and online data, improving the accuracy of Q-value estimation, and developing more robust exploration strategies, often employing techniques like diffusion models, Bayesian methods, and ensemble approaches. This hybrid approach holds significant promise for real-world applications where data acquisition is expensive or risky, enabling more efficient and reliable learning in domains such as robotics and autonomous systems.
Papers
October 31, 2024
October 19, 2024
August 27, 2024
July 17, 2024
May 31, 2024
May 12, 2024
December 12, 2023
October 27, 2023
October 9, 2023
September 22, 2023
June 13, 2023
May 25, 2023
March 14, 2023
October 13, 2022
October 11, 2022
June 27, 2022