Offline Learning

Offline learning in reinforcement learning (RL) aims to train effective policies using only pre-collected datasets, avoiding the need for costly online interaction with the environment. Current research focuses on improving generalization to unseen tasks and environments, addressing distribution shifts between training and deployment data, and developing more efficient algorithms, including those based on successor representations, decision transformers, and various forms of Q-learning with conservatism. These advancements are crucial for deploying RL in real-world scenarios where online learning is impractical or risky, with applications ranging from robotics and game playing to wireless network optimization and healthcare.

Papers