Generalized Policy Improvement
Generalized Policy Improvement (GPI) is a reinforcement learning framework aiming to efficiently improve policies by leveraging previously learned knowledge, particularly through the use of successor features (SFs). Current research focuses on developing theoretically sound GPI algorithms, often incorporating neural network architectures like SF-DQN and MSFA to learn and combine SFs for improved sample efficiency and knowledge transfer across tasks, including multi-objective and safety-constrained scenarios. This work holds significant importance for advancing reinforcement learning, enabling faster learning, better generalization, and more robust policy transfer in complex real-world applications such as robotics and control systems.
Papers
November 2, 2024
June 16, 2024
May 24, 2024
October 24, 2023
January 28, 2023
January 18, 2023
November 10, 2022
June 28, 2022
June 22, 2022
June 17, 2022