Policy Deep Reinforcement Learning
Policy deep reinforcement learning (DRL) aims to develop efficient algorithms that learn optimal policies from data, with a particular focus on off-policy methods, which reuse past experience collected under earlier or different policies. Current research emphasizes improving sample efficiency and stability through refined experience replay mechanisms (e.g., corrected uniform replay, neighborhood mixup), novel critic updates that are independent of the actor, and adaptive blending of online and offline learning. These advances matter for robotics and other domains where data is limited, because they yield more robust and sample-efficient control policies in complex environments.
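To make the idea of reusing past experience concrete, here is a minimal sketch of a plain uniform replay buffer, the baseline that the refined replay mechanisms mentioned above build on. It is not taken from any of the listed papers; the names Transition and UniformReplayBuffer, the scalar state type, and the capacity and seed parameters are all assumptions chosen for illustration.

```python
import random
from collections import deque
from typing import Deque, List, NamedTuple


class Transition(NamedTuple):
    """One step of experience (hypothetical layout, scalar state for brevity)."""
    state: float
    action: int
    reward: float
    next_state: float
    done: bool


class UniformReplayBuffer:
    """Fixed-size FIFO store that samples past transitions uniformly."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        # deque(maxlen=...) silently evicts the oldest item once full.
        self._storage: Deque[Transition] = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def add(self, transition: Transition) -> None:
        self._storage.append(transition)

    def sample(self, batch_size: int) -> List[Transition]:
        # Uniform sampling without replacement; raises ValueError if
        # batch_size exceeds the number of stored transitions.
        return self._rng.sample(list(self._storage), batch_size)

    def __len__(self) -> int:
        return len(self._storage)


if __name__ == "__main__":
    buffer = UniformReplayBuffer(capacity=3)
    for i in range(5):
        buffer.add(Transition(float(i), 0, 1.0, float(i + 1), False))
    # Only the 3 most recent transitions remain available for sampling.
    print(len(buffer), buffer.sample(2))
```

An off-policy learner would call sample on each update step, so a single transition can contribute to many gradient updates. That reuse is where the sample-efficiency gain comes from.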