Deterministic Policy
Deterministic policy learning in reinforcement learning aims to find optimal action selection strategies that are predictable and repeatable, unlike stochastic policies. Current research focuses on developing efficient algorithms for finding these policies, particularly in continuous state and action spaces, often employing techniques like policy gradient methods, primal-dual approaches, and model predictive control integrated with neural networks (e.g., LSTM networks). These advancements are significant because deterministic policies are often preferred in real-world applications demanding robustness, safety, and traceability, such as robotics and control systems, while also presenting unique challenges for efficient learning.
Papers
December 1, 2024
October 25, 2024
August 27, 2024
August 19, 2024
July 18, 2024
June 20, 2024
May 29, 2024
May 23, 2024
May 3, 2024
January 18, 2024
November 30, 2023
November 9, 2023
August 28, 2023
July 22, 2023
July 18, 2023
May 11, 2023
March 29, 2023
February 21, 2023
February 7, 2023