Deterministic Policy
Deterministic policy learning in reinforcement learning aims to find optimal action selection strategies that are predictable and repeatable, unlike stochastic policies. Current research focuses on developing efficient algorithms for finding these policies, particularly in continuous state and action spaces, often employing techniques like policy gradient methods, primal-dual approaches, and model predictive control integrated with neural networks (e.g., LSTM networks). These advancements are significant because deterministic policies are often preferred in real-world applications demanding robustness, safety, and traceability, such as robotics and control systems, while also presenting unique challenges for efficient learning.
Papers
December 1, 2022
October 24, 2022
October 21, 2022
October 17, 2022
September 27, 2022
July 13, 2022
May 6, 2022
April 8, 2022
March 5, 2022
January 20, 2022
December 11, 2021