Approximate Policy Iteration
Approximate Policy Iteration (API) aims to efficiently solve reinforcement learning problems by iteratively improving policies through approximate policy evaluation and improvement steps. Current research focuses on addressing the "curse of dimensionality" using deep neural networks, particularly Galerkin methods and physics-informed neural networks, and on developing formally verified algorithms for improved reliability. These advancements enhance the applicability of API to high-dimensional control problems and safety-critical systems, improving both theoretical understanding and practical performance in various domains like robotics and adaptive filtering.
Papers
June 13, 2024
June 11, 2024
February 21, 2024
January 3, 2024
June 19, 2023
February 2, 2023
October 27, 2022
October 21, 2022
October 20, 2022
August 5, 2022
June 28, 2022
April 8, 2022