MaxEnt RL
Maximum Entropy Reinforcement Learning (MaxEnt RL) aims to find optimal policies that balance maximizing expected rewards with maximizing policy entropy, leading to more robust and exploratory behavior. Current research focuses on developing efficient algorithms, particularly for continuous action spaces, using architectures like Energy-Based Models and Normalizing Flows, and improving the estimation of entropy for expressive policies. These advancements are improving the performance of MaxEnt RL in complex tasks, such as robotics and wildfire prediction, by addressing challenges like high dimensionality and non-smoothness in the underlying models.
Papers
July 19, 2024
May 22, 2024
May 2, 2024
March 11, 2024
February 15, 2024
November 29, 2023
January 5, 2023