Regularized Reinforcement Learning

Regularized reinforcement learning (RL) aims to improve the stability, efficiency, and robustness of RL algorithms by incorporating penalty terms or constraints into the learning process. Current research focuses on addressing issues like over-regularization in various action spaces, improving the transferability of learned rewards in inverse RL, and mitigating estimation biases in offline RL through techniques such as robust averaging and selective penalization. These advancements enhance the reliability and sample efficiency of RL, leading to improved performance in diverse applications, including robotics, autonomous driving, and language model control.

Papers