Residual Reinforcement Learning

Residual reinforcement learning (RRL) enhances existing control policies by learning only the necessary adjustments or "residuals" needed to improve performance, rather than training a completely new policy from scratch. Current research focuses on applying RRL to diverse robotics tasks, including locomotion, manipulation, and autonomous driving, often integrating it with model-based methods, policy gradient algorithms, and mixture-of-experts frameworks. This approach offers significant advantages in sample efficiency and training speed, leading to faster development and deployment of robust controllers for complex real-world applications.

Papers