Online Policy Customization
Online policy customization focuses on adapting pre-trained policies to new, unforeseen situations or requirements without extensive retraining. Current research emphasizes developing efficient algorithms, such as those based on residual learning and actor-critic methods, that can quickly adjust policies using minimal additional data, even leveraging programmatic representations for improved interpretability. This field is crucial for deploying robust AI systems in real-world settings, enabling personalized interventions and improving the adaptability of reinforcement learning agents across diverse tasks and environments.
Papers
July 1, 2024
August 4, 2023
June 15, 2023
April 20, 2023
March 2, 2023
September 21, 2022
February 7, 2022