Programmatic Reinforcement Learning

Programmatic reinforcement learning (PRL) aims to represent reinforcement learning policies as executable programs, enhancing interpretability and generalizability compared to traditional black-box neural network approaches. Current research focuses on improving the efficiency of program search, often leveraging large language models to guide the process and incorporating hierarchical structures like state machines to handle long-horizon tasks. This approach holds significant promise for addressing challenges in cooperative AI, robot control, and other domains requiring explainable and robust AI agents, particularly where human-agent interaction or adaptation to novel environments is crucial.

Papers