Reward Machine

Reward machines are automata-based formalisms used to represent complex, non-Markovian reward functions in reinforcement learning, aiming to improve learning efficiency and enable the specification of intricate tasks. Current research focuses on synthesizing reward machines from various sources (e.g., planning, expert demonstrations, large language models), developing algorithms for learning with noisy or uncertain interpretations of the reward structure, and extending their application to multi-agent settings and partially observable environments. This work holds significant promise for advancing reinforcement learning's capabilities in complex real-world scenarios by providing structured representations of reward functions and facilitating knowledge transfer across tasks.

Papers