Markovian Reward

Markovian reward functions, commonly used in reinforcement learning, assign rewards based solely on the current state, simplifying the learning process. However, recent research highlights limitations of this approach, particularly in handling multi-objective tasks, risk sensitivity, and scenarios with delayed or non-Markovian rewards. Current efforts focus on developing alternative reward structures, such as reward machines and multi-dimensional rewards, and on methods for mapping non-Markovian rewards into equivalent Markovian representations to leverage existing algorithms. These advancements are crucial for improving the robustness and applicability of reinforcement learning in complex real-world scenarios.

Papers