Robust Markov Decision Process

Robust Markov Decision Processes (RMDPs) extend standard Markov Decision Processes by incorporating uncertainty in transition probabilities, aiming to find policies optimal under various possible system dynamics. Current research focuses on developing efficient algorithms, particularly policy iteration and gradient methods, to solve RMDPs with diverse uncertainty sets, including non-rectangular and ambiguity sets defined by phi-divergences. These advancements address computational challenges and improve the applicability of RMDPs to real-world problems where model uncertainty is prevalent, leading to more reliable and robust decision-making in various domains.

Papers