Interval Markov Decision Process

Interval Markov Decision Processes (IMDPs) are mathematical models used to represent and solve decision-making problems under uncertainty, where transition probabilities are only known to lie within specified intervals rather than being precisely defined. Current research focuses on developing efficient algorithms for synthesizing robust and optimal policies within IMDP frameworks, particularly addressing challenges posed by continuous action spaces and limited data availability, often employing techniques like Gaussian processes and scenario approaches to quantify uncertainty. This work is significant for its applications in various fields requiring robust control under uncertainty, including robotics, autonomous systems, and safety-critical systems, enabling the design of controllers with provable performance guarantees even with incomplete knowledge of the system dynamics.

Papers