MDP Model
Markov Decision Processes (MDPs) model sequential decision-making under uncertainty, aiming to find optimal policies maximizing cumulative rewards. Current research focuses on addressing challenges like model uncertainty (e.g., using multi-model MDPs and robust reinforcement learning), improving sample efficiency through abstract state representations and efficient algorithms, and handling complex real-world scenarios with heterogeneous environments and non-Markovian safety constraints. These advancements are crucial for improving the performance and reliability of reinforcement learning agents in various applications, from robotics and healthcare to online advertising and autonomous systems.
Papers
July 8, 2024
June 22, 2024
February 5, 2024
April 6, 2023
March 22, 2023
February 10, 2023
February 4, 2023
October 5, 2022
September 25, 2022
September 15, 2022
September 14, 2022