Dynamic Restless
Dynamic restless multi-armed bandits (RMABs) model sequential decision-making problems where choosing an action affects not only immediate rewards but also the future states of multiple options, each evolving independently. Current research focuses on extending RMABs to handle more complex scenarios, including global, non-separable rewards, adversarial environments, contextual information, and fairness constraints, often employing index policies, reinforcement learning algorithms, and decision-focused learning approaches. These advancements are improving resource allocation in diverse fields like public health, smart grids, and mobile interventions, by enabling more efficient and equitable strategies.
Papers
December 11, 2024
June 2, 2024
May 2, 2024
March 22, 2024
February 22, 2024
December 16, 2023
October 3, 2023
August 17, 2023
July 27, 2022
June 8, 2022
February 28, 2022
February 2, 2022