Feedback Graph
Feedback graphs model online learning scenarios where observing the outcome of one action reveals information about related actions, as defined by the graph's structure. Current research focuses on developing efficient algorithms, often based on variations of UCB or online mirror descent, that achieve optimal or near-optimal regret bounds under various graph structures (e.g., strongly/weakly observable, stochastic/adversarial environments) and contextual information. This framework improves upon traditional bandit problems by leveraging the inherent dependencies between actions, leading to more sample-efficient learning and impacting diverse applications like inventory control, recommendation systems, and clinical trials.
Papers
June 30, 2024
June 26, 2024
May 18, 2024
February 12, 2024
August 29, 2023
June 5, 2023
May 24, 2023
May 6, 2023
February 17, 2023
October 9, 2022
October 4, 2022
June 20, 2022
June 16, 2022
June 2, 2022
June 1, 2022