Adversarial Linear Contextual Bandit
Adversarial linear contextual bandits address the challenge of online decision-making under uncertainty where rewards are determined by an adversary and depend on contextual information. Current research focuses on developing efficient algorithms, such as variations of EXP3 and LinUCB, that minimize regret in this challenging setting, often incorporating techniques to handle delayed feedback, switching costs, and high-dimensional contexts. These advancements are significant for improving the performance of applications like collaborative edge inference and federated learning, where adaptive adversaries and limited feedback are common challenges.
Papers
October 2, 2024
May 16, 2024
December 14, 2023
November 29, 2023
October 29, 2023
October 2, 2023
September 2, 2023
May 1, 2023
January 25, 2023
July 18, 2022
June 29, 2022