Kernel Bandit
Kernel bandits address the problem of sequentially optimizing an unknown reward function that lies in a reproducing kernel Hilbert space (RKHS), with the goal of minimizing cumulative regret. Current research focuses on tightening confidence bounds for algorithms such as KernelUCB and GP-UCB, developing distributed and communication-efficient variants for multi-agent settings, and exploring neural networks and quantum computing as ways to improve performance. The field matters because it provides tools for optimizing complex, non-linear reward functions in applications ranging from Bayesian optimization to personalized recommendation and distributed control.
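To make the setting concrete, here is a minimal sketch of a GP-UCB-style loop over a finite set of arms. It is an illustration under stated assumptions, not the method of any particular paper: the RBF kernel, the lengthscale, the fixed exploration weight `beta`, the regularizer `reg`, and the sine reward function are all hypothetical choices made for this example.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.2):
    """Squared-exponential kernel between two 1-D arrays of points."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / lengthscale) ** 2)

def kernel_ucb_sketch(reward_fn, arms, horizon=30, beta=2.0, reg=0.1,
                      noise_std=0.1, seed=0):
    """Pick arms by maximizing posterior mean + beta * posterior std.

    beta and reg are fixed constants here (an assumption); the literature
    derives round-dependent values from confidence bounds.
    """
    rng = np.random.default_rng(seed)
    chosen, rewards = [], []
    for _ in range(horizon):
        if not chosen:
            # No data yet: start from a random arm.
            idx = int(rng.integers(len(arms)))
        else:
            X = arms[chosen]
            y = np.array(rewards)
            K = rbf_kernel(X, X) + reg * np.eye(len(X))
            k_star = rbf_kernel(arms, X)            # shape (n_arms, t)
            sol = np.linalg.solve(K, k_star.T)      # shape (t, n_arms)
            mean = sol.T @ y                        # kernel ridge estimate
            # k(x, x) = 1 for the RBF kernel used here.
            var = 1.0 - np.sum(k_star * sol.T, axis=1)
            std = np.sqrt(np.clip(var, 0.0, None))
            idx = int(np.argmax(mean + beta * std))
        chosen.append(idx)
        rewards.append(reward_fn(arms[idx]) + noise_std * rng.normal())
    return np.array(chosen), np.array(rewards)

# Hypothetical 1-D problem: 50 arms on [0, 1] with a smooth reward.
arms = np.linspace(0.0, 1.0, 50)
f = lambda x: np.sin(6.0 * x)
chosen, rewards = kernel_ucb_sketch(f, arms)

# Cumulative regret against the best arm in the finite set.
regret = np.cumsum(f(arms).max() - f(arms[chosen]))
```

The upper confidence bound trades off exploitation (high estimated mean) against exploration (high uncertainty). Cumulative regret sums the per-round gap between the best arm's expected reward and that of the arm actually played, which is the quantity the algorithms above aim to keep small.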