Paper ID: 2402.09467

Optimal Thresholding Linear Bandit

Eduardo Ochoa Rivera, Ambuj Tewari

We study a novel pure exploration problem: the $\epsilon$-Thresholding Bandit Problem (TBP) with fixed confidence in stochastic linear bandits. We prove a lower bound for the sample complexity and extend an algorithm designed for Best Arm Identification in the linear case to TBP that is asymptotically optimal.

Submitted: Feb 11, 2024

Topics

Practical Algorithm
Sample Complexity
Linear Bandit
Bandit Problem
Pure Exploration
Thresholding Linear Bandit

Links

arXiv PDF