Linear Bandit

Linear bandits are a class of online learning problems where an agent sequentially selects actions (arms) from a set characterized by linear features, receiving stochastic rewards dependent on an unknown linear function of those features. Current research focuses on improving algorithm efficiency and robustness, exploring variations such as contextual bandits, incorporating human response times for preference learning, and addressing misspecified models or non-stationary environments. These advancements are significant for applications requiring efficient sequential decision-making under uncertainty, including personalized recommendations, clinical trials, and resource allocation, by enabling more accurate and adaptable algorithms.

Papers

June 3, 2024

Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Tianyuan Jin, Kyoungseok Jang, Nicolò Cesa-Bianchi
Regret Bound Linear Bandit Adversarial Behavior Stochastic Reward Adaptive Adversary Linear Reward Sparse Linear Bandit

May 24, 2024

Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits
Jie Bian, Vincent Y. F. Tan
Multi Armed Bandit Contextual Bandit Linear Bandit Confidence Bound Bregman Divergence

May 23, 2024

Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
Andi Peng, Yuying Sun, Tianmin Shu, David Abel
Reward Model Linear Bandit Reward Learning Natural Language Input Social Learning Preference Based Reward Feature Preference

May 22, 2024

Symmetric Linear Bandits with Hidden Symmetry
Nam Phuong Tran, The Anh Ta, Debmalya Mandal, Long Tran-Thanh
High Dimensional Linear Bandit Approximate Symmetry Low Dimensional Structure Low Dimensional Subspace Dimensional Subspace

May 17, 2024

Restless Linear Bandits
Azadeh Khaleghi
Linear Bandit Sub Linear Regret Restless Bandit

May 12, 2024

Stochastic Bandits with ReLU Neural Networks
Kan Xu, Hamsa Bastani, Surbhi Goel, Osbert Bastani
Linear Bandit Stochastic Bandit ReLU Neural Network Regret Rate ReLU Function

May 9, 2024

Imprecise Multi-Armed Bandits
Vanessa Kosoy
Multi Armed Bandit Linear Bandit Two Player Zero Sum Credal Set Imprecise Query

May 5, 2024

FedConPE: Efficient Federated Conversational Bandits with Heterogeneous Clients
Zhuohua Li, Maoli Liu, John C. S. Lui
Linear Bandit Heterogeneous Client Contextual Linear Bandit Federated Bandit Conversational Bandit

March 15, 2024

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang, Jize Xie, Yi Chen, John C. S. Lui, Dongruo Zhou
Linear Bandit Non Stationary Non Stationarity Variance Dependent Regret

March 10, 2024

LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem
Yun-Ang Wu, Yun-Da Tsai, Shou-De Lin
Multi Armed Bandit Linear Bandit Sequential Decision Adaptive Algorithm Thresholding Linear Bandit

March 1, 2024

Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds
Shinji Ito, Taira Tsuchiya, Junya Honda
Multi Armed Bandit Contextual Bandit Learning Rate Linear Bandit Follow the Regularized Leader Best of Both World Algorithm Competitive Ratio

February 24, 2024

Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery
Yassir Jedra, William Réveillard, Stefan Stojanovic, Alexandre Proutiere
Contextual Bandit Linear Bandit Regret Minimization

February 20, 2024

Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
Junyan Liu, Yunfan Li, Ruosong Wang, Lin F. Yang
Linear Bandit Last Iterate Convergence Order Optimal Regret

February 19, 2024

Linear bandits with polylogarithmic minimax regret
Josep Lumbreras, Marco Tomamichel
Linear Bandit Least Square Bandit Algorithm Cumulative Regret Minimax Regret

February 12, 2024

February 11, 2024

Optimal Thresholding Linear Bandit
Eduardo Ochoa Rivera, Ambuj Tewari
Practical Algorithm Sample Complexity Linear Bandit Bandit Problem Pure Exploration Thresholding Linear Bandit

January 21, 2024

Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
Jiabin Lin, Shana Moothedath
Multi Task Learning Linear Bandit Heterogeneous Agent Linear Contextual Bandit Stochastic Bandit State Constraint Context Distribution

January 17, 2024

Fixed-Budget Differentially Private Best Arm Identification
Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan
Differential Privacy Linear Bandit Best Arm Identification DP Algorithm Differential Privacy Constraint

January 14, 2024

Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee
Low Rank Linear Bandit Efficient Framework Subspace Identification