Linear Bandit

Linear bandits are a class of online learning problems where an agent sequentially selects actions (arms) from a set characterized by linear features, receiving stochastic rewards dependent on an unknown linear function of those features. Current research focuses on improving algorithm efficiency and robustness, exploring variations such as contextual bandits, incorporating human response times for preference learning, and addressing misspecified models or non-stationary environments. These advancements are significant for applications requiring efficient sequential decision-making under uncertainty, including personalized recommendations, clinical trials, and resource allocation, by enabling more accurate and adaptable algorithms.

Papers

May 26, 2022

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Sanae Amani, Tor Lattimore, András György, Lin F. Yang
Linear Bandit Regret Minimization Single Agent Linear Contextual Bandit Optimal Communication

May 22, 2022

On Elimination Strategies for Bandit Fixed-Confidence Identification
Andrea Tirinzoni, Rémy Degenne
Sample Complexity Linear Bandit Bandit Identification Adaptive Strategy Parameter Identifiability

May 12, 2022

May 3, 2022

Norm-Agnostic Linear Bandits
Spencer, Gales, Sunder Sethuraman, Kwang-Sung Jun
Regret Bound Linear Bandit Novel Algorithm Reward Structure

April 3, 2022

Byzantine-Robust Federated Linear Bandits
Ali Jadbabaie, Haochuan Li, Jian Qian, Yi Tian
Linear Bandit O$ Regret Robust Aggregation Federated Bandit

March 29, 2022

March 8, 2022

Leveraging Initial Hints for Free in Stochastic Linear Bandits
Ashok Cutkosky, Chris Dann, Abhimanyu Das, Qiuyi, Zhang
Linear Bandit Bandit Feedback O$ Regret Natural Language Hint

March 2, 2022

February 28, 2022

Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits
Heyang Zhao, Dongruo Zhou, Jiafan He, Quanquan Gu
Application Proficiency Linear Bandit Follow the Regularized Leader Online Regression Variance Dependent Regret Stochastic Noise LabEl Noise

February 23, 2022

February 21, 2022

Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella, Karim Lounici, Grégoire Pacreau, Massimiliano Pontil
Transfer Learning Linear Bandit Regret Guarantee Stochastic Bandit Multi Task Representation

February 14, 2022

The Impact of Batch Learning in Stochastic Linear Bandits
Danil Provodin, Pratik Gajane, Mykola Pechenizkiy, Maurits Kaptein
Global Impact Linear Bandit Bandit Problem Batch Learning

February 12, 2022

Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo, Mengxiao Zhang, Peng Zhao, Zhi-Hua Zhou
Linear Bandit Simple Regret Band Selection Adversarial Bandit Herding Algorithm Adversarial Linear Contextual Bandit

February 7, 2022

Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems
Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi
Recommender System Linear Bandit Non Stationary Linear Contextual Bandit Large Scale Recommender System Policy Sampling

February 4, 2022

An Experimental Design Approach for Regret Minimization in Logistic Bandits
Blake Mason, Kwang-Sung Jun, Lalit Jain
Linear Bandit Regret Minimization Experimental Design Cumulative Regret Logistic Bandit

January 13, 2022

Non-Stationary Representation Learning in Sequential Linear Bandits
Yuzhen Qin, Tommaso Menara, Samet Oymak, ShiNung Ching, Fabio Pasqualetti
Representation Learning Meaningful Representation Linear Bandit Non Stationary