Constrained Bandit

Constrained bandit problems address the challenge of maximizing rewards while simultaneously satisfying constraints, a crucial requirement in many real-world applications such as safe robotics and personalized recommendation. Current research focuses on developing efficient algorithms for a variety of problem settings, including linear, tensor, and kernelized bandits, often combining convex optimization with techniques such as upper confidence bounds or Thompson sampling to balance exploration and exploitation under constraints. The field is significant because it provides theoretically grounded and computationally practical methods for sequential decision-making under safety or resource limitations, with impact on any domain that requires safe and efficient sequential learning.
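To make the exploration-exploitation tradeoff under a constraint concrete, the following minimal sketch implements a UCB-style rule for a toy multi-armed bandit in which each arm also incurs a cost that must stay below a threshold. The arm means, the threshold tau, and the "optimistic reward, pessimistic cost" feasibility rule are illustrative assumptions for this sketch, not the method of any specific paper listed below.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy problem: 5 arms with unknown mean rewards and mean costs.
# The learner should keep the per-round expected cost below a threshold tau.
true_rewards = np.array([0.2, 0.5, 0.7, 0.9, 0.4])
true_costs   = np.array([0.1, 0.3, 0.5, 0.9, 0.2])
tau = 0.6            # cost threshold (the constraint)
horizon = 5000
n_arms = len(true_rewards)

counts = np.zeros(n_arms)        # pulls per arm
reward_sums = np.zeros(n_arms)   # cumulative observed rewards
cost_sums = np.zeros(n_arms)     # cumulative observed costs

for t in range(1, horizon + 1):
    if t <= n_arms:
        # Pull each arm once to initialise the estimates.
        arm = t - 1
    else:
        bonus = np.sqrt(2.0 * np.log(t) / counts)
        reward_ucb = reward_sums / counts + bonus   # optimism for the objective
        cost_ucb = cost_sums / counts + bonus       # pessimism for the constraint
        feasible = cost_ucb <= tau                  # arms that look safe with high confidence
        if feasible.any():
            # Among plausibly safe arms, pick the highest reward UCB.
            candidates = np.where(feasible)[0]
            arm = int(candidates[np.argmax(reward_ucb[candidates])])
        else:
            # No arm looks safe yet: fall back to the lowest cost UCB.
            arm = int(np.argmin(cost_ucb))

    # Bernoulli feedback for both reward and cost.
    reward = float(rng.random() < true_rewards[arm])
    cost = float(rng.random() < true_costs[arm])
    counts[arm] += 1
    reward_sums[arm] += reward
    cost_sums[arm] += cost

print("pull counts per arm:", counts.astype(int))
print("empirical cost per round:", cost_sums.sum() / horizon)
```

Treating the constraint pessimistically (an upper confidence bound on cost) while treating the reward optimistically is one common way the papers in this area trade off safety against exploration; the structured settings above (linear, tensor, kernelized) replace the per-arm averages with regression estimates and confidence sets.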

Papers