Combinatorial Semi Bandit

Combinatorial semi-bandits address the problem of sequentially selecting subsets of items (arms) to maximize cumulative reward. In each round the learner observes the individual reward of every arm in the selected subset and nothing about the arms it left out. Current research focuses on improving algorithm efficiency for large-scale problems, handling delayed or non-stationary rewards, incorporating fairness constraints, and accounting for risk aversion. The main algorithmic tools are Thompson Sampling, Upper Confidence Bound (UCB) variants, and Follow The Regularized Leader (FTRL). These advances matter for applications such as online advertising, crowdsourcing, and resource allocation in transportation networks, where decisions must be both efficient and robust under uncertainty. The field is also exploring the impact of causal relationships between rewards, and the use of approximation oracles when the underlying combinatorial optimization problem is computationally hard.
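As a rough illustration of the selection-and-feedback loop described above, here is a minimal sketch of a UCB-style combinatorial algorithm for the simplest action set, choosing any k of n arms. It is not taken from any of the papers listed below. The function name `cucb_top_k`, the Bernoulli reward model, the exploration constant 1.5, and the exact top-k oracle are all assumptions made for illustration.

```python
import math
import random


def cucb_top_k(means, k, horizon, seed=0):
    """Sketch of a UCB-style loop for top-k selection (illustrative only).

    means   -- hypothetical true Bernoulli means, hidden from the learner
    k       -- number of arms selected per round
    horizon -- number of rounds
    """
    rng = random.Random(seed)
    n = len(means)
    counts = [0] * n    # times each arm has been observed
    sums = [0.0] * n    # sum of observed rewards per arm
    total_reward = 0.0

    for t in range(1, horizon + 1):
        # Optimistic index per arm; unobserved arms get +inf so they are tried first.
        ucb = [
            float("inf") if counts[i] == 0
            else sums[i] / counts[i] + math.sqrt(1.5 * math.log(t) / counts[i])
            for i in range(n)
        ]
        # Exact oracle for this action set: take the k arms with the largest index.
        chosen = sorted(range(n), key=lambda i: ucb[i], reverse=True)[:k]

        # Semi-bandit feedback: one reward is observed per chosen arm, none for the rest.
        for i in chosen:
            reward = 1.0 if rng.random() < means[i] else 0.0
            counts[i] += 1
            sums[i] += reward
            total_reward += reward

    return counts, total_reward
```

Calling `cucb_top_k([0.9, 0.8, 0.2, 0.1], k=2, horizon=2000)` returns the per-arm observation counts and the total reward collected. For richer action sets (matroids, paths, matchings), the sorting step would be replaced by an exact or approximate optimization oracle over the feasible subsets.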

Papers

October 7, 2024

Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox
Raymond Zhang, Richard Combes
Thompson Sampling Combinatorial Bandit Combinatorial Semi Bandit Finite Time Regret Gaussian Reward

July 22, 2024

Merit-based Fair Combinatorial Semi-Bandit with Unrestricted Feedback Delays
Ziqun Chen, Kechao Cai, Zhuoyue Chen, Jinbei Zhang, John C. S. Lui
Bandit Algorithm Fairness Constraint Delayed Feedback Combinatorial Semi Bandit Reward Delay

May 28, 2024

Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu
Sublinear Time Combinatorial Semi Bandit Semi Bandit Matroid Rank Valuation

February 23, 2024

Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits
Julien Zhou (Thoth, STATIFY), Pierre Gaillard (Thoth), Thibaud Rahier, Houssam Zenati (SODA, MIND), Julyan Arbel (STATIFY)
Bandit Feedback Proxy Dataset Combinatorial Semi Bandit Adaptive Variance

July 26, 2023

Piecewise-Stationary Combinatorial Semi-Bandit with Causally Related Rewards
Behzad Nourani-Koliji, Steven Bilaj, Amir Rezaei Balef, Setareh Maghsudi
Confidence Bound Optimal Decision Combinatorial Semi Bandit

July 18, 2023

Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards
Saeed Ghoorchian, Setareh Maghsudi
Sequential Decision Causal Relation Delayed Feedback Combinatorial Semi Bandit

May 15, 2023

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs
Dirk van der Hoeven, Lukas Zierahn, Tal Lancewicki, Aviv Rosenberg, Nicolò Cesa-Bianchi
Unified Framework Linear Bandit Bandit Feedback Delayed Feedback Combinatorial Semi Bandit

January 31, 2023

Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong
Information Theoretic Combinatorial Semi Bandit Semi Bandit

January 17, 2023

A Combinatorial Semi-Bandit Approach to Charging Station Selection for Electric Vehicles
Niklas Åkerblom, Morteza Haghir Chehreghani
Electric Vehicle Road Network Long Horizon Navigation Optimal Service Station Design Combinatorial Semi Bandit

December 25, 2022

Linear Combinatorial Semi-Bandit with Causally Related Rewards
Behzad Nourani-Koliji, Saeed Ghoorchian, Setareh Maghsudi
Sublinear Regret Causal Relation Sequential Decision Making Problem Combinatorial Semi Bandit Semi Bandit

August 31, 2022

Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms
Xutong Liu, Jinhang Zuo, Siwei Wang, Carlee Joe-Wong, John C. S. Lui, Wei Chen
Probabilistic Model Regret Bound Regret Analysis Influence Maximization Best Arm Combinatorial Semi Bandit Arm Selection

June 16, 2022

A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification
Fazeleh Hoseini, Niklas Åkerblom, Morteza Haghir Chehreghani
Contextual Bandit Network Analysis Combinatorial Semi Bandit Bottleneck Analysis Contextual Combinatorial

June 1, 2022

Incentivizing Combinatorial Bandit Exploration
Xinyan Hu, Dung Daniel Ngo, Aleksandrs Slivkins, Zhiwei Steven Wu
Bandit Algorithm Exploration Problem Combinatorial Bandit Combinatorial Semi Bandit Incentive Compatible

December 2, 2021

Risk-Aware Algorithms for Combinatorial Semi-Bandits
Shaarad Ayyagari, Ambedkar Dukkipati
Regret Bound Combinatorial Bandit Combinatorial Multi Armed Bandit Combinatorial Semi Bandit Risk Sensitive Algorithm

November 8, 2021

The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle
Fang Kong, Yueran Yang, Wei Chen, Shuai Li
Thompson Sampling Hardness Result Test Oracle Combinatorial Multi Armed Bandit Linear Minimization Oracle Combinatorial Semi Bandit Linear Optimization Oracle