Multi Agent Multi

Multi-agent multi-armed bandits (MAB) research focuses on designing algorithms for multiple agents collaboratively or competitively learning optimal strategies in uncertain environments, often aiming to optimize overall system performance or individual agent fairness. Current research emphasizes developing efficient algorithms, such as those based on distributed auctions or follow-the-regularized-leader approaches, to minimize regret (the difference between optimal and achieved performance) while addressing challenges like asynchronous agent actions, communication delays, and fairness constraints. These advancements have significant implications for resource allocation in areas like wireless networks (e.g., O-RAN optimization) and offer improved theoretical understanding of collaborative learning in decentralized systems.

Papers

November 12, 2024

Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions
Fatemeh Ghaffari, Xuchuang Wang, Jinhang Zuo, Mohammad Hajiesmaili
Single Agent Multi Agent Multi Armed Bandit Adversarial Corruption Cooperative Multi Agent Learning Multi Agent Multi

June 7, 2023

Fair Multi-Agent Bandits
Amir Leshem
Regret Analysis Upper Bound Multi Agent Multi Sample Classifier Matching

March 25, 2023

Intelligent Load Balancing and Resource Allocation in O-RAN: A Multi-Agent Multi-Armed Bandit Approach
Chia-Hsiang Lai, Li-Hsiang Shen, Kai-Ten Feng
Resource Allocation Open Radio Access Network Network Optimization Load Balancing Efficient Network RAN Architecture Multi Agent Multi

February 15, 2023

On-Demand Communication for Asynchronous Multi-Agent Bandits
Yu-Zhen Janice Chen, Lin Yang, Xuchuang Wang, Xutong Liu, Mohammad Hajiesmaili, John C. S. Lui, Don Towsley
Communication Efficient Communication Complexity Multi Agent Multi Armed Bandit Armed Bandit Cooperative Bandit Flexible Communication Multi Agent Multi

November 30, 2022

On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits
Jialin Yi, Milan Vojnović
Simple Regret Follow the Regularized Leader Multi Agent Multi Armed Bandit Multi Agent Multi

September 23, 2022

An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret
Matthew Jones, Huy Lê Nguyen, Thy Nguyen
Multi Agent Multi Armed Bandit Optimal Regret Efficient Algorithm Low Regret Multi Agent Multi