Multi Agent Reinforcement Learning
Multi-agent reinforcement learning (MARL) focuses on developing algorithms that enable multiple independent agents to learn optimal strategies within a shared environment, often to achieve a common goal. Current research emphasizes improving sample efficiency and generalization, exploring novel architectures like equivariant graph neural networks and specialized network structures (e.g., Bottom-Up Networks), and addressing challenges posed by non-stationarity and partial observability through techniques such as auxiliary prioritization and global state inference with diffusion models. MARL's significance lies in its potential to solve complex real-world problems across diverse domains, including robotics, traffic control, and healthcare, by enabling effective coordination and collaboration among multiple agents.
Papers
Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning
Satchit Chatterji, Erman Acar
Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning
Zhiyu Shao, Qiong Wu, Pingyi Fan, Kezhi Wang, Qiang Fan, Wen Chen, Khaled B. Letaief
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications
Sriniketh Vangaru, Daniel Rosen, Dylan Green, Raphael Rodriguez, Maxwell Wiecek, Amos Johnson, Alyse M. Jones, William C. Headley
FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents
Jannis Weil, Jonas Ringsdorf, Julian Barthel, Yi-Ping Phoebe Chen, Tobias Meuser
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Hai Zhong, Xun Wang, Zhuoran Li, Longbo Huang
Multi-Agent Reinforcement Learning with Selective State-Space Models
Jemma Daniel, Ruan de Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Claude Formanek, Arnu Pretorius
Toward Finding Strong Pareto Optimal Policies in Multi-Agent Reinforcement Learning
Bang Giang Le, Viet Cuong Ta
Hierarchical Multi-agent Reinforcement Learning for Cyber Network Defense
Aditya Vikram Singh, Ethan Rathbun, Emma Graham, Lisa Oakley, Simona Boboila, Alina Oprea, Peter Chin
Delay-Constrained Grant-Free Random Access in MIMO Systems: Distributed Pilot Allocation and Power Control
Jianan Bai, Zheng Chen, Erik. G. Larsson