Adversarial Reinforcement Learning

Adversarial reinforcement learning (ARL) trains agents to perform well even when adversaries attempt to disrupt their actions or observations. Current research centers on robust training algorithms, often cast as minimax optimization within game-theoretic frameworks such as Stackelberg games, and explores model architectures ranging from deep neural networks to decision transformers for handling complex environments and high-dimensional data. The field is important for safety-critical applications such as autonomous driving and robotics, where agents must remain reliable under unexpected events or deliberate attacks. A central open problem is developing ARL algorithms that are both provably robust and efficient, particularly with respect to convergence guarantees and sample efficiency.
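
The minimax structure mentioned above can be made concrete with a toy example. The sketch below is a minimal illustration, not any paper's method: it assumes a zero-sum matrix game in place of a full protagonist/adversary decision process, with an arbitrary payoff matrix, learning rate, and iteration count chosen for demonstration. Both players run exponential-weights updates on the same objective, so the protagonist maximizes the return that the adversary minimizes; deep ARL applies the same structure with neural policies and policy-gradient updates.

```python
# Minimal sketch of the minimax training loop behind adversarial RL,
# reduced to a zero-sum matrix game so it runs without an RL environment.
# The payoff matrix G, learning rate, and step count are illustrative
# assumptions, not values taken from any specific paper.
import numpy as np

rng = np.random.default_rng(0)

# G[i, j]: protagonist's expected return when it plays action i and the
# adversary plays action j; the adversary receives -G[i, j] (zero-sum).
G = rng.normal(size=(4, 4))

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

logits_p = np.zeros(4)   # protagonist policy parameters
logits_a = np.zeros(4)   # adversary policy parameters
avg_p = np.zeros(4)      # time-averaged protagonist policy
avg_a = np.zeros(4)      # time-averaged adversary policy
lr, steps = 0.1, 5000

for t in range(steps):
    pi_p, pi_a = softmax(logits_p), softmax(logits_a)
    avg_p += pi_p / steps
    avg_a += pi_a / steps
    # Exponential-weights (no-regret) updates: the protagonist ascends on
    # its expected payoff vector, the adversary descends on it. The
    # time-averaged policies approximate the minimax solution.
    logits_p += lr * (G @ pi_a)
    logits_a -= lr * (G.T @ pi_p)

print("protagonist worst-case value:", (avg_p @ G).min())
print("value at averaged policies:  ", avg_p @ G @ avg_a)
```

Reporting the time-averaged policies rather than the final iterates is deliberate: simultaneous updates in zero-sum games can oscillate, and the averaged strategies are what standard no-regret guarantees cover, which is one reason convergence guarantees remain a research focus in deep ARL.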

Papers