Offline Safe Reinforcement Learning

Offline safe reinforcement learning (RL) focuses on training robots and AI agents to make optimal decisions while adhering to safety constraints, using only pre-collected data. Current research emphasizes developing algorithms that address the challenges of limited and potentially imperfect data, focusing on techniques like pessimistic value iteration, conditional distribution shaping, and risk-sensitive approaches within model architectures such as Decision Transformers and diffusion models. This field is crucial for deploying RL agents in real-world settings where online trial-and-error is too risky, with applications ranging from robotics and autonomous driving to healthcare.

Papers