Risk Averse Reinforcement Learning

Risk-averse reinforcement learning (RL) focuses on developing algorithms that optimize decision-making in uncertain environments while explicitly managing risk, rather than solely maximizing expected reward. Current research emphasizes the use of various risk measures, such as Conditional Value at Risk (CVaR), Entropic Value at Risk (EVaR), and Gini deviation, within different RL frameworks including Q-learning, policy gradient methods, and distributional RL, often incorporating techniques like robust optimization and dynamic programming. This field is significant because it enables the development of safer and more reliable AI agents for applications where risk mitigation is crucial, such as autonomous driving, finance, and robotics.

Papers