Paper ID: 2309.16960

On Generating Explanations for Reinforcement Learning Policies: An Empirical Study

Mikihisa Yuasa, Huy T. Tran, Ramavarapu S. Sreenivas

A \textit{reinforcement learning} policy maps states to actions so as to maximize rewards; for humans to understand such a policy, it needs an accompanying explanation. In this paper, we introduce a set of \textit{linear temporal logic} formulae designed to explain policies, and an algorithm that searches through those formulae for the one that best explains a given policy. We focus on explanations that elucidate both the ultimate objectives the policy accomplishes and the prerequisite conditions it upholds throughout its execution. We illustrate the effectiveness of the proposed approach on a simulated game of capture-the-flag and in a car-parking environment.

Submitted: Sep 29, 2023