Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs [2201.12416]