Natural Policy Gradients In Reinforcement Learning Explained [2209.01820]