A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee [2302.05816]