Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning [2410.05655]