Stability of Q-Learning Through Design and Optimism [2307.02632]