Paper ID: 2306.10216

Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm

Qinru Li, Hao Xiang

Reinforcement Learning has achieved tremendous success in the many Atari games. In this paper we explored with the lunar lander environment and implemented classical methods including Q-Learning, SARSA, MC as well as tiling coding. We also implemented Neural Network based methods including DQN, Double DQN, Clipped DQN. On top of these, we proposed a new algorithm called Heuristic RL which utilizes heuristic to guide the early stage training while alleviating the introduced human bias. Our experiments showed promising results for our proposed methods in the lunar lander environment.

Submitted: Jun 17, 2023

Topics

Reinforcement Learning
Policy Gradient
Q Learning
DQN Agent
Double DQN

Links

arXiv PDF