Double DQN

Double Deep Q-Networks (Double DQN) is a reinforcement learning algorithm aiming to improve the stability and performance of Q-learning by mitigating overestimation bias. Current research focuses on enhancing Double DQN's efficiency and applicability through techniques like incorporating ensemble methods, leveraging problem structure (e.g., weakly coupled systems), and integrating heuristic guidance or local planning to improve sample efficiency. These advancements are impacting various fields, including traffic signal control and combinatorial optimization, by enabling the development of more robust and effective AI agents for complex decision-making problems.

Papers