Double Deep Q Learning
Double Deep Q-learning (DDQL) is a reinforcement learning technique aiming to improve the stability and efficiency of deep Q-networks by mitigating overestimation bias inherent in standard Q-learning. Current research focuses on enhancing DDQL's performance in various applications, including continuous control tasks, multi-agent systems, and federated learning, often incorporating modifications like prioritized experience replay, actor-critic methods, and novel loss functions to accelerate learning and improve robustness. This approach shows promise across diverse fields, from optimizing energy production in wind turbines and improving water quality monitoring to accelerating drug discovery and enhancing resource allocation in next-generation communication networks.
Papers
Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications
Masoud Shokrnezhad, Tarik Taleb, Patrizio Dazzi
Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications
Hamidreza Mazandarani, Masoud Shokrnezhad, Tarik Taleb, Richard Li