Dual Reinforcement Learning

Dual reinforcement learning (RL) tackles complex problems by decomposing them into coupled sub-problems, each addressed by a separate RL agent or policy. Current research focuses on developing efficient algorithms, such as those based on policy mirror descent or Deep Q-Networks (DQNs), to optimize these dual policies, often leveraging techniques like Bregman divergences for improved convergence. This approach finds applications in diverse fields, from optimizing resource allocation in real-world systems like bike-sharing to enhancing the performance of recommender systems and enabling more robust robot manipulation through imitation learning. The resulting improvements in efficiency and performance demonstrate the significant potential of dual RL for solving challenging sequential decision-making problems.

Papers