Advantage Actor Critic
Advantage Actor-Critic (A2C) is a reinforcement learning algorithm aiming to improve the efficiency and stability of policy optimization by combining an actor network (for selecting actions) and a critic network (for evaluating actions). Current research focuses on enhancing A2C's performance through techniques like experience replay, variance reduction, and hybrid quantum-classical implementations, as well as extending its application to multi-agent systems and interpretability. These advancements are significant for improving the sample efficiency and applicability of reinforcement learning in diverse fields, including robotics, game playing, and recommender systems.
Papers
May 24, 2024
February 15, 2024
January 13, 2024
September 9, 2023
May 9, 2023
November 29, 2022
November 19, 2022
June 14, 2022
May 18, 2022
April 19, 2022
January 15, 2022