Multi Agent Proximal Policy Optimization
Multi-agent proximal policy optimization (MAPPO) is a deep reinforcement learning approach designed to train multiple agents to collaborate effectively on complex tasks. Current research focuses on improving MAPPO's performance and scalability through techniques like attention mechanisms for better credit assignment, graph neural networks for representing agent interactions, and incorporating intent sharing or communication protocols to enhance coordination. These advancements are driving significant improvements in various applications, including traffic control, robotics, and resource management in wireless networks, by enabling more efficient and robust decentralized control systems.
Papers
September 8, 2024
August 29, 2024
August 13, 2024
August 8, 2024
March 19, 2024
February 5, 2024
January 18, 2024
January 3, 2024
December 10, 2023
November 6, 2023
September 20, 2023
August 8, 2023
July 21, 2023
May 26, 2023
May 23, 2023
May 17, 2023
March 15, 2023
January 6, 2023
November 11, 2022