Value Network
Value networks estimate the expected return of a state or action and are a core component of reinforcement learning (RL), where they help assign credit across long sequential tasks and make policy learning more efficient. Current research targets their known limitations, such as inaccurate reward prediction and susceptibility to local optima. Proposed remedies include Monte Carlo estimation, which replaces a large learned value network with sampled returns; Mixture-of-Experts architectures for better scalability; and value function search to refine approximations. These advances aim to improve the performance and sample efficiency of RL agents in applications ranging from game playing and robotics to chemical synthesis planning.
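To make the Monte Carlo alternative concrete, the following is a minimal Python sketch of estimating a state's value by averaging sampled discounted returns instead of querying a learned network. It is not taken from any of the listed papers. The names `discounted_return`, `monte_carlo_value`, and `rollout`, along with the default discount factor, are assumptions chosen for illustration.

```python
from typing import Callable, Sequence


def discounted_return(rewards: Sequence[float], gamma: float = 0.99) -> float:
    """Sum of rewards discounted by gamma, accumulated from the last step back."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def monte_carlo_value(
    rollout: Callable[[], Sequence[float]],  # hypothetical: returns one episode's rewards
    num_rollouts: int = 100,
    gamma: float = 0.99,
) -> float:
    """Estimate a state's value as the mean discounted return over sampled rollouts.

    `rollout` is assumed to run the current policy from the state of interest
    and return the sequence of rewards it observed.
    """
    returns = [discounted_return(rollout(), gamma) for _ in range(num_rollouts)]
    return sum(returns) / len(returns)
```

The estimate is unbiased for the policy that generates the rollouts, but its variance shrinks only as more rollouts are sampled, so the approach trades extra environment interaction for not having to train a value network.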