Action Sampling
Action sampling, a technique used in reinforcement learning, focuses on optimizing how agents select actions, aiming to improve exploration-exploitation balance and learning efficiency. Current research explores adaptive action sampling methods within various reinforcement learning frameworks, including Proximal Policy Optimization (PPO) and off-policy actor-critic algorithms, often incorporating colored noise or dataset constraints to refine action selection. These advancements are improving the performance of reinforcement learning agents across diverse applications, from job shop scheduling to robotic manipulation and dialogue generation, by enhancing sample efficiency and mitigating issues like value overestimation.
Papers
October 22, 2024
June 11, 2024
December 18, 2023
June 11, 2023
June 5, 2023
March 22, 2023
October 24, 2022
March 1, 2022