Dense Reward
Dense reward in reinforcement learning (RL) focuses on designing reward functions that provide frequent, informative feedback to the agent, contrasting with sparse rewards that only signal success or failure. Current research explores methods for automatically generating dense rewards from sparse data, leveraging techniques like large language models and intrinsic motivation, and adapting existing RL algorithms (e.g., PPO, TD3, REDQ) to effectively utilize this richer feedback. This shift towards dense rewards promises to significantly improve sample efficiency and the ability to train RL agents for complex tasks in robotics and other domains, reducing the need for extensive manual reward engineering.
Papers
October 23, 2024
September 13, 2024
July 31, 2024
May 26, 2024
April 25, 2024
March 11, 2024
February 13, 2024
February 1, 2024
December 14, 2023
December 10, 2023
October 31, 2023
October 4, 2023