Reward Report
Reward report research centers on efficiently learning reward functions to guide reinforcement learning (RL) agents, particularly in complex domains like large language models (LLMs) and robotics. Current efforts focus on improving reward model accuracy and efficiency through techniques like active learning, parameter insertion within existing model architectures, and leveraging vision-language models (VLMs) to generate dense reward functions. This research is crucial for advancing RL's capabilities in safety-critical applications and for aligning AI systems more effectively with human preferences, ultimately leading to more robust and beneficial AI systems.
Papers
November 17, 2024
November 15, 2024
November 2, 2024
October 30, 2024
October 24, 2024
October 23, 2024
October 8, 2024
October 3, 2024
August 20, 2024
July 18, 2024
July 3, 2024
June 21, 2024
June 12, 2024
June 4, 2024
June 3, 2024
May 28, 2024
May 16, 2024
May 15, 2024
May 6, 2024