Reward Report
Reward report research centers on efficiently learning reward functions to guide reinforcement learning (RL) agents, particularly in complex domains like large language models (LLMs) and robotics. Current efforts focus on improving reward model accuracy and efficiency through techniques like active learning, parameter insertion within existing model architectures, and leveraging vision-language models (VLMs) to generate dense reward functions. This research is crucial for advancing RL's capabilities in safety-critical applications and for aligning AI systems more effectively with human preferences, ultimately leading to more robust and beneficial AI systems.
Papers
April 18, 2024
March 18, 2024
February 7, 2024
February 1, 2024
January 16, 2024
January 15, 2024
January 11, 2024
December 27, 2023
December 21, 2023
December 18, 2023
December 14, 2023
December 11, 2023
November 28, 2023
November 10, 2023
October 29, 2023
October 23, 2023
October 3, 2023
September 28, 2023
September 23, 2023