Reward Report
Reward report research centers on efficiently learning reward functions to guide reinforcement learning (RL) agents, particularly in complex domains like large language models (LLMs) and robotics. Current efforts focus on improving reward model accuracy and efficiency through techniques like active learning, parameter insertion within existing model architectures, and leveraging vision-language models (VLMs) to generate dense reward functions. This research is crucial for advancing RL's capabilities in safety-critical applications and for aligning AI systems more effectively with human preferences, ultimately leading to more robust and beneficial AI systems.
Papers
August 24, 2023
July 13, 2023
June 29, 2023
June 27, 2023
June 14, 2023
June 2, 2023
June 1, 2023
May 25, 2023
May 23, 2023
May 16, 2023
April 11, 2023
April 6, 2023
March 22, 2023
February 20, 2023
February 9, 2023
January 26, 2023
January 18, 2023
January 2, 2023
December 21, 2022