Reward Report
Reward report research centers on efficiently learning reward functions to guide reinforcement learning (RL) agents, particularly in complex domains like large language models (LLMs) and robotics. Current efforts focus on improving reward model accuracy and efficiency through techniques like active learning, parameter insertion within existing model architectures, and leveraging vision-language models (VLMs) to generate dense reward functions. This research is crucial for advancing RL's capabilities in safety-critical applications and for aligning AI systems more effectively with human preferences, ultimately leading to more robust and beneficial AI systems.
Papers
December 7, 2022
December 1, 2022
November 29, 2022
November 20, 2022
November 16, 2022
October 23, 2022
October 21, 2022
August 10, 2022
July 16, 2022
June 17, 2022
June 1, 2022
May 20, 2022
May 16, 2022
April 22, 2022
April 5, 2022
March 25, 2022
March 19, 2022
February 21, 2022
February 11, 2022