Q Function
The Q-function, central to reinforcement learning, estimates the expected cumulative reward for taking a specific action in a given state. Current research focuses on improving Q-function estimation accuracy and efficiency, particularly through variance reduction techniques, and exploring its application in diverse settings such as multi-agent systems, continuous action spaces, and large language model alignment. These advancements are driving progress in offline reinforcement learning, enabling more efficient and robust decision-making in complex environments and leading to improved performance in various applications, including robotics and healthcare.
Papers
September 18, 2023
July 30, 2023
July 25, 2023
June 28, 2023
June 5, 2023
May 29, 2023
May 2, 2023
March 15, 2023
February 15, 2023
February 7, 2023
February 1, 2023
January 15, 2023
January 5, 2023
November 22, 2022
October 14, 2022
September 7, 2022
July 22, 2022
June 15, 2022
June 1, 2022