Policy Constraint
Policy constraint in reinforcement learning focuses on ensuring learned agent policies adhere to predefined safety or behavioral limitations, preventing undesirable actions while optimizing for a primary objective. Current research emphasizes developing dynamic and adaptive constraint methods, often integrated into offline reinforcement learning algorithms like TD3-BC and CQL, or employing novel architectures such as decision transformers and conditional sequence models (e.g., SaFormer). These advancements aim to address limitations of static constraints, improve sample efficiency, and enable robust policy learning from diverse or imperfect datasets, with applications ranging from robotics to autonomous systems.
Papers
October 10, 2024
October 8, 2024
August 4, 2024
May 23, 2024
April 25, 2024
March 12, 2024
November 18, 2023
October 27, 2023
September 12, 2023
September 4, 2023
June 26, 2023
June 21, 2023
June 8, 2023
March 26, 2023
February 14, 2023
January 28, 2023
December 7, 2022
November 21, 2022
October 19, 2022