Hierarchical Reward
Hierarchical reward structures in reinforcement learning aim to improve agent training by decomposing complex tasks into simpler sub-goals, providing more effective guidance and faster learning. Current research focuses on developing methods to automatically discover or design these hierarchies, often employing techniques like contrastive learning and incorporating domain knowledge (e.g., physics models) to enhance efficiency and generalization. This approach is proving particularly valuable in challenging domains such as robotics and language model alignment, where sparse or inconsistent feedback makes traditional reinforcement learning methods less effective, leading to improved performance and sample efficiency.
Papers
March 11, 2024
March 4, 2024
July 7, 2023
April 30, 2023
January 21, 2023
May 26, 2022