Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
Transforming Location Retrieval at Airbnb: A Journey from Heuristics to Reinforcement Learning
Dillon Davis, Huiji Gao, Thomas Legrand, Weiwei Guo, Malay Haldar, Alex Deng, Han Zhao, Liwei He, Sanjeev Katariya
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning
Georgios Bakirtzis, Michail Savvas, Ruihan Zhao, Sandeep Chinchali, Ufuk Topcu
Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning
Scotty Black
Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations
Scotty Black, Christian Darken
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Johan Peralez, Aurèlien Delage, Jacopo Castellini, Rafael F. Cunha, Jilles S. Dibangoye
In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting
Haowei Du, Dongyan Zhao
PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators
Sam Earle, Zehua Jiang, Julian Togelius
A Safety-Oriented Self-Learning Algorithm for Autonomous Driving: Evolution Starting from a Basic Model
Shuo Yang, Caojun Wang, Zhenyu Ma, Yanjun Huang, Hong Chen
A Safe and Efficient Self-evolving Algorithm for Decision-making and Control of Autonomous Driving Systems
Shuo Yang, Liwen Wang, Yanjun Huang, Hong Chen
S-EPOA: Overcoming the Indivisibility of Annotations with Skill-Driven Preference-Based Reinforcement Learning
Ni Mu, Yao Luan, Yiqin Yang, Qing-shan Jia
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma, Niclas Boehmer, Lingkai Kong, Milind Tambe
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem, Shady Shehata, Fakhri Karray
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen, Kenneth Marino, Rob Fergus
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting
Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohan Sai Singamsetti, Fengyu Sun, Wei Lu, Di Niu
Optimizing Interpretable Decision Tree Policies for Reinforcement Learning
Daniël Vos, Sicco Verwer
Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration
Cheng Xu, Changtian Zhang, Yuchen Shi, Ran Wang, Shihong Duan, Yadong Wan, Xiaotong Zhang
Accelerating Goal-Conditioned RL Algorithms and Research
Michał Bortkiewicz, Władek Pałucki, Vivek Myers, Tadeusz Dziarmaga, Tomasz Arczewski, Łukasz Kuciński, Benjamin Eysenbach
The Evolution of Reinforcement Learning in Quantitative Finance
Nikolaos Pippas, Cagatay Turkay, Elliot A. Ludvig
Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning
Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong