Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
Joey Hong, Jessica Lin, Anca Dragan, Sergey Levine
Inverse Transition Learning: Learning Dynamics from Demonstrations
Leo Benac, Abhishek Sharma, Sonali Parbhoo, Finale Doshi-Velez
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
Usman Anwar, Ashish Pandian, Jia Wan, David Krueger, Jakob Foerster
Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning
Satchit Chatterji, Erman Acar
Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning
Zuzanna Osika, Jazmin Zatarain-Salazar, Frans A. Oliehoek, Pradeep K. Murukannaiah
Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning
Zhiyu Shao, Qiong Wu, Pingyi Fan, Kezhi Wang, Qiang Fan, Wen Chen, Khaled B. Letaief
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao, Chenlu Ye, Quanquan Gu, Tong Zhang
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
Marvin Alles, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl
Deep Heuristic Learning for Real-Time Urban Pathfinding
Mohamed Hussein Abo El-Ela, Ali Hamdi Fergany
Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing
Yueqian Liu
Approximate Equivariance in Reinforcement Learning
Jung Yeon Park, Sujay Bhatt, Sihan Zeng, Lawson L.S. Wong, Alec Koppel, Sumitra Ganesh, Robin Walters
A Comparative Study of Deep Reinforcement Learning for Crop Production Management
Joseph Balderas, Dong Chen, Yanbo Huang, Li Wang, Ren-Cang Li
Interpretable and Efficient Data-driven Discovery and Control of Distributed Systems
Florian Wolf, Nicolò Botteghi, Urban Fasel, Andrea Manzoni
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
Alexandre Galashov, Michalis K. Titsias, András György, Clare Lyle, Razvan Pascanu, Yee Whye Teh, Maneesh Sahani
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making
Yizhe Huang, Xingbo Wang, Hao Liu, Fanqi Kong, Aoyang Qin, Min Tang, Xiaoxi Wang, Song-Chun Zhu, Mingjie Bi, Siyuan Qi, Xue Feng
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC
Tyler Clark, Mark Towers, Christine Evers, Jonathon Hare
From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Zhirui Deng, Zhicheng Dou, Yutao Zhu, Ji-Rong Wen, Ruibin Xiong, Mang Wang, Weipeng Chen
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek, Callum Rhys Tilbury, Jonathan P. Shock