Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation
James R. Han, Hugues Thomas, Jian Zhang, Nicholas Rhinehart, Timothy D. Barfoot
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes
Juan Sebastian Rojas, Chi-Guhn Lee
Compositional Shielding and Reinforcement Learning for Multi-Agent Systems
Asger Horn Brorholt, Kim Guldstrand Larsen, Christian Schilling
Reinforcement Learning For Quadrupedal Locomotion: Current Advancements And Future Perspectives
Maurya Gurram, Prakash Kumar Uttam, Shantipal S. Ohol
QE-EBM: Using Quality Estimators as Energy Loss for Machine Translation
Gahyun Yoo, Jay Yoon Lee
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies
Jiajie Yu, Yuhong Wang, Wei Ma
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning
Hung Le, Kien Do, Dung Nguyen, Sunil Gupta, Svetha Venkatesh
Content Caching-Assisted Vehicular Edge Computing Using Multi-Agent Graph Attention Reinforcement Learning
Jinjin Shen, Yan Lin, Yijin Zhang, Weibin Zhang, Feng Shu, Jun Li
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale
Andrew Jesson, Yiding Jiang
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu, Minghui Zhu
Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained Models
Chengshuai Shi, Kun Yang, Jing Yang, Cong Shen
Generalization of Compositional Tasks with Logical Specification via Implicit Planning
Duo Xu, Faramarz Fekri
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As, Bhavya Sukhija, Lenart Treven, Carmelo Sferrazza, Stelian Coros, Andreas Krause
Reinforcement Learning in Hyperbolic Spaces: Models and Experiments
Vladimir Jaćimović, Zinaid Kapić, Aladin Crnkić
Multiple Ships Cooperative Navigation and Collision Avoidance using Multi-agent Reinforcement Learning with Communication
Y. Wang, Y. Zhao
Towards a Domain-Specific Modelling Environment for Reinforcement Learning
Natalie Sinani, Sahil Salma, Paul Boutot, Sadaf Mustafiz
SeRA: Self-Reviewing and Alignment of Large Language Models using Implicit Reward Margins
Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh, Sailik Sengupta, Sravan Bodapati, Aram Galstyan
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization
Guanlin Liu, Kaixuan Ji, Renjie Zheng, Zheng Wu, Chen Dun, Quanquan Gu, Lin Yan
Physical Simulation for Multi-agent Multi-machine Tending
Abdalwhab Abdalwhab, Giovanni Beltrame, David St-Onge
Hierarchical Universal Value Function Approximators
Rushiv Arora