Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning
Jannis Becktepe, Julian Dierkes, Carolin Benjamins, Aditya Mohan, David Salinas, Raghu Rajan, Frank Hutter, Holger Hoos, Marius Lindauer, Theresa Eimer
Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen
Miguel Costa, Morten W. Petersen, Arthur Vandervoort, Martin Drews, Karyn Morrissey, Francisco C. Pereira
Cost-Aware Dynamic Cloud Workflow Scheduling using Self-Attention and Evolutionary Reinforcement Learning
Ya Shen, Gang Chen, Hui Ma, Mengjie Zhang
VickreyFeedback: Cost-efficient Data Construction for Reinforcement Learning from Human Feedback
Guoxi Zhang, Jiuding Duan
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu, Qiayuan Liao, Zhongyu Li, Koushil Sreenath, Negar Mehr
iWalker: Imperative Visual Planning for Walking Humanoid Robot
Xiao Lin, Yuhao Huang, Taimeng Fu, Xiaobin Xiong, Chen Wang
Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving
Zhenghao Peng, Wenjie Luo, Yiren Lu, Tianyi Shen, Cole Gulino, Ari Seff, Justin Fu
Criticality and Safety Margins for Reinforcement Learning
Alexander Grushin, Walt Woods, Alvaro Velasquez, Simon Khan
Autonomous Network Defence using Reinforcement Learning
Myles Foley, Chris Hicks, Kate Highnam, Vasilios Mavroudis
A Survey on Neural Architecture Search Based on Reinforcement Learning
Wenzhu Shao
LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots
Peilin Wu, Weiji Xie, Jiahang Cao, Hang Lai, Weinan Zhang
FactorSim: Generative Simulation via Factorized Representation
Fan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou, Alex Zook, Jonathan Tremblay, Logan Cross, Jiajun Wu, Nick Haber
Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception
Jie Jia, Yiming Shu, Zhongxue Gan, Wenchao Ding
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards
Heejin Do, Sangwon Ryu, Gary Geunbae Lee
Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Tong Xu, Chenhui Pan, Xuesu Xiao
Post-hoc Reward Calibration: A Case Study on Length Bias
Zeyu Huang, Zihan Qiu, Zili Wang, Edoardo M. Ponti, Ivan Titov
Spiders Based on Anxiety: How Reinforcement Learning Can Deliver Desired User Experience in Virtual Reality Personalized Arachnophobia Treatment
Athar Mahmoudi-Nejad, Matthew Guzdial, Pierre Boulanger
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Qining Zhang, Lei Ying
Topological Foundations of Reinforcement Learning
David Krame Kadurha
Learning with Dynamics: Autonomous Regulation of UAV Based Communication Networks with Dynamic UAV Crew
Ran Zhang, Bowei Li, Liyuan Zhang, Jiang (Linda)Xie, Miao Wang