Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
Quantum framework for Reinforcement Learning: integrating Markov Decision Process, quantum arithmetic, and trajectory search
Thet Htar Su, Shaswot Shresthamali, Masaaki Kondo
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Lixian Jing, Jianpeng Qi, Junyu Dong, Yanwei Yu
Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning
Orson Mengara
HyperQ-Opt: Q-learning for Hyperparameter Optimization
Md. Tarek Hasan
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
Akane Tsuboya, Yu Kono, Tatsuji Takahashi
LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation
Riku Uemura, Kanji Tanaka, Kenta Tsukahara, Daiki Iwata
Fairness in Reinforcement Learning with Bisimulation Metrics
Sahand Rezaei-Shoshtari, Hanna Yurchyk, Scott Fujimoto, Doina Precup, David Meger
Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps
Benjamin Ellis, Matthew T. Jackson, Andrei Lupu, Alexander D. Goldie, Mattie Fellows, Shimon Whiteson, Jakob Foerster
Environment Descriptions for Usability and Generalisation in Reinforcement Learning
Dennis J.N.J. Soemers, Spyridon Samothrakis, Kurt Driessens, Mark H.M. Winands
Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model
Songjun Tu, Jingbo Sun, Qichao Zhang, Xiangyuan Lan, Dongbin Zhao
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu, Yinuo Zhao, Zhiyuan Xu, Zhengping Che, Chengxiang Yin, Chi Harold Liu, Qinru Qiu, Feiferi Feng, Jian Tang
Online Learning from Strategic Human Feedback in LLM Fine-Tuning
Shugang Hao, Lingjie Duan
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Amirhossein Mesbah, Reshad Hosseini, Seyed Pooya Shariatpanahi, Majid Nili Ahmadabadi
On Enhancing Network Throughput using Reinforcement Learning in Sliced Testbeds
Daniel Pereira Monteiro, Lucas Nardelli de Freitas Botelho Saar, Larissa Ferreira Rodrigues Moreira, Rodrigo Moreira
Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning
Rashmeet Kaur Nayyar, Siddharth Srivastava
Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
Joshua Holder, Natasha Jaques, Mehran Mesbahi
VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Zilin Huang, Zihao Sheng, Yansong Qu, Junwei You, Sikai Chen
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch
Shengyu Feng, Yiming Yang
Generalized Back-Stepping Experience Replay in Sparse-Reward Environments
Guwen Lyu, Masahiro Sato
Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning
Yangkun Chen, Kai Yang, Jian Tao, Jiafei Lyu