Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent
Pavel Osinenko, Grigory Yaremenko, Roman Zashchitin, Anton Bolychev, Sinan Ibrahim, Dmitrii Dobriborsci
FSL-LVLM: Friction-Aware Safety Locomotion using Large Vision Language Model in Wheeled Robots
Bo Peng, Donghoon Baek, Qijie Wang, Joao Ramos
Analysis of flexible traffic control method in SDN
Marta Szymczyk
Mitigating Dimensionality in 2D Rectangle Packing Problem under Reinforcement Learning Schema
Waldemar Kołodziejczyk, Mariusz Kaleta
Curricula for Learning Robust Policies over Factored State Representations in Changing Environments
Panayiotis Panayiotou, Özgür Şimşek
The unknotting number, hard unknot diagrams, and reinforcement learning
Taylor Applebaum, Sam Blackwell, Alex Davies, Thomas Edlich, András Juhász, Marc Lackenby, Nenad Tomašev, Daniel Zheng
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson, William Holmes, Ryan Menzies, Kez Smithson Whitehead
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models
Yifei Yao, Wentao He, Chenyu Gu, Jiaheng Du, Fuwei Tan, Zhen Zhu, Junguo Lu
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
Hanyang Zhao, Haoxian Chen, Ji Zhang, David D. Yao, Wenpin Tang
Stochastic Reinforcement Learning with Stability Guarantees for Control of Unknown Nonlinear Systems
Thanin Quartz, Ruikun Zhou, Hans De Sterck, Jun Liu
Towards Online Safety Corrections for Robotic Manipulation Policies
Ariana Spalter, Mark Roberts, Laura M. Hiatt
Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies
Alexei Pisacane, Victor-Alexandru Darvariu, Mirco Musolesi
Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning
Elizabeth Wilson, György Fazekas, Geraint Wiggins
Learning Skateboarding for Humanoid Robots through Massively Parallel Reinforcement Learning
William Thibault, Vidyasagar Rajendran, William Melek, Katja Mombaur
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov, Anja Surina, Caglar Gulcehre
Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation
Luo Ji, Gao Liu, Mingyang Yin, Hongxia Yang, Jingren Zhou
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
Luo Ji, Runji Lin
A Framework for Predicting the Impact of Game Balance Changes through Meta Discovery
Akash Saravanan, Matthew Guzdial
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Ziang Liu, Junjie Xu, Xingjiao Wu, Jing Yang, Liang He