Reinforcement Learning
Reinforcement learning (RL) focuses on training agents to make optimal decisions in an environment by learning through trial and error, aiming to maximize cumulative rewards. Current research emphasizes improving RL's efficiency and robustness, particularly in areas like human-in-the-loop training (e.g., using human feedback to refine models), handling uncertainty and sparse rewards, and scaling to complex tasks (e.g., robotics, autonomous driving). Prominent approaches involve various policy gradient methods, Monte Carlo Tree Search, and the integration of large language models for improved decision-making and task decomposition. These advancements are driving progress in diverse fields, including robotics, game playing, and the development of more human-aligned AI systems.
Papers
Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents
Ankita Samaddar, Nicholas Potteiger, Xenofon Koutsoukos
Generating Critical Scenarios for Testing Automated Driving Systems
Trung-Hieu Nguyen, Truong-Giang Vuong, Hong-Nam Duong, Son Nguyen, Hieu Dinh Vo, Toshiaki Aoki, Thu-Trang Nguyen
Cooperative Cruising: Reinforcement Learning based Time-Headway Control for Increased Traffic Efficiency
Yaron Veksler, Sharon Hornstein, Han Wang, Maria Laura Delle Monache, Daniel Urieli
Reinforcement learning to learn quantum states for Heisenberg scaling accuracy
Jeongwoo Jae, Jeonghoon Hong, Jinho Choo, Yeong-Dae Kwon
Step-by-Step Guidance to Differential Anemia Diagnosis with Real-World Data and Deep Reinforcement Learning
Lillian Muyama, Estelle Lu, Geoffrey Cheminet, Jacques Pouchot, Bastien Rance, Anne-Isabelle Tropeano, Antoine Neuraz, Adrien Coulet
Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum
Maximilian Schenke, Shalbus Bukarov
Selective Reviews of Bandit Problems in AI via a Statistical View
Pengjie Zhou, Haoyu Wei, Huiming Zhang
Learning Ensembles of Vision-based Safety Control Filters
Ihab Tabbara, Hussein Sibai
Towards Type Agnostic Cyber Defense Agents
Erick Galinkin, Emmanouil Pountrourakis, Spiros Mancoridis
Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective
Jinouwen Zhang, Rongkun Xue, Yazhe Niu, Yun Chen, Jing Yang, Hongsheng Li, Yu Liu
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Koprulu, Po-han Li, Tianyu Qiu, Ruihan Zhao, Tyler Westenbroek, David Fridovich-Keil, Sandeep Chinchali, Ufuk Topcu
A Memory-Based Reinforcement Learning Approach to Integrated Sensing and Communication
Homa Nikbakht, Michèle Wigger, Shlomo Shamai (Shitz), H. Vincent Poor
Provable Partially Observable Reinforcement Learning with Privileged Information
Yang Cai, Xiangyu Liu, Argyris Oikonomou, Kaiqing Zhang
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
Zhe Wang, Haozhu Wang, Yanjun Qi
Linear Probe Penalties Reduce LLM Sycophancy
Henry Papadatos, Rachel Freedman
Bilinear Convolution Decomposition for Causal RL Interpretability
Narmeen Oozeer, Sinem Erisken, Alice Rigg
Online Poisoning Attack Against Reinforcement Learning under Black-box Environments
Jianhui Li, Bokang Zhang, Junfeng Wu
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
Ke Yan
Learning Dynamic Weight Adjustment for Spatial-Temporal Trajectory Planning in Crowd Navigation
Muqing Cao, Xinhang Xu, Yizhuo Yang, Jianping Li, Tongxing Jin, Pengfei Wang, Tzu-Yi Hung, Guosheng Lin, Lihua Xie
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi, Huaxin Pei, Liang Feng, Yi Zhang, Danya Yao