Q Learning
Q-learning is a reinforcement learning algorithm aiming to find optimal actions in an environment by learning a Q-function that estimates the expected cumulative reward for each state-action pair. Current research focuses on improving Q-learning's robustness, efficiency, and applicability to complex scenarios, including multi-agent systems, partially observable environments (POMDPs), and those with corrupted rewards, often employing deep learning architectures like deep Q-networks (DQNs) and modifications such as double Q-learning and prioritized experience replay. These advancements are significant for addressing challenges in various fields, such as robotics, autonomous systems, and network optimization, where efficient and reliable decision-making under uncertainty is crucial.
Papers
An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments
Kewen Ding, Peter Vamplew, Cameron Foale, Richard Dazeley
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning
Dohyeok Lee, Seungyub Han, Taehyun Cho, Jungwoo Lee
Q-Boost: On Visual Quality Assessment Ability of Low-level Multi-Modality Foundation Models
Zicheng Zhang, Haoning Wu, Zhongpeng Ji, Chunyi Li, Erli Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Fengyu Sun, Shangling Jui, Weisi Lin, Guangtao Zhai
Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic
Elizabeth Akinyi Ondula, Bhaskar Krishnamachari
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Meshal Alharbi, Mardavij Roozbehani, Munther Dahleh
Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning
Yanwen Ba, Xuan Liu, Xinning Chen, Hao Wang, Yang Xu, Kenli Li, Shigeng Zhang
Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos
Aamal Hussain, Francesco Belardinelli
Model-free Posterior Sampling via Learning Rate Randomization
Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Remi Munos, Alexey Naumov, Pierre Perrault, Michal Valko, Pierre Menard
Lifting the Veil: Unlocking the Power of Depth in Q-learning
Shao-Bo Lin, Tao Li, Shaojie Tang, Yao Wang, Ding-Xuan Zhou