Q Learning
Q-learning is a reinforcement learning algorithm aiming to find optimal actions in an environment by learning a Q-function that estimates the expected cumulative reward for each state-action pair. Current research focuses on improving Q-learning's robustness, efficiency, and applicability to complex scenarios, including multi-agent systems, partially observable environments (POMDPs), and those with corrupted rewards, often employing deep learning architectures like deep Q-networks (DQNs) and modifications such as double Q-learning and prioritized experience replay. These advancements are significant for addressing challenges in various fields, such as robotics, autonomous systems, and network optimization, where efficient and reliable decision-making under uncertainty is crucial.
Papers
RL-GA: A Reinforcement Learning-Based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem
Yanjie Song, Luona Wei, Qing Yang, Jian Wu, Lining Xing, Yingwu Chen
Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
Dongwon Son, Myungsin Kim, Jaecheol Sim, Wonsik Shin