Deep Reinforcement Learning
Deep reinforcement learning (DRL) aims to train agents to make optimal decisions in complex environments by learning through trial and error. Current research focuses on improving DRL's robustness, sample efficiency, and interpretability, often employing architectures like Proximal Policy Optimization (PPO), deep Q-networks (DQNs), and graph neural networks (GNNs) to address challenges in diverse applications such as robotics, game playing, and resource management. The resulting advancements have significant implications for various fields, enabling the development of more adaptable and efficient autonomous systems across numerous domains.
Papers
Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators
Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham, Jon Weisz, Ying Xu, Wenlong Lu, Matthew Bennice, Cody Fong, David Do, Jessica Lam, Yunfei Bai, Benjie Holson, Michael Quinlan, Noah Brown, Mrinal Kalakrishnan, Julian Ibarz, Peter Pastor, Sergey Levine
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Yangyang Zhao, Zhenyu Wang, Mehdi Dastani, Shihan Wang
Optimal Scheduling in IoT-Driven Smart Isolated Microgrids Based on Deep Reinforcement Learning
Jiaju Qi, Lei Lei, Kan Zheng, Simon X. Yang, Xuemin, Shen
Learning adaptive manipulation of objects with revolute joint: A case study on varied cabinet doors opening
Hongxiang Yu, Dashun Guo, Zhongxiang Zhou, Yue Wang, Rong Xiong
Adversarial Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman, Yexiang Xue
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
Maniraman Periyasamy, Marc Hölle, Marco Wiedmann, Daniel D. Scherer, Axel Plinge, Christopher Mutschler
SocNavGym: A Reinforcement Learning Gym for Social Navigation
Aditya Kapoor, Sushant Swamy, Luis Manso, Pilar Bachiller
Discovering Object-Centric Generalized Value Functions From Pixels
Somjit Nath, Gopeshh Raaj Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu, Ziyi Xia, Hongyang Yang, Jiechao Gao, Daochen Zha, Ming Zhu, Christina Dan Wang, Zhaoran Wang, Jian Guo
A optimization framework for herbal prescription planning based on deep reinforcement learning
Kuo Yang, Zecong Yu, Xin Su, Xiong He, Ning Wang, Qiguang Zheng, Feidie Yu, Zhuang Liu, Tiancai Wen, Xuezhong Zhou
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare