Temporal Difference
Temporal difference (TD) learning is a core reinforcement learning method that estimates the value of states or state-action pairs by bootstrapping: updating each estimate toward a target built from the predicted value of successor states. Current research focuses on improving TD's efficiency and stability, particularly in combination with deep neural networks, by addressing issues such as variance reduction, handling uncertainty, and tuning algorithm parameters (e.g., step size and target-network update schedules). These advances improve the performance and robustness of reinforcement learning agents across applications ranging from robotics and game playing to more complex control problems and even supervised learning tasks.
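To make the bootstrapping idea concrete, below is a minimal TD(0) prediction sketch on the classic random-walk chain (a standard textbook example, not taken from this page). The environment, reward scheme, and hyperparameters (`alpha`, `gamma`, episode count) are illustrative assumptions; the key line is the update toward a target that uses the current estimate of the next state's value.

```python
import random

def td0_value_estimate(num_states=5, episodes=2000, alpha=0.1, gamma=1.0, seed=0):
    """TD(0) prediction on a simple random-walk chain (illustrative setup).

    States 0..num_states-1; each episode starts in the middle, steps left or
    right uniformly at random, and terminates off either end. Reward is +1
    for exiting on the right, 0 otherwise.
    """
    rng = random.Random(seed)
    V = [0.0] * num_states  # value estimates for the non-terminal states
    for _ in range(episodes):
        s = num_states // 2
        while True:
            s_next = s + rng.choice((-1, 1))
            if s_next < 0:               # exit left: reward 0, terminal value 0
                target = 0.0
            elif s_next >= num_states:   # exit right: reward +1, terminal value 0
                target = 1.0
            else:                        # bootstrap from the next state's estimate
                target = gamma * V[s_next]
            V[s] += alpha * (target - V[s])  # TD(0) update toward the target
            if s_next < 0 or s_next >= num_states:
                break
            s = s_next
    return V

values = td0_value_estimate()
```

With these settings the estimates approach the true values (1/6, 2/6, ..., 5/6 for a 5-state chain), illustrating how each state's value is learned from its neighbors' predictions rather than from full episode returns.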