Delayed Feedback

Delayed feedback, where the consequences of actions are not immediately observable, poses a significant challenge across numerous machine learning and control applications. Current research focuses on developing algorithms that effectively learn and optimize under these conditions, employing techniques such as multi-armed bandits, Thompson sampling, and various forms of online convex optimization, often incorporating model architectures like feedback delay networks and graph neural networks to handle the temporal aspect of delayed information. Addressing delayed feedback is crucial for improving the efficiency and robustness of systems in diverse fields, ranging from online advertising and recommendation systems to robotics and control engineering. The development of theoretically sound and practically efficient algorithms for handling delayed feedback remains a vibrant area of research with significant implications for real-world applications.

Papers

February 3, 2023

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Yunchang Yang, Han Zhong, Tianhao Wu, Bin Liu, Liwei Wang, Simon S. Du
Sequential Decision Single Agent Model Reduction Sequential Decision Making Delayed Feedback Multi Agent Sequential Decision

February 1, 2023

Delayed Feedback in Kernel Bandits
Sattar Vakili, Danyal Ahmed, Alberto Bernacchia, Ciara Pike-Burke
Black Box Optimization O$ Regret Bayesian Optimisation Delayed Feedback Kernel Bandit

December 1, 2022

Gated Recurrent Neural Networks with Weighted Time-Delay Feedback
N. Benjamin Erichson, Soon Hoe Lim, Michael W. Mahoney
Discrete Time Delayed Feedback Recurrent Neural Network Architecture Recurrent Unit Gated Recurrent

July 21, 2022

Delayed Feedback in Generalised Linear Bandits Revisited
Benjamin Howson, Ciara Pike-Burke, Sarah Filippi
Linear Bandit Sequential Decision Making Problem Regret Guarantee Delayed Feedback Optimistic Algorithm

June 29, 2022

A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback
Saeed Masoudian, Julian Zimmert, Yevgeny Seldin
Regret Guarantee Delayed Feedback Adversarial Setting Best of Both World Algorithm

June 19, 2022

Bayesian Optimization under Stochastic Delayed Feedback
Arun Verma, Zhongxiang Dai, Bryan Kian Hsiang Low
Bayesian Optimization Zeroth Order Delayed Feedback Gaussian Process Bandit

June 1, 2022

Generalized Delayed Feedback Model with Post-Click Information in Recommender Systems
Jia-Qi Yang, De-Chuan Zhan
Recommender System Delayed Feedback Conversion Rate Prediction Post Click

May 29, 2022

Input-to-State Safety with Input Delay in Longitudinal Vehicle Control
Tamas G. Molnar, Anil Alan, Adam K. Kiss, Aaron D. Ames, Gabor Orosz
External Control Control Barrier Function Delayed Feedback Input to State Longitudinal Control Disturbance Injection

May 24, 2022

Multi-Head Online Learning for Delayed Feedback Modeling
Hui Gao, Yihan Yang
Multi Head Automated Conversion Delayed Feedback Online Advertising Semantics Freshness Conversion Rate

February 24, 2022

Thompson Sampling with Unrestricted Delays
Han Wu, Stefan Wager
Multi Armed Bandit Regret Bound Thompson Sampling Delayed Feedback Delay Distribution

February 14, 2022

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction
Yu Chen, Jiaqi Jin, Hui Zhao, Pengjie Wang, Guojun Liu, Jian Xu, Bo Zheng
Importance Sampling Negative Sample Delayed Feedback Excess Delay Unbiased Estimator Unbiased Evaluation

February 2, 2022

Adaptive Experimentation with Delayed Binary Feedback
Zenan Wang, Carlos Carrion, Xiliang Lin, Fuhua Ji, Yongjun Bao, Weipeng Yan
Multi Armed Bandit Delayed Feedback Adaptive Experiment Experimentation Platform

January 18, 2022

DEFER: Distributed Edge Inference for Deep Neural Networks
Arjun Parthasarathy, Bhaskar Krishnamachari
Deep Neural Network Delayed Feedback Device Inference Edge Inference Compute Node

December 15, 2021

Safety-Critical Control with Input Delay in Dynamic Environment
Tamas G. Molnar, Adam K. Kiss, Aaron D. Ames, Gábor Orosz
Dynamic Environment Safety Guarantee Safety Critical Control Adaptive Cruise Control Control Synthesis Delayed Feedback Neural Control Barrier Function