Paper ID: 2111.02644
A Concentration Bound for LSPE($\lambda$)
Siddharth Chandak, Vivek S. Borkar, Harsh Dolhare
The popular LSPE($\lambda$) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.
Submitted: Nov 4, 2021