Paper ID: 2111.02644

A Concentration Bound for LSPE($\lambda$)

Siddharth Chandak, Vivek S. Borkar, Harsh Dolhare

The popular LSPE($\lambda$) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

Submitted: Nov 4, 2021