Paper ID: 2208.01185
A Note on Zeroth-Order Optimization on the Simplex
Tijana Zrnic, Eric Mazumdar
We construct a zeroth-order gradient estimator for a smooth function defined on the probability simplex. The proposed estimator queries the simplex only. We prove that projected gradient descent and the exponential weights algorithm, when run with this estimator instead of exact gradients, converge at a $\mathcal O(T^{-1/4})$ rate.
Submitted: Aug 2, 2022