Recursive Quantile Estimation: Non-Asymptotic Confidence Bounds

Chen, Likai; Keilbar, Georg; Wu, Wei Biao

Citation Details

This paper considers the recursive estimation of quantiles using the stochastic gradient descent (SGD) algorithm with Polyak-Ruppert averaging. The algorithm offers a compu- tationally and memory efficient alternative to the usual empirical estimator. Our focus is on studying the non-asymptotic behavior by providing exponentially decreasing tail prob- ability bounds under mild assumptions on the smoothness of the density functions. This novel non-asymptotic result is based on a bound of the moment generating function of the SGD estimate. We apply our result to the problem of best arm identification in a multi-armed stochastic bandit setting under quantile preferences. more »

Award ID(s):: 2027723

PAR ID:: 10558456

Author(s) / Creator(s):: Chen, Likai; Keilbar, Georg; Wu, Wei Biao

Publisher / Repository:: MIT Press

Date Published:: 2023-02-01

Journal Name:: Journal of Machine Learning Research

ISSN:: 1533-7928

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this