Hoeffding's inequality for Markov chains and its applications to statistical learning

Fan, J.

Citation Details

This paper establishes Hoeffding’s lemma and inequality for bounded functions of general-state-space and not necessarily reversible Markov chains. The sharpness of these results is characterized by the optimality of the ratio between variance prox- ies in the Markov-dependent and independent settings. The boundedness of functions is shown necessary for such results to hold in general. To showcase the usefulness of the new results, we apply them for non-asymptotic analyses of MCMC estima- tion, respondent-driven sampling and high-dimensional covariance matrix estimation on time series data with a Markovian nature. In addition to statistical problems, we also apply them to study the time-discounted rewards in econometric models and the multi-armed bandit problem with Markovian rewards arising from the field of machine learning. more »

Award ID(s):: 1662139 1712591

PAR ID:: 10326333

Author(s) / Creator(s):: Fan, J.

Date Published:: 2021-04-01

Journal Name:: Journal of machine learning research

Volume:: 22

ISSN:: 1533-7928

Page Range / eLocation ID:: 1-35

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this