Non-asymptotic Performance Guarantees for Neural Estimation of f-divergences

Sreekumar, Sreejith; Zhang, Zhengxin; Goldfeld, Ziv


Statistical distances (SDs), which quantify the dissimilarity between probability distributions, are central to machine learning and statistics. A modern method for estimating such distances from data relies on parametrizing a variational form by a neural network (NN) and optimizing it. These estimators are abundantly used in practice, but corresponding performance guarantees are partial and call for further exploration. In particular, there seems to be a fundamental tradeoff between the two sources of error involved: approximation and estimation. While the former needs the NN class to be rich and expressive, the latter relies on controlling complexity. This paper explores this tradeoff by means of non-asymptotic error bounds, focusing on three popular choices of SDs: Kullback-Leibler divergence, chi-squared divergence, and squared Hellinger distance. Our analysis relies on non-asymptotic function approximation theorems and tools from empirical process theory. Numerical results validating the theory are also provided.
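As a rough illustration of what "parametrizing a variational form by a neural network" can mean, the sketch below uses the Donsker-Varadhan representation of the Kullback-Leibler divergence. This is an assumption made here for concreteness: the abstract does not state which variational form the paper uses. The symbols $\mathcal{G}_k$ (a class of NNs of size $k$), $g$, $X_i \sim P$, $Y_i \sim Q$, and the sample size $n$ are likewise introduced only for this sketch.

```latex
% Donsker-Varadhan variational form of the KL divergence (assumed form, see lead-in)
D_{\mathrm{KL}}(P \,\|\, Q)
  = \sup_{g} \; \mathbb{E}_{P}\big[g(X)\big] - \log \mathbb{E}_{Q}\big[e^{g(Y)}\big]

% Restricting the supremum to a neural network class G_k (population level)
D_{\mathcal{G}_k}(P \,\|\, Q)
  = \sup_{g \in \mathcal{G}_k} \; \mathbb{E}_{P}\big[g(X)\big] - \log \mathbb{E}_{Q}\big[e^{g(Y)}\big]

% Neural estimator: expectations replaced by sample means over n samples from each distribution
\hat{D}_{\mathcal{G}_k, n}
  = \sup_{g \in \mathcal{G}_k} \; \frac{1}{n}\sum_{i=1}^{n} g(X_i)
    - \log\Big(\frac{1}{n}\sum_{i=1}^{n} e^{g(Y_i)}\Big)

% Triangle inequality separating the two sources of error named in the abstract
\big| D_{\mathrm{KL}}(P \,\|\, Q) - \hat{D}_{\mathcal{G}_k, n} \big|
  \le \underbrace{\big| D_{\mathrm{KL}}(P \,\|\, Q) - D_{\mathcal{G}_k}(P \,\|\, Q) \big|}_{\text{approximation error}}
    + \underbrace{\big| D_{\mathcal{G}_k}(P \,\|\, Q) - \hat{D}_{\mathcal{G}_k, n} \big|}_{\text{estimation error}}
```

Under this reading, enlarging $\mathcal{G}_k$ shrinks the first term while making the second harder to control from $n$ samples, which is the tradeoff the abstract describes.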

Award ID(s): 1740822

PAR ID: 10357783

Author(s) / Creator(s): Sreekumar, Sreejith; Zhang, Zhengxin; Goldfeld, Ziv

Date Published: 2022-01-01

Journal Name: Proceedings of Machine Learning Research

Volume: 130

ISSN: 2640-3498

Page Range / eLocation ID: 3322-3330

Format(s): Medium: X

Sponsoring Org: National Science Foundation

