Efficient Statistics, in High Dimensions, from Truncated Samples

Daskalakis, C.; Gouleakis, T.; Tzamos, C.; Zampetakis, M.

Citation Details

We provide an efficient algorithm for the classical problem, going back to Galton, Pearson, and Fisher, of estimating, with arbitrary accuracy, the parameters of a multivariate normal distribution from truncated samples. Truncated samples from a d-variate normal N(μ,Σ) means that a sample is revealed only if it falls in some subset S ⊆ R^d; otherwise the sample is hidden, and the count of hidden samples in proportion to the revealed samples is also hidden. We show that the mean μ and covariance matrix Σ can be estimated with arbitrary accuracy in polynomial time, as long as we have oracle access to S and S has non-trivial measure under the unknown d-variate normal distribution. Additionally, we show that without oracle access to S, any non-trivial estimation is impossible.
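To make the truncation model concrete, the following is a minimal Python sketch of how truncated samples arise. It is not the paper's estimation algorithm. The function name `truncated_normal_samples`, the membership oracle `in_S`, and the half-space chosen for S are all hypothetical choices made for illustration. The sketch draws from N(μ,Σ), keeps only the draws that fall in S, discards the rest without counting them, and then computes the naive empirical mean of what was revealed.

```python
import numpy as np

def truncated_normal_samples(mu, sigma, in_S, n, rng, batch=4096):
    """Return n draws from N(mu, sigma) conditioned on falling in S.

    in_S is a membership oracle (an assumption of this sketch): it maps
    an array of shape (m, d) to a boolean array of shape (m,).
    Rejected draws are dropped and never counted, so the observer does
    not learn what fraction of samples was hidden.
    The loop does not terminate if S has zero mass under N(mu, sigma).
    """
    kept, total = [], 0
    while total < n:
        x = rng.multivariate_normal(mu, sigma, size=batch)
        x = x[in_S(x)]          # reveal only the samples inside S
        kept.append(x)
        total += len(x)
    return np.concatenate(kept)[:n]

rng = np.random.default_rng(0)
mu = np.zeros(2)                # true mean (unknown to the estimator)
sigma = np.eye(2)               # true covariance (unknown to the estimator)

# Hypothetical truncation set: the half-space where the first coordinate is positive.
in_S = lambda x: x[:, 0] > 0.0

samples = truncated_normal_samples(mu, sigma, in_S, 20000, rng)
naive_mean = samples.mean(axis=0)
print("naive empirical mean of revealed samples:", naive_mean)
```

For this particular S, the first coordinate of the revealed samples follows a half-normal distribution with mean sqrt(2/π) ≈ 0.80, so the naive empirical mean is far from the true value 0 in that coordinate. This is why the truncation has to be accounted for explicitly when estimating μ and Σ.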

Award ID(s): 1741137; 1650733

PAR ID: 10078464

Author(s) / Creator(s): Daskalakis, C.; Gouleakis, T.; Tzamos, C.; Zampetakis, M.

Date Published: 2018-10-07

Journal Name: Annual Symposium on Foundations of Computer Science

ISSN: 0272-5428

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper:
The DOI is not currently available.