Recovering Probability Distributions from Missing Data

Tian, Jin

Citation Details

A probabilistic query may not be estimable from observed data corrupted by missing values if the data are not missing at random (MAR). It is therefore of theoretical interest and practical importance to determine in principle whether a probabilistic query is estimable from missing data or not when the data are not MAR. We present algorithms that systematically determine whether the joint probability distribution or a target marginal distribution is estimable from observed data with missing values, assuming that the data-generation model is represented as a Bayesian network, known as m-graphs, that not only encodes the dependencies among the variables but also explicitly portrays the mechanisms responsible for the missingness process. The results significantly advance the existing work. more »

Award ID(s):: 1704352

PAR ID:: 10060355

Author(s) / Creator(s):: Tian, Jin

Date Published:: 2017-01-01

Journal Name:: Proceedings of the Ninth Asian Conference on Machine Learning

Volume:: PMLR 77

Page Range / eLocation ID:: 574-589

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this