Title: Semiparametric imputation using conditional Gaussian mixture models under item nonresponse
Abstract Imputation is a popular technique for handling item nonresponse. Parametric imputation is based on a parametric imputation model and is not robust against misspecification of that model. Nonparametric imputation is fully robust but is not applicable when the dimension of the covariates is large, due to the curse of dimensionality. Semiparametric imputation is another robust approach, based on a flexible model in which the number of model parameters can increase with the sample size. In this paper, we propose a new semiparametric imputation method based on a more flexible model assumption than the Gaussian mixture model. In the proposed mixture model, we assume a conditional Gaussian model for the study variable given the auxiliary variables, but the marginal distribution of the auxiliary variables is not necessarily Gaussian. The proposed mixture model is more flexible and achieves a better approximation than Gaussian mixture models. The proposed method is applicable to high-dimensional covariate problems by including a penalty function in the conditional log-likelihood function. The method is applied to the 2017 Korean Household Income and Expenditure Survey conducted by Statistics Korea.
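The general idea can be illustrated with a small sketch: fit a mixture of Gaussian linear regressions for the study variable given the covariates by EM on the complete cases, then impute each missing value by drawing from the fitted conditional mixture. This is only a minimal illustration of the conditional-Gaussian-mixture idea, not the authors' implementation; the component count, the EM details, and the omission of the penalty term used for high-dimensional covariates are simplifying assumptions, and all function and variable names are illustrative.

```python
# Minimal sketch: EM for a mixture of Gaussian linear regressions,
# followed by stochastic imputation of missing y from the fitted
# conditional mixture.  Illustrative only; not the authors' method.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

def fit_mixture_of_regressions(X, y, K=2, n_iter=200):
    """EM for a K-component mixture of Gaussian linear regressions."""
    n, p = X.shape
    Xd = np.column_stack([np.ones(n), X])            # design matrix with intercept
    beta = rng.normal(size=(K, p + 1)) * 0.1         # component regression coefficients
    sigma = np.full(K, y.std() + 1e-6)               # component residual SDs
    pi = np.full(K, 1.0 / K)                         # mixing proportions
    for _ in range(n_iter):
        # E-step: posterior probability that each complete case comes from component k
        dens = np.stack([pi[k] * norm.pdf(y, Xd @ beta[k], sigma[k])
                         for k in range(K)], axis=1)
        resp = dens / dens.sum(axis=1, keepdims=True)
        # M-step: weighted least squares and weighted residual variance per component
        for k in range(K):
            w = resp[:, k]
            beta[k] = np.linalg.solve(Xd.T @ (Xd * w[:, None]), Xd.T @ (w * y))
            resid = y - Xd @ beta[k]
            sigma[k] = np.sqrt((w * resid ** 2).sum() / w.sum())
        pi = resp.mean(axis=0)
    return beta, sigma, pi

def impute_missing_y(X_mis, beta, sigma, pi):
    """Draw imputed values of y from the fitted conditional Gaussian mixture."""
    Xd = np.column_stack([np.ones(len(X_mis)), X_mis])
    comp = rng.choice(len(pi), size=len(X_mis), p=pi)   # sample a component per unit
    mu = np.einsum('ij,ij->i', Xd, beta[comp])          # component-specific conditional mean
    return rng.normal(mu, sigma[comp])
```

Drawing from the fitted conditional distribution, rather than plugging in its mean, preserves the conditional variability of the imputed values.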
Award ID(s):
1733572 1931380
PAR ID:
10364514
Author(s) / Creator(s):
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Biometrics
Volume:
78
Issue:
1
ISSN:
0006-341X
Format(s):
Medium: X
Size(s):
p. 227-237
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The analysis of time series data with detection limits is challenging due to the high-dimensional integral involved in the likelihood. Existing methods are either computationally demanding or rely on restrictive parametric distributional assumptions. We propose a semiparametric approach, where the temporal dependence is captured by a parametric copula while the marginal distribution is estimated nonparametrically. Utilizing the properties of copulas, we develop a new copula-based sequential sampling algorithm, which provides a convenient way to calculate the censored likelihood. Even without full parametric distributional assumptions, the proposed method still allows us to efficiently compute the conditional quantiles of the censored response at a future time point, and thus construct both point and interval predictions. We establish the asymptotic properties of the proposed pseudo maximum likelihood estimator, and demonstrate through simulation and the analysis of a water quality dataset that the proposed method is more flexible and leads to more accurate predictions than Gaussian-based methods for non-normal data. The Canadian Journal of Statistics, 47: 438–454; 2019. © 2019 Statistical Society of Canada
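As a rough sketch of the copula idea in this abstract (leaving out the censored-likelihood and sequential-sampling machinery that is the paper's actual contribution), one can estimate the margin with the empirical CDF, transform to Gaussian scores, and fit an AR(1) Gaussian copula by pseudo maximum likelihood. The copula family, the AR(1) structure, and all names below are illustrative assumptions.

```python
# Sketch of semiparametric copula estimation: empirical-CDF margin,
# Gaussian scores, AR(1) Gaussian copula fitted by pseudo maximum likelihood.
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.stats import norm

def fit_ar1_gaussian_copula(y):
    """Pseudo-MLE of the AR(1) Gaussian copula parameter for a series y."""
    n = len(y)
    ranks = np.argsort(np.argsort(y)) + 1          # 1..n ranks
    u = ranks / (n + 1)                            # rescaled empirical CDF
    z = norm.ppf(u)                                # Gaussian scores

    def neg_pseudo_loglik(rho):
        # log copula density of consecutive pairs, written on the z-scale
        mu = rho * z[:-1]
        sd = np.sqrt(1 - rho ** 2)
        return -(norm.logpdf(z[1:], mu, sd) - norm.logpdf(z[1:])).sum()

    res = minimize_scalar(neg_pseudo_loglik, bounds=(-0.99, 0.99),
                          method='bounded')
    return res.x

# Toy check: a log-normal series driven by a Gaussian AR(1) with rho = 0.6;
# the estimate should land near 0.6 despite the skewed margin.
rng = np.random.default_rng(1)
e = rng.normal(size=500)
x = np.empty(500)
x[0] = e[0]
for t in range(1, 500):
    x[t] = 0.6 * x[t - 1] + np.sqrt(1 - 0.6 ** 2) * e[t]
print(fit_ar1_gaussian_copula(np.exp(x)))
```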
  2. Abstract The joint analysis of spatial and temporal processes poses computational challenges due to the data's high dimensionality. Furthermore, such data are commonly non-Gaussian. In this paper, we introduce a copula-based spatiotemporal model for analyzing spatiotemporal data and propose a semiparametric estimator. The proposed algorithm is computationally simple, since it models the marginal distribution and the spatiotemporal dependence separately. Instead of assuming a parametric distribution, the proposed method models the marginal distributions nonparametrically and thus offers more flexibility. The method also provides a convenient way to construct both point and interval predictions at new times and locations, based on the estimated conditional quantiles. Through a simulation study and an analysis of wind speeds observed along the border between Oregon and Washington, we show that our method produces more accurate point and interval predictions for skewed data than those based on normality assumptions. 
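Both copula abstracts above build point and interval predictions from conditional quantiles. A minimal sketch of that step, assuming an AR(1) Gaussian copula parameter rho_hat has already been estimated (for example by a pseudo-likelihood fit like the one sketched earlier) and inverting the nonparametric margin through sample quantiles:

```python
# Sketch: conditional quantiles of the next observation under a fitted
# AR(1) Gaussian copula with a nonparametric margin.  Names illustrative.
import numpy as np
from scipy.stats import norm

def conditional_quantile_prediction(y, rho_hat, taus=(0.025, 0.5, 0.975)):
    """Conditional quantiles of y_{n+1} given the last observed value."""
    n = len(y)
    u_last = np.sum(y <= y[-1]) / (n + 1)          # empirical CDF at y_n
    z_last = norm.ppf(u_last)
    preds = []
    for tau in taus:
        # tau-th conditional quantile on the Gaussian copula (z) scale
        z_q = rho_hat * z_last + np.sqrt(1 - rho_hat ** 2) * norm.ppf(tau)
        preds.append(np.quantile(y, norm.cdf(z_q)))  # back to the data scale
    return preds                                     # [lower, median, upper]
```

With the default quantile levels, the middle value serves as a point prediction and the outer two as a 95% prediction interval.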
  3. Nonparametric model-assisted estimators have been proposed to improve estimates of finite population parameters. Flexible nonparametric models provide more reliable estimators when a parametric model is misspecified. In this article, we propose an information criterion to select appropriate auxiliary variables to use in an additive model-assisted method. We approximate the additive nonparametric components using polynomial splines and extend the Bayesian Information Criterion (BIC) for finite populations. By removing irrelevant auxiliary variables, our method reduces model complexity and decreases estimator variance. We establish that the proposed BIC is asymptotically consistent in selecting the important explanatory variables when the true model is additive without interactions, a result supported by our numerical study. Our proposed method is easier to implement and better justified theoretically than the existing method proposed in the literature. 
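A minimal sketch of the ingredients described here: approximate each additive component by a polynomial spline basis, fit by least squares, and compare covariate subsets with a BIC-type criterion. The design-weighted, finite-population version studied in the paper is not reproduced; the truncated power basis, knot count, and exhaustive search are illustrative choices.

```python
# Sketch: spline-based additive fits compared by a BIC-type criterion
# to select auxiliary variables.  Illustrative, non-survey-weighted version.
import itertools
import numpy as np

def spline_basis(x, n_knots=5, degree=3):
    """Truncated power basis for one covariate."""
    knots = np.quantile(x, np.linspace(0, 1, n_knots + 2)[1:-1])
    cols = [x ** d for d in range(1, degree + 1)]
    cols += [np.clip(x - k, 0, None) ** degree for k in knots]
    return np.column_stack(cols)

def additive_bic(X, y, subset):
    """BIC of an additive spline fit using the covariates in `subset`."""
    n = len(y)
    B = np.column_stack([np.ones(n)] + [spline_basis(X[:, j]) for j in subset])
    coef, *_ = np.linalg.lstsq(B, y, rcond=None)
    rss = np.sum((y - B @ coef) ** 2)
    return n * np.log(rss / n) + B.shape[1] * np.log(n)

def select_variables(X, y):
    """Exhaustive BIC search over covariate subsets (small p only)."""
    p = X.shape[1]
    subsets = itertools.chain.from_iterable(
        itertools.combinations(range(p), r) for r in range(p + 1))
    return min(subsets, key=lambda s: additive_bic(X, y, list(s)))
```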
  4. Abstract Data integration combining a probability sample with another nonprobability sample is an emerging area of research in survey sampling. We consider the case when the study variable of interest is measured only in the nonprobability sample, but comparable auxiliary information is available for both data sources. We consider mass imputation for the probability sample using the nonprobability data as the training set for imputation. Parametric mass imputation is sensitive to parametric model assumptions. To develop improved and robust methods, we consider nonparametric mass imputation for data integration. In particular, we consider kernel smoothing for a low-dimensional covariate and generalized additive models for a relatively high-dimensional covariate for imputation. Asymptotic theories and variance estimation are developed. Simulation studies and real applications show the benefits of our proposed methods over parametric counterparts.
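A minimal sketch of kernel-smoothing mass imputation under the setup described above, assuming a single covariate: the outcome regression is estimated on the nonprobability sample with a Nadaraya-Watson smoother, imputed values are produced for every unit of the probability sample, and the design weights give the final estimator. The bandwidth, the simple weighted mean, and all names are illustrative assumptions; the paper's GAM-based version and variance estimation are omitted.

```python
# Sketch of mass imputation for data integration with a 1-D covariate:
# fit E[y | x] on the nonprobability sample, impute into the probability
# sample, then compute a design-weighted mean.  Illustrative only.
import numpy as np

def nadaraya_watson(x_train, y_train, x_new, bandwidth):
    """Gaussian-kernel estimate of E[y | x] at the points x_new."""
    d = (x_new[:, None] - x_train[None, :]) / bandwidth
    w = np.exp(-0.5 * d ** 2)
    return (w @ y_train) / w.sum(axis=1)

def mass_imputation_mean(x_np, y_np, x_prob, design_wts, bandwidth=0.5):
    """Design-weighted mean of y mass-imputed into the probability sample."""
    y_imp = nadaraya_watson(x_np, y_np, x_prob, bandwidth)
    return np.sum(design_wts * y_imp) / np.sum(design_wts)
```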