Title: Interpolating discriminant functions in high-dimensional Gaussian latent mixtures
Abstract: This paper considers binary classification of high-dimensional features under a postulated model with a low-dimensional latent Gaussian mixture structure and nonvanishing noise. A generalized least-squares estimator is used to estimate the direction of the optimal separating hyperplane. The estimated hyperplane is shown to interpolate the training data. While the direction vector can be consistently estimated, as could be expected from recent results in linear regression, a naive plug-in estimate fails to consistently estimate the intercept. A simple correction, which requires an independent hold-out sample, renders the procedure minimax optimal in many scenarios. The interpolation property of the latter procedure can be retained, but surprisingly depends on the way the labels are encoded.
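As a rough illustration of the phenomenon in the abstract, the sketch below (Python with NumPy; the dimensions, signal strength, and midpoint intercept rule are illustrative assumptions, not the paper's exact procedure) fits a minimum-norm least-squares classifier on ±1-encoded labels, verifies that it interpolates the training data, and recalibrates the intercept on a hold-out sample:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 500                      # n << p: overparameterized regime

# Two-class Gaussian mixture with a one-dimensional latent signal
# (mu is a hypothetical mean direction chosen for illustration).
mu = np.zeros(p); mu[0] = 3.0
y = rng.choice([-1.0, 1.0], size=n)
X = y[:, None] * mu + rng.standard_normal((n, p))

# Minimum-norm least-squares fit of the +/-1 labels on the features.
beta = np.linalg.pinv(X) @ y

# With n <= p and X of full row rank, the fit interpolates: X beta = y.
assert np.allclose(X @ beta, y)

# A naive plug-in intercept can be inconsistent; recalibrate on an
# independent hold-out sample by thresholding at the midpoint of the
# two classes' hold-out scores.
y_h = rng.choice([-1.0, 1.0], size=n)
X_h = y_h[:, None] * mu + rng.standard_normal((n, p))
scores = X_h @ beta
b = 0.5 * (scores[y_h > 0].mean() + scores[y_h < 0].mean())

y_pred = np.sign(X_h @ beta - b)
print("hold-out accuracy:", (y_pred == y_h).mean())
```

With the symmetric ±1 encoding used here the corrected intercept sits near zero; the abstract's point is that whether the corrected procedure still interpolates depends on that encoding choice.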
Award ID(s):
2210557
PAR ID:
10490549
Author(s) / Creator(s):
;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Biometrika
Volume:
111
Issue:
1
ISSN:
0006-3444
Format(s):
Medium: X
Size(s):
p. 291-308
Sponsoring Org:
National Science Foundation
More Like this
  1. Many real time series datasets exhibit structural changes over time. A popular model for capturing their temporal dependence is the vector autoregression (VAR), which can accommodate structural changes through time-evolving transition matrices. The problem then becomes to estimate both the (unknown) number of structural break points and the VAR model parameters. An additional challenge emerges in the presence of very large datasets, namely how to accomplish these two objectives in a computationally efficient manner. In this article, we propose a novel procedure that leverages a block segmentation scheme (BSS), which reduces the number of model parameters to be estimated through a regularized least-squares criterion. Specifically, BSS examines appropriately defined blocks of the available data, which, when combined with a fused-lasso-based estimation criterion, leads to significant computational gains without compromising statistical accuracy in identifying the number and location of the structural breaks. This procedure is further coupled with new local and exhaustive search steps to consistently estimate the number and relative locations of the break points. The procedure is scalable to big high-dimensional time series datasets, with a computational complexity that can achieve O(n), where n is the length of the time series (sample size), compared to an exhaustive search procedure. Extensive numerical work on synthetic data supports the theoretical findings and illustrates the attractive properties of the procedure. Finally, an application to a neuroscience dataset demonstrates its usefulness in practice. Supplementary files for this article are available online.
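The block-segmentation idea can be illustrated with a toy version (this is not the BSS/fused-lasso procedure of the article; the block length, break location, and thresholding-by-argmax rule are simplifying assumptions): fit a VAR(1) by least squares on each block and flag a break where adjacent block estimates differ most.

```python
import numpy as np

rng = np.random.default_rng(1)

# Piecewise-stationary VAR(1): the transition matrix changes at t = 150.
d, T, brk = 3, 300, 150
A1 = 0.5 * np.eye(d)
A2 = -0.5 * np.eye(d)
x = np.zeros((T, d))
for t in range(1, T):
    A = A1 if t < brk else A2
    x[t] = x[t - 1] @ A.T + 0.3 * rng.standard_normal(d)

# Block scheme: fit a VAR(1) by least squares on each block of length b,
# then flag a break between the pair of blocks whose estimates differ most.
b = 50
n_blocks = T // b
A_hat = []
for k in range(n_blocks):
    seg = x[k * b:(k + 1) * b]
    X_lag, X_cur = seg[:-1], seg[1:]
    # Solve X_cur ~ X_lag @ B, so B estimates A transposed.
    B = np.linalg.lstsq(X_lag, X_cur, rcond=None)[0]
    A_hat.append(B.T)

diffs = [np.linalg.norm(A_hat[k + 1] - A_hat[k]) for k in range(n_blocks - 1)]
est_break = (int(np.argmax(diffs)) + 1) * b
print("estimated break near t =", est_break)
```

The article's procedure then refines such coarse block-level candidates with local and exhaustive search steps to pin down the exact break locations.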
  2. Abstract: Modeling the complex interactions of systems of particles or agents is a fundamental problem across the sciences, from physics and biology to economics and the social sciences. In this work, we consider second-order, heterogeneous, multivariable models of interacting agents or particles within simple environments. We describe a nonparametric inference framework to efficiently estimate the latent interaction kernels which drive these dynamical systems. We develop a learning theory which establishes strong consistency and optimal nonparametric minimax rates of convergence for the estimators, as well as provably accurate predicted trajectories. The optimal rates depend only on the intrinsic dimension of the interactions, which is typically much smaller than the ambient dimension. Our arguments are based on a coercivity condition which ensures that the interaction kernels can be estimated in a stable fashion. The numerical algorithm presented to build the estimators is parallelizable, performs well on high-dimensional problems, and its performance is tested on a variety of complex dynamical systems.
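A stripped-down analogue of the estimation problem (the paper treats second-order heterogeneous systems; this sketch uses a first-order homogeneous model, a hand-picked true kernel, and a piecewise-constant basis, all of which are illustrative assumptions) simulates trajectories from a known interaction kernel and recovers it by least squares on pairwise-distance features:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical first-order interacting-particle system:
#   dx_i/dt = (1/N) * sum_j phi(|x_j - x_i|) * (x_j - x_i).
phi_true = lambda r: np.exp(-r)          # the kernel to be recovered

N, dim, steps, dt = 20, 2, 50, 0.02
x = rng.standard_normal((N, dim))
snapshots, velocities = [], []
for _ in range(steps):
    diff = x[None, :, :] - x[:, None, :]          # diff[i, j] = x_j - x_i
    r = np.linalg.norm(diff, axis=2)
    v = (phi_true(r)[:, :, None] * diff).mean(axis=1)
    snapshots.append(x.copy()); velocities.append(v.copy())
    x = x + dt * v

# Nonparametric estimate of phi on a piecewise-constant basis in r:
# regress the observed velocities on the per-bin kernel predictions.
edges = np.linspace(0.0, 5.0, 11)                 # 10 distance bins
rows, targets = [], []
for x_s, v_s in zip(snapshots, velocities):
    diff = x_s[None, :, :] - x_s[:, None, :]
    r = np.linalg.norm(diff, axis=2)
    feats = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = ((r >= lo) & (r < hi)).astype(float)
        feats.append((mask[:, :, None] * diff).mean(axis=1))
    rows.append(np.stack(feats, axis=-1).reshape(-1, len(edges) - 1))
    targets.append(v_s.reshape(-1))
A = np.vstack(rows); b = np.concatenate(targets)
coef, *_ = np.linalg.lstsq(A, b, rcond=None)      # estimated phi per bin

rel_res = np.linalg.norm(A @ coef - b) / np.linalg.norm(b)
print("relative residual of the kernel fit:", rel_res)
```

The regression depends only on the one-dimensional distance variable r, echoing the paper's point that rates are governed by the intrinsic dimension of the interactions rather than the ambient dimension.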
  3. Fan, J; Pan, J. (Ed.)
    Testing whether the mean vector of some population is zero is a fundamental problem in statistics. In the high-dimensional regime, where the dimension of the data p is greater than the sample size n, traditional methods such as Hotelling's T² test cannot be directly applied. One can instead project the high-dimensional vector onto a low-dimensional space, where traditional methods apply. In this paper, we propose a projection test based on a new estimate of the optimal projection direction Σ^{−1}μ. Under the assumption that the optimal projection Σ^{−1}μ is sparse, we use a regularized quadratic program with a nonconvex penalty and a linear constraint to estimate it. Simulation studies and real data analysis are conducted to examine the finite sample performance of different tests in terms of type I error and power.
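The projection-test idea can be sketched as follows (the paper estimates Σ^{−1}μ via a regularized quadratic program with a nonconvex penalty; as a simpler stand-in this sketch uses a ridge-regularized solve, and the dimensions, sparse mean, and tuning parameter lam are illustrative assumptions): estimate the direction on one half of the data and run a one-sample t-test on the projected scores of the other half.

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 80, 200

# Sparse mean shift; the optimal projection direction is Sigma^{-1} mu.
mu = np.zeros(p); mu[:5] = 0.8
X = mu + rng.standard_normal((n, p))

# Data splitting: estimate the direction on one half, test on the other.
X1, X2 = X[: n // 2], X[n // 2:]

# Stand-in for the paper's penalized quadratic program: a ridge-regularized
# solve of (S + lam I) w = sample mean, with lam a hypothetical tuning choice.
lam = 1.0
S = np.cov(X1, rowvar=False)
w = np.linalg.solve(S + lam * np.eye(p), X1.mean(axis=0))

# Project the held-out half onto w and t-test the projected scores.
z = X2 @ w
t_stat = np.sqrt(len(z)) * z.mean() / z.std(ddof=1)
print("projection test statistic:", round(float(t_stat), 2))
```

Projecting first reduces the p-dimensional testing problem to a univariate one, which is why a good estimate of Σ^{−1}μ (here only crudely approximated) translates directly into power.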
  4. Abstract We consider high‐dimensional inference for potentially misspecified Cox proportional hazard models based on low‐dimensional results by Lin and Wei (1989). A desparsified Lasso estimator is proposed based on the log partial likelihood function and shown to converge to a pseudo‐true parameter vector. Interestingly, the sparsity of the true parameter can be inferred from that of the above limiting parameter. Moreover, each component of the above (nonsparse) estimator is shown to be asymptotically normal with a variance that can be consistently estimated even under model misspecifications. In some cases, this asymptotic distribution leads to valid statistical inference procedures, whose empirical performances are illustrated through numerical examples. 
  5. We study the structure of the set of all possible affine hyperplane sections of a convex polytope. We present two different cell decompositions of this set, induced by hyperplane arrangements. Using our decomposition, we bound the number of possible combinatorial types of sections and craft algorithms that compute optimal sections of the polytope according to various combinatorial and metric criteria, including sections that maximize the number of faces of a given dimension, maximize the volume, and maximize the integral of a polynomial. Our optimization algorithms run in polynomial time in fixed dimension, but the same problems become computationally hard otherwise. Our tools can be extended to intersections with halfspaces and projections onto hyperplanes. Finally, we present several experiments illustrating our theorems and algorithms on famous polytopes.