Model selection is crucial to high-dimensional learning and inference for contemporary big data applications in pinpointing the best set of covariates among a sequence of candidate interpretable models. Most existing work implicitly assumes that the models are correctly specified or have fixed dimensionality, yet both model misspecification and high dimensionality are prevalent in practice. In this paper, we exploit the framework of model selection principles for misspecified generalized linear models presented in Lv & Liu (2014) and investigate the asymptotic expansion of the posterior model probability in the setting of high-dimensional misspecified models. With a natural choice of prior probabilities that encourages interpretability and incorporates the Kullback–Leibler divergence, we suggest the high-dimensional generalized Bayesian information criterion with prior probability for large-scale model selection with misspecification. Our new information criterion characterizes the impacts of both model misspecification and high dimensionality on model selection. We further establish the consistency of covariance contrast matrix estimation and the model selection consistency of the new information criterion in ultrahigh dimensions under some mild regularity conditions. Our numerical studies demonstrate that the proposed method enjoys improved model selection consistency over its main competitors.
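As a rough illustration of the ingredients such a criterion combines, the sketch below fits a possibly misspecified logistic working model by quasi-maximum likelihood and scores it with a BIC-type criterion carrying a sandwich-style correction built from the two information matrices H and K. This is a hedged sketch under simplifying assumptions, not the paper's exact criterion: the tr(H^{-1}K) term and the 2s log p complexity term merely stand in for the covariance contrast and Kullback–Leibler prior ingredients described above, and all function names are illustrative.

```python
import numpy as np

def fit_logistic_irls(X, y, n_iter=50, tol=1e-8):
    """Quasi-MLE for a (possibly misspecified) logistic working model via IRLS."""
    n, s = X.shape
    beta = np.zeros(s)
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        H = X.T @ (X * (p * (1 - p))[:, None])  # information matrix of the working model
        step = np.linalg.solve(H + 1e-10 * np.eye(s), X.T @ (y - p))
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    return beta

def sandwich_bic(X, y, p_total):
    """Illustrative BIC-type score with a misspecification correction.

    NOT the paper's exact formula: tr(H^{-1}K) and the 2*s*log(p_total)
    term stand in for the covariance contrast and prior-probability
    ingredients of the proposed criterion.
    """
    n, s = X.shape
    beta = fit_logistic_irls(X, y)
    p_hat = 1.0 / (1.0 + np.exp(-X @ beta))
    eps = 1e-12
    loglik = np.sum(y * np.log(p_hat + eps) + (1 - y) * np.log(1 - p_hat + eps))
    H = X.T @ (X * (p_hat * (1 - p_hat))[:, None]) / n    # -E[Hessian] under the working model
    K = X.T @ (X * ((y - p_hat) ** 2)[:, None]) / n       # covariance of the score
    contrast = np.trace(np.linalg.solve(H, K))            # tr(H^{-1} K), ~ s if well-specified
    return -2.0 * loglik + s * np.log(n) + 2.0 * s * np.log(p_total) + 2.0 * contrast
```

Under correct specification H and K agree asymptotically, so tr(H^{-1}K) is close to the model size s and the correction collapses to an AIC-like term; under misspecification the covariance contrast matrix H^{-1}K moves the penalty accordingly.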
Model selection properties of forward selection and sequential cross‐validation for high‐dimensional regression
Forward selection (FS) is a popular variable selection method for linear regression, but theoretical understanding of FS with a diverging number of covariates is still limited. We derive sufficient conditions for FS to attain model selection consistency. Our conditions are similar to those for orthogonal matching pursuit, but are obtained using a different argument. When the true model size is unknown, we derive sufficient conditions for model selection consistency of FS with a data-driven stopping rule, based on a sequential variant of cross-validation. As a byproduct of our proofs, we also obtain a sharp (sufficient and almost necessary) condition for model selection consistency of “wrapper” forward search for linear regression. We illustrate the intuition and demonstrate the performance of our methods using simulation studies and real datasets.
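A minimal sketch of this procedure, with one simplification flagged up front: a single train/validation split stands in for the paper's sequential cross-validation stopping rule, and all names are illustrative.

```python
import numpy as np

def rss(X, y, S):
    """Residual sum of squares after regressing y on the columns in S."""
    if not S:
        return float(np.sum((y - y.mean()) ** 2))
    Xs = X[:, S]
    coef, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    return float(np.sum((y - Xs @ coef) ** 2))

def forward_select(X_tr, y_tr, X_val, y_val, max_steps=None):
    """Greedy forward selection with a held-out stopping rule (illustrative).

    A single train/validation split stands in for the sequential
    cross-validation analyzed in the paper.
    """
    n, p = X_tr.shape
    S, best_val = [], np.inf
    for _ in range(max_steps or min(n - 1, p)):
        # add the covariate giving the largest drop in training RSS
        j_best = min((j for j in range(p) if j not in S),
                     key=lambda j: rss(X_tr, y_tr, S + [j]))
        cand = S + [j_best]
        coef, *_ = np.linalg.lstsq(X_tr[:, cand], y_tr, rcond=None)
        val_err = float(np.sum((y_val - X_val[:, cand] @ coef) ** 2))
        if val_err >= best_val:        # held-out error stopped improving
            break
        S, best_val = cand, val_err
    return S
```

The greedy step mirrors the “wrapper” forward search discussed above: candidates are ranked by how much they reduce training RSS, while the stopping decision is driven entirely by held-out error.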
- Award ID(s): 2015492
- PAR ID: 10367386
- Publisher / Repository: Wiley Blackwell (John Wiley & Sons)
- Journal Name: Canadian Journal of Statistics
- Volume: 50
- Issue: 2
- ISSN: 0319-5724
- Page Range / eLocation ID: p. 454-470
- Sponsoring Org: National Science Foundation
More Like this
Steel, Mark (Ed.)
Bayesian cross-validation (CV) is a popular method for predictive model assessment that is simple to implement and broadly applicable. A wide range of CV schemes is available for time series applications, including generic leave-one-out (LOO) and K-fold methods, as well as specialized approaches intended to deal with serial dependence such as leave-future-out (LFO), h-block, and hv-block. Existing large-sample results show that both specialized and generic methods are applicable to models of serially-dependent data. However, large sample consistency results overlook the impact of sampling variability on accuracy in finite samples. Moreover, the accuracy of a CV scheme depends on many aspects of the procedure. We show that poor design choices can lead to elevated rates of adverse selection. In this paper, we consider the problem of identifying the regression component of an important class of models of data with serial dependence, autoregressions of order p with q exogenous regressors (ARX(p,q)), under the logarithmic scoring rule. We show that when serial dependence is present, scores computed using the joint (multivariate) density have lower variance and better model selection accuracy than the popular pointwise estimator. In addition, we present a detailed case study of the special case of ARX models with fixed autoregressive structure and variance. For this class, we derive the finite-sample distribution of the CV estimators and the model selection statistic. We conclude with recommendations for practitioners.
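The sketch below illustrates the joint-versus-pointwise contrast for a Gaussian AR(1) under a leave-future-out split. It uses plug-in OLS estimates in place of Bayesian posterior predictive densities and compares a chain-rule joint score against multi-step marginal scores, so it is only a loose illustration of the phenomenon, not the paper's exact estimators.

```python
import numpy as np
from scipy.stats import norm

def fit_ar1(y):
    """Plug-in OLS fit of a Gaussian AR(1): y_t = a + b*y_{t-1} + e_t."""
    Y = y[1:]
    X = np.column_stack([np.ones(len(y) - 1), y[:-1]])
    (a, b), *_ = np.linalg.lstsq(X, Y, rcond=None)
    sigma = np.std(Y - X @ np.array([a, b]), ddof=2)
    return a, b, sigma

def lfo_scores(y, n_train):
    """Log predictive scores for the held-out block y[n_train:].

    'joint' decomposes the block's joint density by the chain rule,
    conditioning each point on the actually observed past; 'pointwise'
    scores each point under its k-step-ahead marginal from the training
    cut, ignoring dependence accumulated inside the block. An illustrative
    contrast only, not the paper's exact estimators.
    """
    a, b, sigma = fit_ar1(y[:n_train])
    joint = sum(norm.logpdf(y[t], a + b * y[t - 1], sigma)
                for t in range(n_train, len(y)))
    pointwise = 0.0
    for t in range(n_train, len(y)):
        k = t - n_train + 1                      # forecast horizon
        mean = a * sum(b ** i for i in range(k)) + b ** k * y[n_train - 1]
        var = sigma ** 2 * sum(b ** (2 * i) for i in range(k))
        pointwise += norm.logpdf(y[t], mean, np.sqrt(var))
    return joint, pointwise
```

In this sketch the joint score conditions each held-out point on realized past values, while the pointwise score compounds forecast variance over the horizon, which is one concrete way serial dependence can separate the two estimators.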
Inverse models arise in various environmental applications, ranging from atmospheric modeling to geosciences. Inverse models can often incorporate predictor variables, similar to regression, to help estimate natural processes or parameters of interest from observed data. Although a large set of possible predictor variables may be included in these inverse or regression models, a core challenge is to identify a small number of predictor variables that are most informative of the model, given limited observations. This problem is typically referred to as model selection. A variety of criterion-based approaches are commonly used for model selection, but most follow a two-step process: first, select predictors using some statistical criteria, and second, solve the inverse or regression problem with these predictor variables. The first step typically requires comparing all possible combinations of candidate predictors, which quickly becomes computationally prohibitive, especially for large-scale problems. In this work, we develop a one-step approach for linear inverse modeling, where model selection and the inverse model are performed in tandem. We reformulate the problem so that the selection of a small number of relevant predictor variables is achieved via a sparsity-promoting prior. Then, we describe hybrid iterative projection methods based on flexible Krylov subspace methods for efficient optimization. These approaches are well-suited for large-scale problems with many candidate predictor variables. We evaluate our results against traditional, criteria-based approaches. We also demonstrate the applicability and potential benefits of our approach using examples from atmospheric inverse modeling based on NASA's Orbiting Carbon Observatory-2 (OCO-2) satellite.
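A minimal sketch of the one-step formulation, assuming a linear forward operator G, a candidate predictor matrix X, and a plain ISTA loop in place of the hybrid iterative projection / flexible Krylov solvers developed in the paper; all names are illustrative.

```python
import numpy as np

def soft(v, t):
    """Soft-thresholding: proximal operator of the l1 sparsity-promoting prior."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def one_step_inverse(G, X, y, lam, n_iter=500):
    """One-step inverse modeling with built-in predictor selection (illustrative).

    Model: y ~ G @ (X @ beta) + noise. An l1 penalty on beta selects the few
    informative predictor columns of X while solving the inverse problem, so
    selection and inversion happen in a single optimization. ISTA stands in
    for the hybrid iterative projection / flexible Krylov methods of the paper.
    """
    A = G @ X                              # combined forward map acting on beta
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ beta - y)
        beta = soft(beta - grad / L, lam / L)
    return beta
```

Columns of X whose coefficients survive the shrinkage are the selected predictors, which is how this formulation avoids comparing all candidate subsets.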
This paper is motivated by studying differential brain activities to multiple experimental condition presentations in intracranial electroencephalography (iEEG) experiments. Contrasting effects of experimental conditions are often zero in most regions and nonzero in some local regions, yielding locally sparse functions. Such studies are essentially a function-on-scalar regression problem, with interest focused not only on estimating nonparametric functions but also on recovering the function supports. We propose a weighted group bridge approach for simultaneous function estimation and support recovery in function-on-scalar mixed effect models, while accounting for heterogeneity present in functional data. We use B-splines to transform sparsity of functions to its sparse vector counterpart of increasing dimension, and propose a fast nonconvex optimization algorithm using nested alternating direction method of multipliers (ADMM) for estimation. Large sample properties are established. In particular, we show that the estimated coefficient functions are rate optimal in the minimax sense under the L2 norm and resemble a phase transition phenomenon. For support estimation, we derive a convergence rate under the L∞ norm that leads to a selection consistency property under δ-sparsity, and obtain a result under strict sparsity using a simple sufficient regularity condition. An adjusted extended Bayesian information criterion is proposed for parameter tuning. The developed method is illustrated through simulations and an application to a novel iEEG data set to study multisensory integration.
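The sketch below shows the B-spline reduction in its simplest form: because each basis function has compact support, sparsity in the basis coefficients yields locally zero coefficient functions. A soft-thresholded proximal gradient loop stands in for the paper's weighted group bridge estimator and nested ADMM, random effects are omitted, and BSpline.design_matrix assumes scipy 1.8 or later; all names are illustrative.

```python
import numpy as np
from scipy.interpolate import BSpline

def bspline_design(t_grid, n_basis, degree=3):
    """B-spline design matrix on t_grid (uses BSpline.design_matrix, scipy >= 1.8)."""
    inner = np.linspace(t_grid.min(), t_grid.max(), n_basis - degree + 1)
    knots = np.r_[[inner[0]] * degree, inner, [inner[-1]] * degree]
    return BSpline.design_matrix(t_grid, knots, degree).toarray()

def soft(B, t):
    """Entrywise soft-threshold: since each B-spline basis function has compact
    support, zeroed coefficients give locally zero coefficient functions."""
    return np.sign(B) * np.maximum(np.abs(B) - t, 0.0)

def fos_regression(Z, Y, Phi, lam, n_iter=300):
    """Locally sparse function-on-scalar regression (illustrative sketch).

    Y (n x m): functional responses on an m-point grid; Z (n x q): scalar
    covariates; Phi (m x K): B-spline design. Row j of B holds the basis
    coefficients of coefficient function beta_j. A lasso-penalized proximal
    gradient loop stands in for the paper's weighted group bridge estimator
    and nested ADMM algorithm; random effects are omitted.
    """
    n, m = Y.shape
    q, K = Z.shape[1], Phi.shape[1]
    B = np.zeros((q, K))
    L = (np.linalg.norm(Z, 2) * np.linalg.norm(Phi, 2)) ** 2 / (n * m)
    for _ in range(n_iter):
        R = Z @ B @ Phi.T - Y                  # residual functions on the grid
        grad = Z.T @ R @ Phi / (n * m)         # gradient of 0.5*||R||_F^2/(n*m)
        B = soft(B - grad / L, lam / L)
    return B
```

The estimated support of beta_j is then read off from which basis coefficients in row j survive the thresholding.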
Inference for high-dimensional models is challenging as regular asymptotic theories are not applicable. This paper proposes a new framework of simultaneous estimation and inference for high-dimensional linear models. By smoothing over partial regression estimates based on a given variable selection scheme, we reduce the problem to a low-dimensional least squares estimation. The procedure, termed Selection-assisted Partial Regression and Smoothing (SPARES), utilizes data splitting along with variable selection and partial regression. We show that the SPARES estimator is asymptotically unbiased and normal, and derive its variance via a nonparametric delta method. The utility of the procedure is evaluated under various simulation scenarios and via comparisons with the de-biased LASSO estimator, a major competitor. We apply the method to analyze two genomic datasets and obtain biologically meaningful results.
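A simplified sketch of the split-select-refit idea, assuming scikit-learn's LassoCV as the variable selection scheme. Averaging over random splits plays the smoothing role; the spread across splits reported here is only a crude uncertainty proxy, not the paper's nonparametric delta-method variance, and all names are illustrative.

```python
import numpy as np
from sklearn.linear_model import LassoCV

def spares_sketch(X, y, j, n_splits=50, seed=0):
    """Split-select-refit estimate of coefficient j (illustrative sketch).

    Each round: run lasso selection on one half of the data, then OLS
    ("partial regression") on the other half using the selected set plus
    variable j. Smoothing = averaging over random splits. The empirical
    spread across splits is a crude uncertainty proxy, NOT the paper's
    nonparametric delta-method variance.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    estimates = []
    for _ in range(n_splits):
        perm = rng.permutation(n)
        idx1, idx2 = perm[: n // 2], perm[n // 2:]
        sel = LassoCV(cv=5).fit(X[idx1], y[idx1])          # selection half
        cols = sorted(set(np.flatnonzero(sel.coef_)) | {j})
        coef, *_ = np.linalg.lstsq(X[idx2][:, cols], y[idx2], rcond=None)
        estimates.append(coef[cols.index(j)])              # refit half
    est = np.array(estimates)
    return est.mean(), est.std(ddof=1) / np.sqrt(n_splits)
```

Forcing variable j into every refit keeps the target coefficient estimable even on splits where the lasso screens it out, which is what makes the low-dimensional least squares step well-defined.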