Robust Distance Correlation for Variable Screening

Ma, Tianzhou; Yang, Fan; Ke, Hongjie; Ren, Zhao  (ORCID:0000000180965248)

doi:10.1002/sta4.70094

Citation Details

This content will become publicly available on September 1, 2026

Robust Distance Correlation for Variable Screening

ABSTRACT In modern statistical applications, identifying critical features in high‐dimensional data is essential for scientific discoveries. Traditional best subset selection methods face computational challenges, while regularization approaches such as Lasso, SCAD and their variants often exhibit poor performance with ultrahigh‐dimensional data. Sure screening methods, widely used for dimensionality reduction, have been developed as popular alternatives, but few target heavy‐tailed characteristics in modern big data. This paper introduces a new sure screening method, based on robust distance correlation (‘RDC’), designed for heavy‐tailed data. The proposed method inherits the benefits of the original model‐free distance correlation‐based screening while robustly estimating distance correlation in the presence of heavy‐tailed data. We further develop an FDR control procedure by incorporating the Reflection via Data Splitting (REDS) method. Extensive simulations demonstrate the method's advantage over existing screening procedures under different scenarios of heavy‐tailedness. Its application to high‐dimensional heavy‐tailed RNA‐seq data from The Cancer Genome Atlas (TCGA) pancreatic cancer cohort showcases superior performance in identifying biologically meaningful genes predictive of MAPK1 protein expression critical to pancreatic cancer. more »

Award ID(s):: 2113568

PAR ID:: 10639891

Author(s) / Creator(s):: Ma, Tianzhou ; Yang, Fan ; Ke, Hongjie ; Ren, Zhao

Publisher / Repository:: John Wiley & Sons Ltd

Date Published:: 2025-09-01

Journal Name:: Stat

Volume:: 14

Issue:: 3

ISSN:: 2049-1573

Page Range / eLocation ID:: e70094

Subject(s) / Keyword(s):: distance correlation false discovery rate Huber loss robustness sure screening property variable selection

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on September 1, 2026
Journal Article:
https://doi.org/10.1002/sta4.70094

More Like this