NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

High-Dimensional Quantile Regression: Convolution Smoothing and Concave Regularization

https://doi.org/10.1111/rssb.12485

Tan, Kean_Ming; Wang, Lan; Zhou, Wen-Xin (December 2021, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract ℓ 1 -penalized quantile regression (QR) is widely used for analysing high-dimensional data with heterogeneity. It is now recognized that the ℓ1-penalty introduces non-negligible estimation bias, while a proper use of concave regularization may lead to estimators with refined convergence rates and oracle properties as the signal strengthens. Although folded concave penalized M-estimation with strongly convex loss functions have been well studied, the extant literature on QR is relatively silent. The main difficulty is that the quantile loss is piecewise linear: it is non-smooth and has curvature concentrated at a single point. To overcome the lack of smoothness and strong convexity, we propose and study a convolution-type smoothed QR with iteratively reweighted ℓ1-regularization. The resulting smoothed empirical loss is twice continuously differentiable and (provably) locally strongly convex with high probability. We show that the iteratively reweighted ℓ1-penalized smoothed QR estimator, after a few iterations, achieves the optimal rate of convergence, and moreover, the oracle rate and the strong oracle property under an almost necessary and sufficient minimum signal strength condition. Extensive numerical studies corroborate our theoretical results.
more » « less
Resampling‐based confidence intervals for model‐free robust inference on optimal treatment regimes

https://doi.org/10.1111/biom.13337

Wu, Yunan; Wang, Lan (August 2020, Biometrics)

Abstract We propose a new procedure for inference on optimal treatment regimes in the model‐free setting, which does not require to specify an outcome regression model. Existing model‐free estimators for optimal treatment regimes are usually not suitable for the purpose of inference, because they either have nonstandard asymptotic distributions or do not necessarily guarantee consistent estimation of the parameter indexing the Bayes rule due to the use of surrogate loss. We first study a smoothed robust estimator that directly targets the parameter corresponding to the Bayes decision rule for optimal treatment regimes estimation. This estimator is shown to have an asymptotic normal distribution. Furthermore, we verify that a resampling procedure provides asymptotically accurate inference for both the parameter indexing the optimal treatment regime and the optimal value function. A new algorithm is developed to calculate the proposed estimator with substantially improved speed and stability. Numerical results demonstrate the satisfactory performance of the new methods.
more » « less
Transformation-Invariant Learning of Optimal Individualized Decision Rules with Time-to-Event Outcomes

https://doi.org/10.1080/01621459.2022.2068420

Zhou, Yu; Wang, Lan; Song, Rui; Zhao, Tuoyi (October 2023, Journal of the American Statistical Association)

Full Text Available
Quantile partially linear additive model for data with dropouts and an application to modeling cognitive decline

https://doi.org/10.1002/sim.9745

Maidman, Adam; Wang, Lan; Zhou, Xiao‐Hua; Sherwood, Ben (July 2023, Statistics in Medicine)

The National Alzheimer's Coordinating Center Uniform Data Set includes test results from a battery of cognitive exams. Motivated by the need to model the cognitive ability of low‐performing patients we create a composite score from ten tests and propose to model this score using a partially linear quantile regression model for longitudinal studies with non‐ignorable dropouts. Quantile regression allows for modeling non‐central tendencies. The partially linear model accommodates nonlinear relationships between some of the covariates and cognitive ability. The data set includes patients that leave the study prior to the conclusion. Ignoring such dropouts will result in biased estimates if the probability of dropout depends on the response. To handle this challenge, we propose a weighted quantile regression estimator where the weights are inversely proportional to the estimated probability a subject remains in the study. We prove that this weighted estimator is a consistent and efficient estimator of both linear and nonlinear effects.
more » « less
Full Text Available
Fairness-Oriented Learning for Optimal Individualized Treatment Rules

https://doi.org/10.1080/01621459.2021.2008402

Fang, Ethan X; Wang, Zhaoran; Wang, Lan (July 2023, Journal of the American Statistical Association)

Full Text Available
Model-Assisted Uniformly Honest Inference for Optimal Treatment Regimes in High Dimension

https://doi.org/10.1080/01621459.2021.1929246

Wu, Yunan; Wang, Lan; Fu, Haoda (January 2023, Journal of the American Statistical Association)

Full Text Available
ANALYSIS OF GLOBAL AND LOCAL OPTIMA OF REGULARIZED QUANTILE REGRESSION IN HIGH DIMENSIONS: A SUBGRADIENT APPROACH

https://doi.org/10.1017/S0266466622000421

Wang, Lan; He, Xuming (October 2022, Econometric Theory)

Regularized quantile regression (QR) is a useful technique for analyzing heterogeneous data under potentially heavy-tailed error contamination in high dimensions. This paper provides a new analysis of the estimation/prediction error bounds of the global solution of$$L_1$$-regularized QR (QR-LASSO) and the local solutions of nonconvex regularized QR (QR-NCP) when the number of covariates is greater than the sample size. Our results build upon and significantly generalize the earlier work in the literature. For certain heavy-tailed error distributions and a general class of design matrices, the least-squares-based LASSO cannot achieve the near-oracle rate derived under the normality assumption no matter the choice of the tuning parameter. In contrast, we establish that QR-LASSO achieves the near-oracle estimation error rate for a broad class of models under conditions weaker than those in the literature. For QR-NCP, we establish the novel results that all local optima within a feasible region have desirable estimation accuracy. Our analysis applies to not just the hard sparsity setting commonly used in the literature, but also to the soft sparsity setting which permits many small coefficients. Our approach relies on a unified characterization of the global/local solutions of regularized QR via subgradients using a generalized Karush–Kuhn–Tucker condition. The theory of the paper establishes a key property of the subdifferential of the quantile loss function in high dimensions, which is of independent interest for analyzing other high-dimensional nonsmooth problems.
more » « less
Full Text Available
A Tuning-free Robust and Efficient Approach to High-dimensional Regression

https://doi.org/10.1080/01621459.2020.1840989

Wang, Lan; Peng, Bo; Bradic, Jelena; Li, Runze; Wu, Yunan (October 2020, Journal of the American Statistical Association)
null (Ed.)
Full Text Available
Rejoinder to “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”

https://doi.org/10.1080/01621459.2020.1843865

Wang, Lan; Peng, Bo; Bradic, Jelena; Li, Runze; Wu, Yunan (October 2020, Journal of the American Statistical Association)
null (Ed.)
Full Text Available

Search for: All records