Optimal subsampling for semi‐parametric accelerated failure time models with massive survival data using a rank‐based approach

Yang, Zehan; Wang, HaiYing; Yan, Jun

doi:10.1002/sim.10200

Citation Details

Optimal subsampling for semi‐parametric accelerated failure time models with massive survival data using a rank‐based approach

Subsampling is a practical strategy for analyzing vast survival data, which are progressively encountered across diverse research domains. While the optimal subsampling method has been applied to inferences for Cox models and parametric accelerated failure time (AFT) models, its application to semi‐parametric AFT models with rank‐based estimation have received limited attention. The challenges arise from the non‐smooth estimating function for regression coefficients and the seemingly zero contribution from censored observations in estimating functions in the commonly seen form. To address these challenges, we develop optimal subsampling probabilities for both event and censored observations by expressing the estimating functions through a well‐defined stochastic process. Meanwhile, we apply an induced smoothing procedure to the non‐smooth estimating functions. As the optimal subsampling probabilities depend on the unknown regression coefficients, we employ a two‐step procedure to obtain a feasible estimation method. An additional benefit of the method is its ability to resolve the issue of underestimation of the variance when the subsample size approaches the full sample size. We validate the performance of our estimators through a simulation study and apply the methods to analyze the survival time of lymphoma patients in the surveillance, epidemiology, and end results program. more »

Award ID(s):: 2105571

PAR ID:: 10596297

Author(s) / Creator(s):: Yang, Zehan; Wang, HaiYing; Yan, Jun

Publisher / Repository:: Wiley

Date Published:: 2024-10-30

Journal Name:: Statistics in Medicine

Volume:: 43

Issue:: 24

ISSN:: 0277-6715

Page Range / eLocation ID:: 4650 to 4666

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1002/sim.10200

More Like this