Sampling‐based estimation for massive survival data with additive hazards model

Zuo, Lulu; Zhang, Haixiang  (ORCID:0000000273115605); Wang, HaiYing; Liu, Lei  (ORCID:000000031844338X)

doi:10.1002/sim.8783

Citation Details

Sampling‐based estimation for massive survival data with additive hazards model

For massive survival data, we propose a subsampling algorithm to efficiently approximate the estimates of regression parameters in the additive hazards model. We establish consistency and asymptotic normality of the subsample‐based estimator given the full data. The optimal subsampling probabilities are obtained via minimizing asymptotic variance of the resulting estimator. The subsample‐based procedure can largely reduce the computational cost compared with the full data method. In numerical simulations, our method has low bias and satisfactory coverage probabilities. We provide an illustrative example on the survival analysis of patients with lymphoma cancer from the Surveillance, Epidemiology, and End Results Program.

Award ID(s):: 1812013

NSF-PAR ID:: 10453310

Author(s) / Creator(s):: Zuo, Lulu ; Zhang, Haixiang ; Wang, HaiYing ; Liu, Lei

Publisher / Repository:: Wiley Blackwell (John Wiley & Sons)

Date Published:: 2020-11-03

Journal Name:: Statistics in Medicine

Volume:: 40

Issue:: 2

ISSN:: 0277-6715

Page Range / eLocation ID:: p. 441-450

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1002/sim.8783

More Like this