Abstract We propose a constrained maximum partial likelihood estimator for dimension reduction in integrative (e.g., pan-cancer) survival analysis with high-dimensional predictors. We assume that for each population in the study, the hazard function follows a distinct Cox proportional hazards model. To borrow information across populations, we assume that each of the hazard functions depend only on a small number of linear combinations of the predictors (i.e., “factors”). We estimate these linear combinations using an algorithm based on “distance-to-set” penalties. This allows us to impose both low-rankness and sparsity on the regression coefficient matrix estimator. We derive asymptotic results that reveal that our estimator is more efficient than fitting a separate proportional hazards model for each population. Numerical experiments suggest that our method outperforms competitors under various data generating models. We use our method to perform a pan-cancer survival analysis relating protein expression to survival across 18 distinct cancer types. Our approach identifies six linear combinations, depending on only 20 proteins, which explain survival across the cancer types. Finally, to validate our fitted model, we show that our estimated factors can lead to better prediction than competitors on four external datasets.
more »
« less
A nonparametric test for comparing survival functions based on restricted distance correlation
In this article, we propose an omnibus test for comparing two survival functions under non-proportional hazards. The test statistic is based on a product-limit estimate of the restricted distance correlation, which is closely related to the distance between survival curves. The strong consistency is established under mild regularity conditions. Our simulation studies show that the new test has satisfactory power under proportional hazard and various non-proportional hazards settings including delayed treatment effect, diminishing effect, and crossing survival curves; therefore, it can be a competitive alternative to the existing omnibus tests such as Kolmogorov-Smirnov test, Cramer-von Mises test, two-stage test, and the maxCombo test based on weighted log-rank statistics. Two extensions of the new test to one-sided alternatives and a Gaussian kernel are also discussed
more »
« less
- Award ID(s):
- 2119968
- PAR ID:
- 10511235
- Publisher / Repository:
- DeGruyter
- Date Published:
- Journal Name:
- Dependence Modeling
- Volume:
- 11
- Issue:
- 1
- ISSN:
- 2300-2298
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
null (Ed.)Abstract Background Interstitial lung abnormalities (ILA) can be detected on computed tomography (CT) in lung cancer patients and have an association with mortality in advanced non-small cell lung cancer (NSCLC) patients. The aim of this study is to demonstrate the significance of ILA for mortality in patients with stage I NSCLC using Boston Lung Cancer Study cohort. Methods Two hundred and thirty-one patients with stage I NSCLC from 2000 to 2011 were investigated in this retrospective study (median age, 69 years; 93 males, 138 females). ILA was scored on baseline CT scans prior to treatment using a 3-point scale (0 = no evidence of ILA, 1 = equivocal for ILA, 2 = ILA) by a sequential reading method. ILA score 2 was considered the presence of ILA. The difference of overall survival (OS) for patients with different ILA scores were tested via log-rank test and multivariate Cox proportional hazards models were used to estimate hazard ratios (HRs) including ILA score, age, sex, smoking status, and treatment as the confounding variables. Results ILA was present in 22 out of 231 patients (9.5%) with stage I NSCLC. The presence of ILA was associated with shorter OS (patients with ILA score 2, median 3.85 years [95% confidence interval (CI): 3.36 – not reached (NR)]; patients with ILA score 0 or 1, median 10.16 years [95%CI: 8.65 - NR]; P < 0.0001). In a Cox proportional hazards model, the presence of ILA remained significant for increased risk for death (HR = 2.88, P = 0.005) after adjusting for age, sex, smoking and treatment. Conclusions ILA was detected on CT in 9.5% of patients with stage I NSCLC. The presence of ILA was significantly associated with a shorter OS and could be an imaging marker of shorter survival in stage I NSCLC.more » « less
-
In the absence of data from a randomized trial, researchers may aim to use observational data to draw causal inference about the effect of a treatment on a time-to-event outcome. In this context, interest often focuses on the treatment-specific survival curves, that is, the survival curves were the population under study to be assigned to receive the treatment or not. Under certain conditions, including that all confounders of the treatment-outcome relationship are observed, the treatment-specific survival curve can be identified with a covariate-adjusted survival curve. In this article, we propose a novel cross-fitted doubly-robust estimator that incorporates data-adaptive (e.g. machine learning) estimators of the conditional survival functions. We establish conditions on the nuisance estimators under which our estimator is consistent and asymptotically linear, both pointwise and uniformly in time. We also propose a novel ensemble learner for combining multiple candidate estimators of the conditional survival estimators. Notably, our methods and results accommodate events occurring in discrete or continuous time, or an arbitrary mix of the two. We investigate the practical performance of our methods using numerical studies and an application to the effect of a surgical treatment to prevent metastases of parotid carcinoma on mortality.more » « less
-
Abstract Survival of Gulf Stream (GS) warm core rings (WCRs) was investigated using a census consisting of a total of 961 rings formed during the period 1980–2017. Kaplan‐Meier survival probability and Cox hazard proportional models were used for the analysis. The survival analysis was performed for rings formed in four 5° zones between 75° W and 55° W. The radius, latitude, and distance from the shelf‐break of a WCR at formation all had a significant effect on the survival of WCRs. A pattern of higher survival was observed in WCRs formed in Zone 2 (70°–65° W) or Zone 3 (65°–60° W) and then demised in Zone 1 (75°–70° W). Survival probability of the WCRs increased to more than70%for those formed within a latitude band from 39.5° to 41.5° N. Survival probability is reduced when the WCRs are formed near the New England Seamounts.more » « less
-
Extensive literature has been proposed for the analysis of correlated survival data. Subjects within a cluster share some common characteristics, e.g., genetic and environmental factors, so their time-to-event outcomes are correlated. The frailty model under proportional hazards assumption has been widely applied for the analysis of clustered survival outcomes. However, the prediction performance of this method can be less satisfactory when the risk factors have complicated effects, e.g., nonlinear and interactive. To deal with these issues, we propose a neural network frailty Cox model that replaces the linear risk function with the output of a feed-forward neural network. The estimation is based on quasi-likelihood using Laplace approximation. A simulation study suggests that the proposed method has the best performance compared with existing methods. The method is applied to the clustered time-to-failure prediction within the kidney transplantation facility using the national kidney transplant registry data from the U.S. Organ Procurement and Transplantation Network. All computer programs are available at https://github.com/rivenzhou/deep_learning_clustered.more » « less