Title: Quantile association regression on bivariate survival data
The association between two event times is of scientific importance in various fields. Due to population heterogeneity, it is desirable to examine the degree to which local association depends on different characteristics of the population. Here we adopt a novel quantile-based local association measure and propose a conditional quantile association regression model that allows covariate effects on the local association of two survival times. Estimating equations for the quantile association coefficients are constructed based on the relationship between this quantile association measure and the conditional copula. Asymptotic properties for the resulting estimators are rigorously derived, and induced smoothing is used to obtain the covariance matrix. Through simulations we demonstrate the good practical performance of the proposed inference procedures. An application to age-related macular degeneration (AMD) data reveals interesting varying effects of the baseline AMD severity score on the local association between two AMD progression times.
Award ID(s):
1916001
NSF-PAR ID:
10237565
Journal Name:
Canadian Journal of Statistics
ISSN:
0319-5724
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. When analyzing bivariate outcome data, it is often of scientific interest to measure and estimate the association between the bivariate outcomes. In the presence of influential covariates for one or both of the outcomes, conditional association measures can quantify the strength of association without the disturbance of the marginal covariate effects, to provide cleaner and less‐confounded insights into the bivariate association. In this work, we propose estimation and inferential procedures for assessing the conditional Kendall's tau coefficient given the covariates, by adopting the quantile regression and quantile copula framework to handle marginal covariate effects. The proposed method can flexibly accommodate right censoring and be readily applied to bivariate survival data. It also facilitates an estimator of the conditional concordance measure, namely, a conditional C-index, where the unconditional C-index is commonly used to assess the predictive capacity for survival outcomes. The proposed method is flexible and robust and can be easily implemented using standard software. The method performed satisfactorily in extensive simulation studies with and without censoring. Application of our methods to two real‐life data examples demonstrates their desirable practical utility.
  2. The conditional average treatment effect (CATE) is the best measure of individual causal effects given baseline covariates. However, the CATE only captures the (conditional) average, and can overlook risks and tail events, which are important to treatment choice. In aggregate analyses, this is usually addressed by measuring the distributional treatment effect (DTE), such as differences in quantiles or tail expectations between treatment groups. Hypothetically, one can similarly fit conditional quantile regressions in each treatment group and take their difference, but this would not be robust to misspecification or provide agnostic best-in-class predictions. We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a class of problems that includes conditional quantile treatment effects, conditional super-quantile treatment effects, and conditional treatment effects on coherent risk measures given by f-divergences. Our method is based on constructing a special pseudo-outcome and regressing it on covariates using any regression learner. Our method is model-agnostic in that it can provide the best projection of CDTE onto the regression model class. Our method is robust in that even if we learn these nuisances nonparametrically at very slow rates, we can still learn CDTEs at rates that depend on the class complexity and even conduct inferences on linear projections of CDTEs. We investigate the behavior of our proposal in simulations, as well as in a case study of 401(k) eligibility effects on wealth. 
  3. Motivated by a genome‐wide association study on the glomerular filtration rate, we develop a new robust test for longitudinal data to detect the effects of biomarkers in high‐dimensional quantile regression, in the presence of prespecified control variables. The test is based on the sum of score‐type statistics deduced from conditional quantile regression. The test statistic is constructed in a working‐independent manner, but the calibration reflects the intrinsic within‐subject correlation. Therefore, the test takes advantage of the feature of longitudinal data and provides more information than those based on only one measurement for each subject. Asymptotic properties of the proposed test statistic are established under both the null and local alternative hypotheses. Simulation studies show that the proposed test can control the family‐wise error rate well, while providing competitive power. The proposed method is applied to the motivating glomerular filtration rate data to test the overall significance of a large number of candidate single‐nucleotide polymorphisms that are possibly associated with Type 1 diabetes, conditioning on the patients' demographics.
  4. Abstract

    Aim

    Animal movement is an important determinant of individual survival, population dynamics and ecosystem structure and function. Nonetheless, it is still unclear how local movements are related to resource availability and the spatial arrangement of resources. Using resident bird species and migratory bird species outside the migratory period, we examined how the distribution of resources affects the movement patterns of both large terrestrial birds (e.g., raptors, bustards and hornbills) and waterbirds (e.g., cranes, storks, ducks, geese and flamingos).

    Location

    Global.

    Time period

    2003–2015.

    Major taxa studied

    Birds.

    Methods

    We compiled GPS tracking data for 386 individuals across 36 bird species. We calculated the straight‐line distance between GPS locations of each individual at the 1‐hr and 10‐day time‐scales. For each individual and time‐scale, we calculated the median and 0.95 quantile of displacement. We used linear mixed‐effects models to examine the effect of the spatial arrangement of resources, measured as enhanced vegetation index homogeneity, on avian movements, while accounting for mean resource availability, body mass, diet, flight type, migratory status and taxonomy and spatial autocorrelation.

    Results

    We found a significant effect of resource spatial arrangement at the 1‐hr and 10‐day time‐scales. On average, individual movements were seven times longer in environments with homogeneously distributed resources compared with areas of low resource homogeneity. Contrary to previous work, we found no significant effect of resource availability, diet, flight type, migratory status or body mass on the non‐migratory movements of birds.

    Main conclusions

    We suggest that longer movements in homogeneous environments might reflect the need for different habitat types associated with foraging and reproduction. This highlights the importance of landscape complementarity, where habitat patches within a landscape include a range of different, yet complementary resources. As habitat homogenization increases, it might force birds to travel increasingly longer distances to meet their diverse needs.

     
  5. Abstract

    Epidemiologic studies of the short‐term effects of ambient particulate matter (PM) on the risk of acute cardiovascular or cerebrovascular events often use data from administrative databases in which only the date of hospitalization is known. A common study design for analyzing such data is the case‐crossover design, in which exposure at a time when a patient experiences an event is compared to exposure at times when the patient did not experience an event within a case‐control paradigm. However, the time of true event onset may precede hospitalization by hours or days, which can yield attenuated effect estimates. In this article, we consider a marginal likelihood estimator, a regression calibration estimator, and a conditional score estimator, as well as parametric bootstrap versions of each, to correct for this bias. All considered approaches require validation data on the distribution of the delay times. We compare the performance of the approaches in realistic scenarios via simulation, and apply the methods to analyze data from a Boston‐area study of the association between ambient air pollution and acute stroke onset. Based on both simulation and the case study, we conclude that a two‐stage regression calibration estimator with a parametric bootstrap bias correction is an effective method for correcting bias in health effect estimates arising from delayed onset in a case‐crossover study.

     