NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Precise unbiased estimation in randomized experiments using auxiliary observational data

https://doi.org/10.1515/jci-2022-0011

Gagnon-Bartsch, Johann A.; Sales, Adam C.; Wu, Edward; Botelho, Anthony F.; Erickson, John A.; Miratrix, Luke W.; Heffernan, Neil T. (January 2023, Journal of Causal Inference)

Abstract Randomized controlled trials (RCTs) admit unconfounded design-based inference – randomization largely justifies the assumptions underlying statistical effect estimates – but often have limited sample sizes. However, researchers may have access to big observational data on covariates and outcomes from RCT nonparticipants. For example, data from A/B tests conducted within an educational technology platform exist alongside historical observational data drawn from student logs. We outline a design-based approach to using such observational data for variance reduction in RCTs. First, we use the observational data to train a machine learning algorithm predicting potential outcomes using covariates and then use that algorithm to generate predictions for RCT participants. Then, we use those predictions, perhaps alongside other covariates, to adjust causal effect estimates with a flexible, design-based covariate-adjustment routine. In this way, there is no danger of biases from the observational data leaking into the experimental estimates, which are guaranteed to be exactly unbiased regardless of whether the machine learning models are “correct” in any sense or whether the observational samples closely resemble RCT samples. We demonstrate the method in analyzing 33 randomized A/B tests and show that it decreases standard errors relative to other estimators, sometimes substantially.
more » « less
Full Text Available
Widespread attenuating changes in brain connectivity associated with the general factor of psychopathology in 9- and 10-year olds

https://doi.org/10.1038/s41398-021-01708-w

Sripada, Chandra; Angstadt, Mike; Taxali, Aman; Kessler, Daniel; Greathouse, Tristan; Rutherford, Saige; Clark, D_Angus; Hyde, Luke_W; Weigard, Alex; Brislin, Sarah_J; et al (November 2021, Translational Psychiatry)

Abstract Convergent research identifies a general factor (“P factor”) that confers transdiagnostic risk for psychopathology. Large-scale networks are key organizational units of the human brain. However, studies of altered network connectivity patterns associated with the P factor are limited, especially in early adolescence when most mental disorders are first emerging. We studied 11,875 9- and 10-year olds from the Adolescent Brain and Cognitive Development (ABCD) study, of whom 6593 had high-quality resting-state scans. Network contingency analysis was used to identify altered interconnections associated with the P factor among 16 large-scale networks. These connectivity changes were then further characterized with quadrant analysis that quantified the directionality of P factor effects in relation to neurotypical patterns of positive versus negative connectivity across connections. The results showed that the P factor was associated with altered connectivity across 28 network cells (i.e., sets of connections linking pairs of networks);p_PERMUTATIONvalues < 0.05 FDR-corrected for multiple comparisons. Higher P factor scores were associated with hypoconnectivity within default network and hyperconnectivity between default network and multiple control networks. Among connections within these 28 significant cells, the P factor was predominantly associated with “attenuating” effects (67%;p_PERMUTATION < 0.0002), i.e., reduced connectivity at neurotypically positive connections and increased connectivity at neurotypically negative connections. These results demonstrate that the general factor of psychopathology produces attenuating changes across multiple networks including default network, involved in spontaneous responses, and control networks involved in cognitive control. Moreover, they clarify mechanisms of transdiagnostic risk for psychopathology and invite further research into developmental causes of distributed attenuated connectivity.
more » « less
The LOOP Estimator: Adjusting for Covariates in Randomized Experiments

https://doi.org/10.1177/0193841X18808003

Wu, Edward; Gagnon-Bartsch, Johann_A (November 2018, Evaluation Review)

Background:When conducting a randomized controlled trial, it is common to specify in advance the statistical analyses that will be used to analyze the data. Typically, these analyses will involve adjusting for small imbalances in baseline covariates. However, this poses a dilemma, as adjusting for too many covariates can hurt precision more than it helps, and it is often unclear which covariates are predictive of outcome prior to conducting the experiment. Objectives:This article aims to produce a covariate adjustment method that allows for automatic variable selection, so that practitioners need not commit to any specific set of covariates prior to seeing the data. Results:In this article, we propose the “leave-one-out potential outcomes” estimator. We leave out each observation and then impute that observation’s treatment and control potential outcomes using a prediction algorithm such as a random forest. In addition to allowing for automatic variable selection, this estimator is unbiased under the Neyman–Rubin model, generally performs at least as well as the unadjusted estimator, and the experimental randomization largely justifies the statistical assumptions made.
more » « less
An Iterated Block Particle Filter for Inference on Coupled Dynamic Systems With Shared and Unit-Specific Parameters

https://doi.org/10.5705/ss.202022.0188

Ionides, Edward; Ning, Ning; Wheeler, Jesse (January 2024, Statistica Sinica)

Full Text Available
Conjuring Power from a Theory of Change: The PWRD Method for Trials with Anticipated Variation in Effects

https://doi.org/10.1080/19345747.2022.2142178

Lycurgus, Timothy; Hansen, Ben B.; White, Mark (October 2023, Journal of Research on Educational Effectiveness)

Full Text Available
Graph-aware modeling of brain connectivity networks

https://doi.org/10.1214/22-AOAS1709

Kim, Yura; Kessler, Daniel; Levina, Elizaveta (September 2023, The Annals of Applied Statistics)

Functional connections in the brain are frequently represented by weighted networks, with nodes representing locations in the brain and edges representing the strength of connectivity between these locations. One challenge in analyzing such data is that inference at the individual edge level is not particularly biologically meaningful; interpretation is more useful at the level of so-called functional systems or groups of nodes and connections between them; this is often called “graph-aware” inference in the neuroimaging literature. However, pooling over functional regions leads to significant loss of information and lower accuracy. Another challenge is correlation among edge weights within a subject which makes inference based on independence assumptions unreliable. We address both of these challenges with a linear mixed effects model, which accounts for functional systems and for edge dependence, while still modeling individual edge weights to avoid loss of information. The model allows for comparing two populations, such as patients and healthy controls, both at the functional regions level and at individual edge level, leading to biologically meaningful interpretations. We fit this model to resting state fMRI data on schizophrenic patients and healthy controls, obtaining interpretable results consistent with the schizophrenia literature.
more » « less
Full Text Available
Inference and Estimation for Random Effects in High-Dimensional Linear Mixed Models

https://doi.org/10.1080/01621459.2021.2004896

Law, Michael; Ritov, Ya’acov (July 2023, Journal of the American Statistical Association)

Full Text Available
Identifiability and inference of phylogenetic birth–death models

https://doi.org/10.1016/j.jtbi.2023.111520

Legried, Brandon; Terhorst, Jonathan (July 2023, Journal of Theoretical Biology)

Full Text Available
Approximate Post-Selective Inference for Regression with the Group LASSO

Panigrahi, S.; MacDonald, P.W.; Kessler, D. (March 2023, Journal of machine learning research)

Full Text Available
Testing attributable effects hypotheses with an application to the Oregon Health Insurance Experiment

https://doi.org/10.4310/22-SII724

Fredrickson, Mark M.; Chen, Yuguo (January 2023, Statistics and Its Interface)

Full Text Available

« Prev Next »

Search for: All records