Precise unbiased estimation in randomized experiments using auxiliary observational data

Gagnon-Bartsch, Johann A.; Sales, Adam C.; Wu, Edward; Botelho, Anthony F.; Erickson, John A.; Miratrix, Luke W.; Heffernan, Neil T.

doi:10.1515/jci-2022-0011

Citation Details

Precise unbiased estimation in randomized experiments using auxiliary observational data

Abstract Randomized controlled trials (RCTs) admit unconfounded design-based inference – randomization largely justifies the assumptions underlying statistical effect estimates – but often have limited sample sizes. However, researchers may have access to big observational data on covariates and outcomes from RCT nonparticipants. For example, data from A/B tests conducted within an educational technology platform exist alongside historical observational data drawn from student logs. We outline a design-based approach to using such observational data for variance reduction in RCTs. First, we use the observational data to train a machine learning algorithm predicting potential outcomes using covariates and then use that algorithm to generate predictions for RCT participants. Then, we use those predictions, perhaps alongside other covariates, to adjust causal effect estimates with a flexible, design-based covariate-adjustment routine. In this way, there is no danger of biases from the observational data leaking into the experimental estimates, which are guaranteed to be exactly unbiased regardless of whether the machine learning models are “correct” in any sense or whether the observational samples closely resemble RCT samples. We demonstrate the method in analyzing 33 randomized A/B tests and show that it decreases standard errors relative to other estimators, sometimes substantially. more »

Award ID(s):: 1646108 1931419

PAR ID:: 10473828

Author(s) / Creator(s):: Gagnon-Bartsch, Johann A.; Sales, Adam C.; Wu, Edward; Botelho, Anthony F.; Erickson, John A.; Miratrix, Luke W.; Heffernan, Neil T.

Publisher / Repository:: De Gruyter

Date Published:: 2023-01-01

Journal Name:: Journal of Causal Inference

Volume:: 11

Issue:: 1

ISSN:: 2193-3685

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1515/jci-2022-0011

More Like this