Abstract Propensity score weighting is a tool for causal inference to adjust for measured confounders in observational studies. In practice, data often present complex structures, such as clustering, which make propensity score modeling and estimation challenging. In addition, for clustered data, there may be unmeasured cluster-level covariates that are related to both the treatment assignment and outcome. When such unmeasured cluster-specific confounders exist and are omitted in the propensity score model, the subsequent propensity score adjustment may be biased. In this article, we propose a calibration technique for propensity score estimation under the latent ignorable treatment assignment mechanism, i. e., the treatment-outcome relationship is unconfounded given the observed covariates and the latent cluster-specific confounders. We impose novel balance constraints which imply exact balance of the observed confounders and the unobserved cluster-level confounders between the treatment groups. We show that the proposed calibrated propensity score weighting estimator is doubly robust in that it is consistent for the average treatment effect if either the propensity score model is correctly specified or the outcome follows a linear mixed effects model. Moreover, the proposed weighting method can be combined with sampling weights for an integrated solution to handle confounding and sampling designs for causal inference with clustered survey data. In simulation studies, we show that the proposed estimator is superior to other competitors. We estimate the effect of School Body Mass Index Screening on prevalence of overweight and obesity for elementary schools in Pennsylvania.
more »
« less
Nonparametric identification of causal effects in clustered observational studies with differential selection
The clustered observational study (COS) design is the observational counterpart to the clustered randomized trial. COSs are common in both education and health services research. In education, treatments may be given to all students within some schools but withheld from all students in other schools. In health studies, treatments may be applied to clusters such as hospitals or groups of patients treated by the same physician. In this paper, we study the identification of causal effects in COS designs. We focus on the prospect of differential selection of units to clusters, which occurs when the units’ cluster selections depend on the clusters’ treatment assignments. Extant work on COSs has made an implicit assumption that rules out the presence of differential selection. We derive the identification results for designs with differential selection and that contexts with differential cluster selection require different adjustment sets than standard designs. We outline estimators for designs with and without differential selection. Using a series of simulations, we outline the magnitude of the bias that can occur with differential selection. We then present 2 empirical applications focusing on the likelihood of differential selection.
more »
« less
- Award ID(s):
- 2113171
- PAR ID:
- 10520180
- Publisher / Repository:
- Oxford Academic
- Date Published:
- Journal Name:
- Journal of the Royal Statistical Society Series A: Statistics in Society
- ISSN:
- 0964-1998
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Engineering projects, such as designing a solar farm that converts solar radiation shined on the Earth into electricity, engage students in addressing real-world challenges by learning and applying geoscience knowledge. To improve their designs, students benefit from frequent and informative feedback as they iterate. However, teacher attention may be limited or inadequate, both during COVID-19 and beyond. We present Aladdin, a web-based computer-aided design (CAD) platform for engineering design with a built-in artificial intelligence teaching assistant (AITA). We also present two curriculum units (Solar Energy Science and Solar Farm Design), where students explore the Sun-Earth relationship and optimize the energy output and yearly profit of a solar farm with the help of the AITA. We tested the software and curriculum units with over 100 students in two Midwestern high schools. Pre- and post-survey data showed improvements in understanding of science concepts and self-efficacy in engineering design. Pre-post analysis of design performance gains reveals that AI helped lower achievers more than higher achievers. Interviews revealed students’ values and preferences when receiving feedback. Our findings suggest that AITAs may be helpful as an additional feedback mechanism for geoscience and engineering education. Future efforts should focus on improving the usability of the software and providing multiple types of feedback to promote inclusive and equitable use of AI in education.more » « less
-
null (Ed.)ABSTRACT Dynamically relaxed galaxy clusters have long played an important role in galaxy cluster studies because it is thought their properties can be reconstructed more precisely and with less systematics. As relaxed clusters are desirable, there exist a plethora of criteria for classifying a galaxy cluster as relaxed. In this work, we examine 9 commonly used observational and theoretical morphological metrics extracted from $$54\, 000$$mock-X synthetic X-ray images of galaxy clusters taken from the IllustrisTNG, BAHAMAS, and MACSIS simulation suites. We find that the simulated criteria distributions are in reasonable agreement with the observed distributions. Many criteria distributions evolve as a function of redshift, cluster mass, numerical resolution, and subgrid physics, limiting the effectiveness of a single relaxation threshold value. All criteria are positively correlated with each other, however, the strength of the correlation is sensitive to redshift, mass, and numerical choices. Driven by the intrinsic scatter inherent to all morphological metrics and the arbitrary nature of relaxation threshold values, we find the consistency of relaxed subsets defined by the different metrics to be relatively poor. Therefore, the use of relaxed cluster subsets introduces significant selection effects that are non-trivial to resolve.more » « less
-
Cluster detection is important and widely used in a variety of applications, including public health, public safety, transportation, and so on. Given a collection of data points, we aim to detect density-connected spatial clusters with varying geometric shapes and densities, under the constraint that the clusters are statistically significant. The problem is challenging, because many societal applications and domain science studies have low tolerance for spurious results, and clusters may have arbitrary shapes and varying densities. As a classical topic in data mining and learning, a myriad of techniques have been developed to detect clusters with both varying shapes and densities (e.g., density-based, hierarchical, spectral, or deep clustering methods). However, the vast majority of these techniques do not consider statistical rigor and are susceptible to detecting spurious clusters formed as a result of natural randomness. On the other hand, scan statistic approaches explicitly control the rate of spurious results, but they typically assume a single “hotspot” of over-density and many rely on further assumptions such as a tessellated input space. To unite the strengths of both lines of work, we propose a statistically robust formulation of a multi-scale DBSCAN, namely Significant DBSCAN+, to identify significant clusters that are density connected. As we will show, incorporation of statistical rigor is a powerful mechanism that allows the new Significant DBSCAN+ to outperform state-of-the-art clustering techniques in various scenarios. We also propose computational enhancements to speed-up the proposed approach. Experiment results show that Significant DBSCAN+ can simultaneously improve the success rate of true cluster detection (e.g., 10–20% increases in absolute F1 scores) and substantially reduce the rate of spurious results (e.g., from thousands/hundreds of spurious detections to none or just a few across 100 datasets), and the acceleration methods can improve the efficiency for both clustered and non-clustered data.more » « less
-
Multi-agent dynamical systems refer to scenarios where multiple units (aka agents) interact with each other and evolve collectively over time. For instance, people’s health conditions are mutually influenced. Receiving vaccinations not only strengthens the longterm health status of one unit but also provides protection for those in their immediate surroundings. To make informed decisions in multi-agent dynamical systems, such as determining the optimal vaccine distribution plan, it is essential for decision-makers to estimate the continuous-time counterfactual outcomes. However, existing studies of causal inference over time rely on the assumption that units are mutually independent, which is not valid for multi-agent dynamical systems. In this paper, we aim to bridge this gap and study how to estimate counterfactual outcomes in multi-agent dynamical systems. Causal inference in a multi-agent dynamical system has unique challenges: 1) Confounders are timevarying and are present in both individual unit covariates and those of other units; 2) Units are affected by not only their own but also others’ treatments; 3) The treatments are naturally dynamic, such as receiving vaccines and boosters in a seasonal manner. To this end, we model a multi-agent dynamical system as a graph and propose a novel model called CF-GODE (CounterFactual Graph Ordinary Differential Equations). CF-GODE is a causal model that estimates continuous-time counterfactual outcomes in the presence of inter-dependencies between units. To facilitate continuous-time estimation,we propose Treatment-Induced GraphODE, a novel ordinary differential equation based on graph neural networks (GNNs), which can incorporate dynamical treatments as additional inputs to predict potential outcomes over time. To remove confounding bias, we propose two domain adversarial learning based objectives that learn balanced continuous representation trajectories, which are not predictive of treatments and interference. We further provide theoretical justification to prove their effectiveness. Experiments on two semi-synthetic datasets confirm that CF-GODE outperforms baselines on counterfactual estimation. We also provide extensive analyses to understand how our model works.more » « less