skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A Bracketing Relationship between Difference-in-Differences and Lagged-Dependent-Variable Adjustment
Difference-in-differences is a widely used evaluation strategy that draws causal inference from observational panel data. Its causal identification relies on the assumption of parallel trends, which is scale-dependent and may be questionable in some applications. A common alternative is a regression model that adjusts for the lagged dependent variable, which rests on the assumption of ignorability conditional on past outcomes. In the context of linear models, Angrist and Pischke (2009) show that the difference-in-differences and lagged-dependent-variable regression estimates have a bracketing relationship. Namely, for a true positive effect, if ignorability is correct, then mistakenly assuming parallel trends will overestimate the effect; in contrast, if the parallel trends assumption is correct, then mistakenly assuming ignorability will underestimate the effect. We show that the same bracketing relationship holds in general nonparametric (model-free) settings. We also extend the result to semiparametric estimation based on inverse probability weighting. We provide three examples to illustrate the theoretical results with replication files in Ding and Li (2019).  more » « less
Award ID(s):
1713152
PAR ID:
10167774
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Political Analysis
Volume:
27
Issue:
4
ISSN:
1047-1987
Page Range / eLocation ID:
605 to 615
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. We develop an omitted variable bias framework for sensitivity analysis of instrumental variable estimates that naturally handles multiple side effects (violations of the exclusion restriction assumption) and confounders (violations of the ignorability of the instrument assumption) of the instrument, exploits expert knowledge to bound sensitivity parameters and can be easily implemented with standard software. Specifically, we introduce sensitivity statistics for routine reporting, such as (extreme) robustness values for instrumental variables, describing the minimum strength that omitted variables need to have to change the conclusions of a study. Next, we provide visual displays that fully characterize the sensitivity of point estimates and confidence intervals to violations of the standard instrumental variable assumptions. Finally, we offer formal bounds on the worst possible bias under the assumption that the maximum explanatory power of omitted variables is no stronger than a multiple of the explanatory power of observed variables. Conveniently, many pivotal conclusions regarding the sensitivity of the instrumental variable estimate (e.g., tests against the null hypothesis of a zero causal effect) can be reached simply through separate sensitivity analyses of the effect of the instrument on the treatment (the first stage) and the effect of the instrument on the outcome (the reduced form). We apply our methods in a running example that uses proximity to college as an instrumental variable to estimate the returns to schooling. 
    more » « less
  2. null (Ed.)
    Many ecological studies and conservation policies are based on field observations of species, which can be affected by systematic variability introduced by the observation process. A recently introduced causal modeling technique called 'half-sibling regression' can detect and correct for systematic errors in measurements of multiple independent random variables. However, it will remove intrinsic variability if the variables are dependent, and therefore does not apply to many situations, including modeling of species counts that are controlled by common causes. We present a technique called 'three-quarter sibling regression' to partially overcome this limitation. It can filter the effect of systematic noise when the latent variables have observed common causes. We provide theoretical justification of this approach, demonstrate its effectiveness on synthetic data, and show that it reduces systematic detection variability due to moon brightness in moth surveys. 
    more » « less
  3. Traffic crashes significantly contribute to global fatalities, particularly in urban areas, highlighting the need to evaluate the relationship between urban environments and traffic safety. This study extends former spatial modeling frameworks by drawing paths between global models, including spatial lag (SLM), and spatial error (SEM), and local models, including geographically weighted regression (GWR), multi-scale geographically weighted regression (MGWR), and multi-scale geographically weighted regression with spatially lagged dependent variable (MGWRL). Utilizing the proposed framework, this study analyzes severe traffic crashes in relation to urban built environments using various spatial regression models within Leon County, Florida. According to the results, SLM outperforms OLS, SEM, and GWR models. Local models with lagged dependent variables outperform both the global and generic versions of the local models in all performance measures, whereas MGWR and MGWRL outperform GWR and GWRL. Local models performed better than global models, showing spatial non-stationarity; so, the relationship between the dependent and independent variables varies over space. The better performance of models with lagged dependent variables signifies that the spatial distribution of severe crashes is correlated. Finally, the better performance of multi-scale local models than classical local models indicates varying influences of independent variables with different bandwidths. According to the MGWRL model, census block groups close to the urban area with higher population, higher education level, and lower car ownership rates have lower crash rates. On the contrary, motor vehicle percentage for commuting is found to have a negative association with severe crash rate, which suggests the locality of the mentioned associations. 
    more » « less
  4. Abstract To examine the atmospheric responses to Arctic sea-ice variability in the Northern Hemisphere cold season (October to following March), this study uses a coordinated set of large-ensemble experiments of nine atmospheric general circulation models (AGCMs) forced with observed daily-varying sea-ice, sea-surface temperature, and radiative forcings prescribed during the 1979-2014 period, together with a parallel set of experiments where Arctic sea ice is substituted by its climatology. The simulations of the former set reproduce the near-surface temperature trends in reanalysis data, with similar amplitude, and their multi-model ensemble mean (MMEM) shows decreasing sea-level pressure over much of the polar cap and Eurasia in boreal autumn. The MMEM difference between the two experiments allows isolating the effects of Arctic sea-ice loss, which explain a large portion of the Arctic warming trends in the lower troposphere and drives a small but statistically significant weakening of the wintertime Arctic Oscillation. The observed interannual co-variability between sea-ice extent in the Barents-Kara Seas and lagged atmospheric circulation is distinguished from the effects of confounding factors based on multiple regression, and quantitatively compared to the co-variability in MMEMs. The interannual sea-ice decline followed by a negative North Atlantic Oscillation-like anomaly found in observations is also seen in the MMEM differences, with consistent spatial structure but much smaller amplitude. This result suggests that the sea-ice impacts on trends and interannual atmospheric variability simulated by AGCMs could be underestimated, but caution is needed because internal atmospheric variability may have affected the observed relationship. 
    more » « less
  5. Matching is one of the simplest approaches for estimating causal effects from observational data. Matching techniques compare the observed outcomes across pairs of individuals with similar covariate values but different treatment statuses in order to estimate causal effects. However, traditional matching techniques are unreliable given high-dimensional covariates due to the infamous curse of dimensionality. To overcome this challenge, we propose a simple, fast, yet highly effective approach to matching using Random Hyperplane Tessellations (RHPT). First, we prove that the RHPT representation is an approximate balancing score – thus maintaining the strong ignorability assumption – and provide empirical evidence for this claim. Second, we report results of extensive experiments showing that matching using RHPT outperforms traditional matching techniques and is competitive with state-of-the-art deep learning methods for causal effect estimation. In addition, RHPT avoids the need for computationally expensive training of deep neural networks. 
    more » « less