skip to main content


Title: Reasoning about causal models with infinitely many variables,
Generalized structural equations models (GSEMs) are, as the name suggests, a generalization of structural equations models (SEMs). They can deal with (among other things) infinitely many variables with infinite ranges, which is critical for capturing dynamical systems. We provide a sound and complete axiomatization of causal reasoning in GSEMs that is an extension of the sound and complete axiomatization provided by Halpern for SEMs. Considering GSEMs helps clarify what properties Halpern's axioms capture.  more » « less
Award ID(s):
1718108
PAR ID:
10414205
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence
Page Range / eLocation ID:
5668--5675
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Generalized structural equations models (GSEMs) are, as the name suggests, a generalization of structural equations models (SEMs). They can deal with (among other things) infinitely many variables with infinite ranges, which is critical for capturing dynamical systems. We provide a sound and complete axiomatization of causal reasoning in GSEMs that is an extension of the sound and complete axiomatization provided by Halpern for SEMs. Considering GSEMs helps clarify what properties Halpern's axioms capture. 
    more » « less
  2. Abstract

    Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject‐level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation‐based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large‐population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation‐based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS‐CoV‐2 dynamics, and apply our method to national surveillance counts from the 2013–2015 West Africa Ebola outbreak.

     
    more » « less
  3. In real-world phenomena which involve mutual influence or causal effects between interconnected units, equilibrium states are typically represented with cycles in graphical models. An expressive class of graphical models, relational causal models, can represent and reason about complex dynamic systems exhibiting such cycles or feedback loops. Existing cyclic causal discovery algorithms for learning causal models from observational data assume that the data instances are independent and identically distributed which makes them unsuitable for relational causal models. At the same time, causal discovery algorithms for relational causal models assume acyclicity. In this work, we examine the necessary and sufficient conditions under which a constraint-based relational causal discovery algorithm is sound and complete for cyclic relational causal models. We introduce relational acyclification, an operation specifically designed for relational models that enables reasoning about the identifiability of cyclic relational causal models. We show that under the assumptions of relational acyclification and sigma-faithfulness, the relational causal discovery algorithm RCD is sound and complete for cyclic relational models. We present experimental results to support our claim. 
    more » « less
  4. Recent advancements in physics-informed machine learning have contributed to solving partial differential equations through means of a neural network. Following this, several physics-informed neural network works have followed to solve inverse problems arising in structural health monitoring. Other works involving physics-informed neural networks solve the wave equation with partial data and modeling wavefield data generator for efficient sound data generation. While a lot of work has been done to show that partial differential equations can be solved and identified using a neural network, little work has been done the same with more basic machine learning (ML) models. The advantage with basic ML models is that the parameters learned in a simpler model are both more interpretable and extensible. For applications such as ultrasonic nondestructive evaluation, this interpretability is essential for trustworthiness of the methods and characterization of the material system under test. In this work, we show an interpretable, physics-informed representation learning framework that can analyze data across multiple dimensions (e.g., two dimensions of space and one dimension of time). The algorithm comes with convergence guarantees. In addition, our algorithm provides interpretability of the learned model as the parameters correspond to the individual solutions extracted from data. We demonstrate how this algorithm functions with wavefield videos. 
    more » « less
  5. Abstract

    A central goal at the interface of ecology and conservation is understanding how the relationship between biodiversity and ecosystem function (B–EF) will shift with changing climate. Despite recent theoretical advances, studies which examine temporal variation in the functional traits and mechanisms (mass ratio effects and niche complementarity effects) that underpin the B–EF relationship are lacking.

    Here, we use 13 years of data on plant species composition, plant traits, local‐scale abiotic variables, above‐ground net primary productivity (ANPP), and climate from the alpine tundra of Colorado (USA) to investigate temporal dynamics in the B–EF relationship. To assess how changing climatic conditions may alter the B–EF relationship, we built structural equation models (SEMs) for 11 traits across 13 years and evaluated the power of different trait SEMs to predict ANPP, as well as the relative contributions of mass ratio effects (community‐weighted mean trait values; CWM), niche complementarity effects (functional dispersion; FDis) and local abiotic variables. Additionally, we coupled linear mixed effects models with Multimodel inference methods to assess how inclusion of trait–climate interactions might improve our ability to predict ANPP through time.

    In every year, at least one SEM exhibited good fit, explaining between 19.6% and 57.2% of the variation in ANPP. However, the identity of the trait which best explained ANPP changed depending on winter precipitation, with leaf area, plant height and foliar nitrogen isotope content (δ15N) SEMs performing best in high, middle and low precipitation years, respectively. Regardless of trait identity, CWMs exerted a stronger influence on ANPP than FDis and total biotic effects were always greater than total abiotic effects. Multimodel inference reinforced the results of SEM analysis, with the inclusion of climate–trait interactions marginally improving our ability to predict ANPP through time.

    Synthesis. Our results suggest that temporal variation in climatic conditions influences which traits, mechanisms and abiotic variables were most responsible for driving the B–EF relationship. Importantly, our findings suggest that future research should consider temporal variability in the B–EF relationship, particularly how the predictive power of individual functional traits and abiotic variables may fluctuate as conditions shift due to climate change.

     
    more » « less