skip to main content


Title: Statistical inference using GLEaM model with spatial heterogeneity and correlation between regions
Abstract

A better understanding of various patterns in the coronavirus disease 2019 (COVID-19) spread in different parts of the world is crucial to its prevention and control. Motivated by the previously developed Global Epidemic and Mobility (GLEaM) model, this paper proposes a new stochastic dynamic model to depict the evolution of COVID-19. The model allows spatial and temporal heterogeneity of transmission parameters and involves transportation between regions. Based on the proposed model, this paper also designs a two-step procedure for parameter inference, which utilizes the correlation between regions through a prior distribution that imposes graph Laplacian regularization on transmission parameters. Experiments on simulated data and real-world data in China and Europe indicate that the proposed model achieves higher accuracy in predicting the newly confirmed cases than baseline models.

 
more » « less
Award ID(s):
2007040
NSF-PAR ID:
10372973
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Volume:
12
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)

    Mathematical models are widely recognized as an important tool for analyzing and understanding the dynamics of infectious disease outbreaks, predict their future trends, and evaluate public health intervention measures for disease control and elimination. We propose a novel stochastic metapopulation state-space model for COVID-19 transmission, which is based on a discrete-time spatio-temporal susceptible, exposed, infected, recovered, and deceased (SEIRD) model. The proposed framework allows the hidden SEIRD states and unknown transmission parameters to be estimated from noisy, incomplete time series of reported epidemiological data, by application of unscented Kalman filtering (UKF), maximum-likelihood adaptive filtering, and metaheuristic optimization. Experiments using both synthetic data and real data from the Fall 2020 COVID-19 wave in the state of Texas demonstrate the effectiveness of the proposed model.

     
    more » « less
  2. There is an ongoing debate on the different transmission modes of SARS-CoV-2 and their relative contributions to the pandemic. In this paper, we employ a simple mathematical model, which incorporates both the human-to-human and environment-to-human transmission routes, to study the transmission dynamics of COVID-19. We focus our attention on the role of airborne transmission in the spread of the disease in a university campus setting. We conduct both mathematical analysis and numerical simulation, and incorporate published experimental data for the viral concentration in the air to fit model parameters. Meanwhile, we compare the outcome to that of the standard SIR model, utilizing a perturbation analysis in the presence of multiple time scales. Our data fitting and numerical simulation results show that the risk of airborne transmission for SARS-CoV-2 strongly depends on how long the virus can remain viable in the air. If the time for this viability is short, the airborne transmission route would be inconsequential in shaping the overall transmission risk and the total infection size. On the other hand, if the infectious virus can persist in aerosols beyond a few hours, then airborne transmission could play a much more significant role in the spread of COVID-19.

     
    more » « less
  3. We propose a modified population-based susceptible-exposed-infectious-recovered (SEIR) compartmental model for a retrospective study of the COVID-19 transmission dynamics in India during the first wave. We extend the conventional SEIR methodology to account for the complexities of COVID-19 infection, its multiple symptoms, and transmission pathways. In particular, we consider a time-dependent transmission rate to account for governmental controls (e.g., national lockdown) and individual behavioral factors (e.g., social distancing, mask-wearing, personal hygiene, and self-quarantine). An essential feature of COVID-19 that is different from other infections is the significant contribution of asymptomatic and pre-symptomatic cases to the transmission cycle. A Bayesian method is used to calibrate the proposed SEIR model using publicly available data (daily new tested positive, death, and recovery cases) from several Indian states. The uncertainty of the parameters is naturally expressed as the posterior probability distribution. The calibrated model is used to estimate undetected cases and study different initial intervention policies, screening rates, and public behavior factors, that can potentially strike a balance between disease control and the humanitarian crisis caused by a sudden strict lockdown. 
    more » « less
  4. The COVID-19 pandemic has significantly impacted most countries in the world. Analyzing COVID-19 data from these countries together is a prominent challenge. Under the sponsorship of NSF REU, this paper describes our experience with a ten-week project that aims to guide an REU scholar to develop a physics-guided graph attention network to predict the global COVID- 19 Pandemics. We mainly presented the preparation, implementation, and dissemination of the addressed project. The COVID-19 situation in a country could be dramatically different from that of others, which suggests that COVID-19 pandemic data are generated based on different mechanisms, making COVID-19 data in different countries follow different probability distributions. Learning more than one hundred underlying probability distributions for countries in the world from large scale COVID- 19 data is beyond a single machine learning model. To address this challenge, we proposed two team-learning frameworks for predicting the COVID-19 pandemic trends: peer learning and layered ensemble learning framework. This addressed framework assigns an adaptive physics-guided graph attention network (GAT) to each learning agent. All the learning agents are fabricated in a hierarchical architecture, which enables agents to collaborate with each other in peer-to-peer and cross-layer way. This layered architecture shares the burden of large-scale data processing on machine learning models of all units. Experiments are run to verify the effectiveness of our approaches. The results indicate the proposed ensemble outperforms baseline methods. Besides being documented on GitHub, this work has resulted in two journal papers. 
    more » « less
  5. Since the pandemic of COVID-19 began in January 2020, the world has witnessed drastic social-economic changes. To harness the virus spread, several studies have been done to study contributing factors that are pertinent to COVID-19 transmission risks. However, little has been done to investigate how human activities on the spatial network are correlated to the virus transmission and spread. This paper performs a statistical analysis to examine interrelationships between spatial network characteristics and cumulative cases of COVID-19 in US counties. Specifically, both county-level transportation profiles (e.g., the total number of commute workers, route miles of freight railroad) and road network characteristics of US counties are considered. Then, the lasso regression model is utilized to identify a sparse set of significant variables that are sensitive to the response variable of COVID-19 cases. Finally, the fixed-effect model is built to capture the relationship between the selected set of predictors and the response variable. This work helps identify and determine salient features from spatial network characteristics and transportation profiles, thereby improving the understanding of COVID-19 spread dynamics. These significant variables can also be utilized to develop simulation models for the prediction of real-time positions of virus spread and the optimization of intervention strategies. 
    more » « less