skip to main content


Title: Exo-SIR: an epidemiological model to analyze the impact of exogenous spread of infection
Epidemics like Covid-19 and Ebola have impacted people’s lives signifcantly. The impact of mobility of people across the countries or states in the spread of epidemics has been signifcant. The spread of disease due to factors local to the population under consideration is termed the endogenous spread. The spread due to external factors like migration, mobility, etc., is called the exogenous spread. In this paper, we introduce the Exo-SIR model, an extension of the popular SIR model and a few variants of the model. The novelty in our model is that it captures both the exogenous and endogenous spread of the virus. First, we present an analytical study. Second, we simulate the Exo-SIR model with and without assuming contact network for the population. Third, we implement the Exo-SIR model on real datasets regarding Covid-19 and Ebola. We found that endogenous infection is infuenced by exogenous infection. Furthermore, we found that the Exo-SIR model predicts the peak time better than the SIR model. Hence, the Exo-SIR model would be helpful for governments to plan policy interventions at the time of a pandemic. Keywords Covid-19, Ebola, Epidemic modeling, Compartment model, Exogenous infection, Endogenous infection, SIR, Exo-SIR  more » « less
Award ID(s):
2133842
NSF-PAR ID:
10339013
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
International Journal of Data Science and Analytics
ISSN:
2364-415X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract This project is funded by the US National Science Foundation (NSF) through their NSF RAPID program under the title “Modeling Corona Spread Using Big Data Analytics.” The project is a joint effort between the Department of Computer & Electrical Engineering and Computer Science at FAU and a research group from LexisNexis Risk Solutions. The novel coronavirus Covid-19 originated in China in early December 2019 and has rapidly spread to many countries around the globe, with the number of confirmed cases increasing every day. Covid-19 is officially a pandemic. It is a novel infection with serious clinical manifestations, including death, and it has reached at least 124 countries and territories. Although the ultimate course and impact of Covid-19 are uncertain, it is not merely possible but likely that the disease will produce enough severe illness to overwhelm the worldwide health care infrastructure. Emerging viral pandemics can place extraordinary and sustained demands on public health and health systems and on providers of essential community services. Modeling the Covid-19 pandemic spread is challenging. But there are data that can be used to project resource demands. Estimates of the reproductive number (R) of SARS-CoV-2 show that at the beginning of the epidemic, each infected person spreads the virus to at least two others, on average (Emanuel et al. in N Engl J Med. 2020, Livingston and Bucher in JAMA 323(14):1335, 2020). A conservatively low estimate is that 5 % of the population could become infected within 3 months. Preliminary data from China and Italy regarding the distribution of case severity and fatality vary widely (Wu and McGoogan in JAMA 323(13):1239–42, 2020). A recent large-scale analysis from China suggests that 80 % of those infected either are asymptomatic or have mild symptoms; a finding that implies that demand for advanced medical services might apply to only 20 % of the total infected. Of patients infected with Covid-19, about 15 % have severe illness and 5 % have critical illness (Emanuel et al. in N Engl J Med. 2020). Overall, mortality ranges from 0.25 % to as high as 3.0 % (Emanuel et al. in N Engl J Med. 2020, Wilson et al. in Emerg Infect Dis 26(6):1339, 2020). Case fatality rates are much higher for vulnerable populations, such as persons over the age of 80 years (> 14 %) and those with coexisting conditions (10 % for those with cardiovascular disease and 7 % for those with diabetes) (Emanuel et al. in N Engl J Med. 2020). Overall, Covid-19 is substantially deadlier than seasonal influenza, which has a mortality of roughly 0.1 %. Public health efforts depend heavily on predicting how diseases such as those caused by Covid-19 spread across the globe. During the early days of a new outbreak, when reliable data are still scarce, researchers turn to mathematical models that can predict where people who could be infected are going and how likely they are to bring the disease with them. These computational methods use known statistical equations that calculate the probability of individuals transmitting the illness. Modern computational power allows these models to quickly incorporate multiple inputs, such as a given disease’s ability to pass from person to person and the movement patterns of potentially infected people traveling by air and land. This process sometimes involves making assumptions about unknown factors, such as an individual’s exact travel pattern. By plugging in different possible versions of each input, however, researchers can update the models as new information becomes available and compare their results to observed patterns for the illness. In this paper we describe the development a model of Corona spread by using innovative big data analytics techniques and tools. We leveraged our experience from research in modeling Ebola spread (Shaw et al. Modeling Ebola Spread and Using HPCC/KEL System. In: Big Data Technologies and Applications 2016 (pp. 347-385). Springer, Cham) to successfully model Corona spread, we will obtain new results, and help in reducing the number of Corona patients. We closely collaborated with LexisNexis, which is a leading US data analytics company and a member of our NSF I/UCRC for Advanced Knowledge Enablement. The lack of a comprehensive view and informative analysis of the status of the pandemic can also cause panic and instability within society. Our work proposes the HPCC Systems Covid-19 tracker, which provides a multi-level view of the pandemic with the informative virus spreading indicators in a timely manner. The system embeds a classical epidemiological model known as SIR and spreading indicators based on causal model. The data solution of the tracker is built on top of the Big Data processing platform HPCC Systems, from ingesting and tracking of various data sources to fast delivery of the data to the public. The HPCC Systems Covid-19 tracker presents the Covid-19 data on a daily, weekly, and cumulative basis up to global-level and down to the county-level. It also provides statistical analysis for each level such as new cases per 100,000 population. The primary analysis such as Contagion Risk and Infection State is based on causal model with a seven-day sliding window. Our work has been released as a publicly available website to the world and attracted a great volume of traffic. The project is open-sourced and available on GitHub. The system was developed on the LexisNexis HPCC Systems, which is briefly described in the paper. 
    more » « less
  2. SARS-CoV-2 continues to upend human life by posing novel threats related to disease spread and mutations. Current models for the disease burden of SARS-CoV-2 consider the aggregate nature of the virus without differentiating between the potency of its multiple strains. Hence, there is a need to create a fundamental modeling framework for multi-strain viruses that considers the competing viral pathogenic pathways. Alongside the consideration that other viral pathogens may coexist, there is also a need for a generalizable modeling framework to account for multiple epidemics (i.e., multi-demics) scenarios, such as influenza and COVID-19 occurring simultaneously. We present a fundamental network thermodynamics approach for assessing, determining, and predicting viral outbreak severity, which extends well-known standard epidemiological models. In particular, we use historical data from New York City’s 2011–2019 influenza seasons and SARS-CoV-2 spread to identify the model parameters. In our model-based analysis, we employ a standard susceptible–infected–recovered (SIR) model with pertinent generalizations to account for multi-strain and multi-demics scenarios. We show that the reaction affinities underpinning the formation processes of our model can be used to categorize the severity of infectious or deceased populations. The spontaneity of occurrence captured by the change in Gibbs free energy of reaction (ΔG) in the system suggests the stability of forward occurring population transfers. The magnitude of ΔG is used to examine past influenza outbreaks and infer epidemiological factors, such as mortality and case burden. This method can be extrapolated for wide-ranging utility in computational epidemiology. The risk of overlapping multi-demics seasons between influenza and SARS-CoV-2 will persist as a significant threat in forthcoming years. Further, the possibility of mutating strains requires novel ways of analyzing the network of competing infection pathways. The approach outlined in this study allows for the identification of new stable strains and the potential increase in disease burden from a complex systems perspective, thereby allowing for a potential response to the significant question: are the effects of a multi-demic greater than the sum of its individual viral epidemics? 
    more » « less
  3. The large population movement during the Spring Festival travel in China can considerably accelerate the spread of epidemics, especially after the relaxation of strict control measures against COVID-19. This study aims to assess the impact of population migration in Spring Festival holiday on epidemic spread under different scenarios. Using inter-city population movement data, we construct the population flow network during the non-holiday time as well as the Spring Festival holiday. We build a large-scale metapopulation model to simulate the epidemic spread among 371 Chinese cities. We analyze the impact of Spring Festival travel on the peak timing and peak magnitude nationally and in each city. Assuming an R0 (basic reproduction number) of 15 and the initial conditions as the reported COVID-19 infections on 17 December 2022, model simulations indicate that the Spring Festival travel can substantially increase the national peak magnitude of infection. The infection peaks arrive at most cities 1–4 days earlier as compared to those of the non-holiday time. While peak infections in certain large cities, such as Beijing and Shanghai, are decreased due to the massive migration of people to smaller cities during the pre-Spring Festival period, peak infections increase significantly in small- or medium-sized cities. For a less transmissible disease (R0 = 5), infection peaks in large cities are delayed until after the Spring Festival. Small- or medium-sized cities may experience a larger infection due to the large-scale population migration from metropolitan areas. The increased disease burden may impose considerable strain on the healthcare systems in these resource-limited areas. For a less transmissible disease, particular attention needs to be paid to outbreaks in large cities when people resume work after holidays. 
    more » « less
  4. In this paper we develop a multiscale model combining social-force-based pedestrian movement with a population level stochastic infection transmission dynamics framework. The model is then applied to study the infection transmission within airplanes and the transmission of the Ebola virus through casual contacts. Drastic limitations on air-travel during epidemics, such as during the 2014 Ebola outbreak in West Africa, carry considerable economic and human costs. We use the computational model to evaluate the effects of passenger movement within airplanes and air-travel policies on the geospatial spread of infectious diseases. We find that boarding policy by an airline is more critical for infection propagation compared to deplaning policy. Enplaning in two sections resulted in fewer infections than the currently followed strategy with multiple zones. In addition, we found that small commercial airplanes are better than larger ones at reducing the number of new infections in a flight. Aggregated results indicate that passenger movement strategies and airplane size predicted through these network models can have significant impact on an event like the 2014 Ebola epidemic. The methodology developed here is generic and can be readily modified to incorporate the impact from the outbreak of other directly transmitted infectious diseases. 
    more » « less
  5. In this paper we develop a multiscale model combining social-force-based pedestrian movement with a population level stochastic infection transmission dynamics framework. The model is then applied to study the infection transmission within airplanes and the transmission of the Ebola virus through casual contacts. Drastic limitations on air-travel during epidemics, such as during the 2014 Ebola outbreak in West Africa, carry considerable economic and human costs. We use the computational model to evaluate the effects of passenger movement within airplanes and air-travel policies on the geospatial spread of infectious diseases. We find that boarding policy by an airline is more critical for infection propagation compared to deplaning policy. Enplaning in two sections resulted in fewer infections than the currently followed strategy with multiple zones. In addition, we found that small commercial airplanes are better than larger ones at reducing the number of new infections in a flight. Aggregated results indicate that passenger movement strategies and airplane size predicted through these network models can have significant impact on an event like the 2014 Ebola epidemic. The methodology developed here is generic and can be readily modified to incorporate the impact from the outbreak of other directly transmitted infectious diseases. 
    more » « less