skip to main content


Title: Network Modeling and Analysis of COVID-19 Testing Strategies
The COVID-19 preparedness plans by the Centers for Disease Control and Prevention strongly underscores the need for efficient and effective testing strategies. This, in turn, calls upon the design and development of statistical sampling and testing of COVID-19 strategies. However, the evaluation of operational details requires a detailed representation of human behaviors in epidemic simulation models. Traditional epidemic simulations are mainly based upon system dynamic models, which use differential equations to study macro-level and aggregated behaviors of population subgroups. As such, individual behaviors (e.g., personal protection, commute conditions, social patterns) can’t be adequately modeled and tracked for the evaluation of health policies and action strategies. Therefore, this paper presents a network-based simulation model to optimize COVID-19 testing strategies for effective identifications of virus carriers in a spatial area. Specifically, we design a data-driven risk scoring system for statistical sampling and testing of COVID-19. This system collects real-time data from simulated networked behaviors of individuals in the spatial network to support decision-making during the virus spread process. Experimental results showed that this framework has superior performance in optimizing COVID-19 testing decisions and effectively identifying virus carriers from the population.  more » « less
Award ID(s):
2026875
NSF-PAR ID:
10317298
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract This project is funded by the US National Science Foundation (NSF) through their NSF RAPID program under the title “Modeling Corona Spread Using Big Data Analytics.” The project is a joint effort between the Department of Computer & Electrical Engineering and Computer Science at FAU and a research group from LexisNexis Risk Solutions. The novel coronavirus Covid-19 originated in China in early December 2019 and has rapidly spread to many countries around the globe, with the number of confirmed cases increasing every day. Covid-19 is officially a pandemic. It is a novel infection with serious clinical manifestations, including death, and it has reached at least 124 countries and territories. Although the ultimate course and impact of Covid-19 are uncertain, it is not merely possible but likely that the disease will produce enough severe illness to overwhelm the worldwide health care infrastructure. Emerging viral pandemics can place extraordinary and sustained demands on public health and health systems and on providers of essential community services. Modeling the Covid-19 pandemic spread is challenging. But there are data that can be used to project resource demands. Estimates of the reproductive number (R) of SARS-CoV-2 show that at the beginning of the epidemic, each infected person spreads the virus to at least two others, on average (Emanuel et al. in N Engl J Med. 2020, Livingston and Bucher in JAMA 323(14):1335, 2020). A conservatively low estimate is that 5 % of the population could become infected within 3 months. Preliminary data from China and Italy regarding the distribution of case severity and fatality vary widely (Wu and McGoogan in JAMA 323(13):1239–42, 2020). A recent large-scale analysis from China suggests that 80 % of those infected either are asymptomatic or have mild symptoms; a finding that implies that demand for advanced medical services might apply to only 20 % of the total infected. Of patients infected with Covid-19, about 15 % have severe illness and 5 % have critical illness (Emanuel et al. in N Engl J Med. 2020). Overall, mortality ranges from 0.25 % to as high as 3.0 % (Emanuel et al. in N Engl J Med. 2020, Wilson et al. in Emerg Infect Dis 26(6):1339, 2020). Case fatality rates are much higher for vulnerable populations, such as persons over the age of 80 years (> 14 %) and those with coexisting conditions (10 % for those with cardiovascular disease and 7 % for those with diabetes) (Emanuel et al. in N Engl J Med. 2020). Overall, Covid-19 is substantially deadlier than seasonal influenza, which has a mortality of roughly 0.1 %. Public health efforts depend heavily on predicting how diseases such as those caused by Covid-19 spread across the globe. During the early days of a new outbreak, when reliable data are still scarce, researchers turn to mathematical models that can predict where people who could be infected are going and how likely they are to bring the disease with them. These computational methods use known statistical equations that calculate the probability of individuals transmitting the illness. Modern computational power allows these models to quickly incorporate multiple inputs, such as a given disease’s ability to pass from person to person and the movement patterns of potentially infected people traveling by air and land. This process sometimes involves making assumptions about unknown factors, such as an individual’s exact travel pattern. By plugging in different possible versions of each input, however, researchers can update the models as new information becomes available and compare their results to observed patterns for the illness. In this paper we describe the development a model of Corona spread by using innovative big data analytics techniques and tools. We leveraged our experience from research in modeling Ebola spread (Shaw et al. Modeling Ebola Spread and Using HPCC/KEL System. In: Big Data Technologies and Applications 2016 (pp. 347-385). Springer, Cham) to successfully model Corona spread, we will obtain new results, and help in reducing the number of Corona patients. We closely collaborated with LexisNexis, which is a leading US data analytics company and a member of our NSF I/UCRC for Advanced Knowledge Enablement. The lack of a comprehensive view and informative analysis of the status of the pandemic can also cause panic and instability within society. Our work proposes the HPCC Systems Covid-19 tracker, which provides a multi-level view of the pandemic with the informative virus spreading indicators in a timely manner. The system embeds a classical epidemiological model known as SIR and spreading indicators based on causal model. The data solution of the tracker is built on top of the Big Data processing platform HPCC Systems, from ingesting and tracking of various data sources to fast delivery of the data to the public. The HPCC Systems Covid-19 tracker presents the Covid-19 data on a daily, weekly, and cumulative basis up to global-level and down to the county-level. It also provides statistical analysis for each level such as new cases per 100,000 population. The primary analysis such as Contagion Risk and Infection State is based on causal model with a seven-day sliding window. Our work has been released as a publicly available website to the world and attracted a great volume of traffic. The project is open-sourced and available on GitHub. The system was developed on the LexisNexis HPCC Systems, which is briefly described in the paper. 
    more » « less
  2. null (Ed.)
    The coronavirus disease 2019 (COVID-19) pandemic caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has placed epidemic modeling at the center of attention of public policymaking. Predicting the severity and speed of transmission of COVID-19 is crucial to resource management and developing strategies to deal with this epidemic. Based on the available data from current and previous outbreaks, many efforts have been made to develop epidemiological models, including statistical models, computer simulations, mathematical representations of the virus and its impacts, and many more. Despite their usefulness, modeling and forecasting the spread of COVID-19 remains a challenge. In this article, we give an overview of the unique features and issues of COVID-19 data and how they impact epidemic modeling and projection. In addition, we illustrate how various models could be connected to each other. Moreover, we provide new data science perspectives on the challenges of COVID-19 forecasting, from data collection, curation, and validation to the limitations of models, as well as the uncertainty of the forecast. Finally, we discuss some data science practices that are crucial to more robust and accurate epidemic forecasting. 
    more » « less
  3. Since the pandemic of COVID-19 began in January 2020, the world has witnessed drastic social-economic changes. To harness the virus spread, several studies have been done to study contributing factors that are pertinent to COVID-19 transmission risks. However, little has been done to investigate how human activities on the spatial network are correlated to the virus transmission and spread. This paper performs a statistical analysis to examine interrelationships between spatial network characteristics and cumulative cases of COVID-19 in US counties. Specifically, both county-level transportation profiles (e.g., the total number of commute workers, route miles of freight railroad) and road network characteristics of US counties are considered. Then, the lasso regression model is utilized to identify a sparse set of significant variables that are sensitive to the response variable of COVID-19 cases. Finally, the fixed-effect model is built to capture the relationship between the selected set of predictors and the response variable. This work helps identify and determine salient features from spatial network characteristics and transportation profiles, thereby improving the understanding of COVID-19 spread dynamics. These significant variables can also be utilized to develop simulation models for the prediction of real-time positions of virus spread and the optimization of intervention strategies. 
    more » « less
  4. The objective is to understand the role of emerging variants, vaccination, and NPI policies on COVID-19 infections and deaths. We aim to identify scenarios in which COVID-19 can be managed such that the death rate from COVID-19 becomes comparable with the combined annual mortality rate from influenza and pneumonia. As a case study for a large urban area, we simulate COVID-19 transmission in King County, Washington, (greater Seattle) using an agent- based simulation model. Calibrated to local epidemiological data, our study uses detailed synthetic population data and includes interactions between vaccination and specific NPIs while considering waning immunity and emergence of variants. Virus mutation scenarios include 12 combinations of infectivity, disease severity, and immune evasiveness. A highly effective pancoronavirus vaccine that works against all strains is considered an optimistic scenario. Our findings highlight the potential benefits of pancoronavirus vaccines that offer enhanced and longer-lasting immunity. We emphasize the crucial role of nonpharmaceutical interventions in reducing COVID-19 deaths regardless of virus mutation scenarios. Owing to highly immune evasive and contagious SARS-CoV-2 variants, most scenarios in this study fail to reduce the mortality of COVID-19 to the level of influenza and pneumonia. However, our findings indicate that periodic vaccinations and a threshold nonpharmaceutical intervention policy may succeed in achieving this goal. This indicates the need for caution and vigilance in managing a continuing COVID-19 epidemic. 
    more » « less
  5. Borri, Alessandro (Ed.)
    Ever since the outbreak of the COVID-19 epidemic, various public health control strategies have been proposed and tested against the coronavirus SARS-CoV-2. We study three specific COVID-19 epidemic control models: the susceptible, exposed, infectious, recovered (SEIR) model with vaccination control; the SEIR model with shield immunity control; and the susceptible, un-quarantined infected, quarantined infected, confirmed infected (SUQC) model with quarantine control. We express the control requirement in metric temporal logic (MTL) formulas (a type of formal specification languages) which can specify the expected control outcomes such as “ the deaths from the infection should never exceed one thousand per day within the next three months ” or “ the population immune from the disease should eventually exceed 200 thousand within the next 100 to 120 days ”. We then develop methods for synthesizing control strategies with MTL specifications. To the best of our knowledge, this is the first paper to systematically synthesize control strategies based on the COVID-19 epidemic models with formal specifications. We provide simulation results in three different case studies: vaccination control for the COVID-19 epidemic with model parameters estimated from data in Lombardy, Italy; shield immunity control for the COVID-19 epidemic with model parameters estimated from data in Lombardy, Italy; and quarantine control for the COVID-19 epidemic with model parameters estimated from data in Wuhan, China. The results show that the proposed synthesis approach can generate control inputs such that the time-varying numbers of individuals in each category (e.g., infectious, immune) satisfy the MTL specifications. The results also show that early intervention is essential in mitigating the spread of COVID-19, and more control effort is needed for more stringent MTL specifications. For example, based on the model in Lombardy, Italy, achieving less than 100 deaths per day and 10000 total deaths within 100 days requires 441.7% more vaccination control effort than achieving less than 1000 deaths per day and 50000 total deaths within 100 days. 
    more » « less