Title: An open-access database of infectious disease transmission trees to explore superspreader epidemiology
Historically, emerging and reemerging infectious diseases have caused large, deadly, and expensive multinational outbreaks. Often outbreak investigations aim to identify who infected whom by reconstructing the outbreak transmission tree, which visualizes transmission between individuals as a network with nodes representing individuals and branches representing transmission from person to person. We compiled a database, called OutbreakTrees, of 382 published, standardized transmission trees consisting of 16 directly transmitted diseases ranging in size from 2 to 286 cases. For each tree and disease, we calculated several key statistics, such as tree size, average number of secondary infections, the dispersion parameter, and the proportion of cases considered superspreaders, and examined how these statistics varied over the course of each outbreak and under different assumptions about the completeness of outbreak investigations. We demonstrated the potential utility of the database through 2 short analyses addressing questions about superspreader epidemiology for a variety of diseases, including Coronavirus Disease 2019 (COVID-19). First, we found that our transmission trees were consistent with theory predicting that intermediate dispersion parameters give rise to the highest proportion of cases causing superspreading events. Additionally, we investigated patterns in how superspreaders are infected. Across trees with more than 1 superspreader, we found preliminary support for the theory that superspreaders generate other superspreaders. In sum, our findings put the role of superspreading in COVID-19 transmission in perspective with that of other diseases and suggest an approach to further research regarding the generation of superspreaders. These data have been made openly available to encourage reuse and further scientific inquiry.
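As a rough illustration of the per-tree statistics named in the abstract, the sketch below computes tree size, the mean number of secondary infections, a dispersion parameter, and the proportion of superspreaders from a list of who-infected-whom pairs. It is not the authors' code. The edge-list input, the method-of-moments estimator for the dispersion parameter, the superspreader rule (more secondary infections than the 99th percentile of a Poisson distribution with the same mean), and all function names are assumptions made for this example.

```python
import math
from collections import Counter
from statistics import mean, variance


def offspring_counts(edges):
    """Secondary infections per case, including cases that infected no one."""
    cases = {case for edge in edges for case in edge}
    caused = Counter(infector for infector, _ in edges)
    return {case: caused.get(case, 0) for case in cases}


def poisson_quantile(lam, q=0.99):
    """Smallest n such that P(X <= n) >= q for X ~ Poisson(lam)."""
    n = 0
    pmf = math.exp(-lam)
    cdf = pmf
    while cdf < q:
        n += 1
        pmf *= lam / n
        cdf += pmf
    return n


def tree_statistics(edges):
    """Summary statistics for one tree given (infector, infectee) pairs."""
    counts = offspring_counts(edges)
    z = list(counts.values())
    m = mean(z)
    v = variance(z)
    # Assumed estimator: method of moments for a negative binomial,
    # variance = m + m^2 / k. No overdispersion is reported as k = inf.
    k = m * m / (v - m) if v > m else math.inf
    # Assumed rule: superspreaders exceed the Poisson(m) 99th percentile.
    threshold = poisson_quantile(m)
    supers = sorted(case for case, n in counts.items() if n > threshold)
    return {
        "size": len(counts),
        "mean_secondary": m,
        "dispersion_k": k,
        "superspreaders": supers,
        "prop_superspreaders": len(supers) / len(counts),
    }


# Hypothetical nine-case tree: A infects six people, two of whom infect one each.
example_edges = [
    ("A", "B"), ("A", "C"), ("A", "D"), ("A", "E"),
    ("A", "F"), ("A", "G"), ("B", "H"), ("G", "I"),
]
example_stats = tree_statistics(example_edges)
```

In a fully resolved tree with n cases there are n - 1 transmission links, so the mean computed this way is always (n - 1)/n. Any real analysis would need to account for that, for instance by considering how complete the outbreak investigation was.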
Award ID(s):
1659683
PAR ID:
10406845
Author(s) / Creator(s):
; ;
Editor(s):
Riley, Steven
Date Published:
Journal Name:
PLOS Biology
Volume:
20
Issue:
6
ISSN:
1545-7885
Page Range / eLocation ID:
e3001685
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Since the emergence of coronavirus disease 2019 (COVID-19), unprecedented movement restrictions and social distancing measures have been implemented worldwide. The socioeconomic repercussions have fueled calls to lift these measures. In the absence of population-wide restrictions, isolation of infected individuals is key to curtailing transmission. However, the effectiveness of symptom-based isolation in preventing a resurgence depends on the extent of presymptomatic and asymptomatic transmission. We evaluate the contribution of presymptomatic and asymptomatic transmission based on recent individual-level data regarding infectiousness prior to symptom onset and the asymptomatic proportion among all infections. We found that the majority of incidences may be attributable to silent transmission from a combination of the presymptomatic stage and asymptomatic infections. Consequently, even if all symptomatic cases are isolated, a vast outbreak may nonetheless unfold. We further quantified the effect of isolating silent infections in addition to symptomatic cases, finding that over one-third of silent infections must be isolated to suppress a future outbreak below 1% of the population. Our results indicate that symptom-based isolation must be supplemented by rapid contact tracing and testing that identifies asymptomatic and presymptomatic cases, in order to safely lift current restrictions and minimize the risk of resurgence.
  2. In the wake of community coronavirus disease 2019 (COVID-19) transmission in the United States, there is a growing public health concern regarding the adequacy of resources to treat infected cases. Hospital beds, intensive care units (ICUs), and ventilators are vital for the treatment of patients with severe illness. To project the timing of the outbreak peak and the number of ICU beds required at peak, we simulated a COVID-19 outbreak parameterized with the US population demographics. In scenario analyses, we varied the delay from symptom onset to self-isolation, the proportion of symptomatic individuals practicing self-isolation, and the basic reproduction number R0. Without self-isolation, when R0 = 2.5, treatment of critically ill individuals at the outbreak peak would require 3.8 times more ICU beds than exist in the United States. Self-isolation by 20% of cases 24 h after symptom onset would delay and flatten the outbreak trajectory, reducing the number of ICU beds needed at the peak by 48.4% (interquartile range 46.4–50.3%), although still exceeding existing capacity. When R0 = 2, twice as many ICU beds would be required at the peak of outbreak in the absence of self-isolation. In this scenario, the proportional impact of self-isolation within 24 h on reducing the peak number of ICU beds is substantially higher at 73.5% (interquartile range 71.4–75.3%). Our estimates underscore the inadequacy of critical care capacity to handle the burgeoning outbreak. Policies that encourage self-isolation, such as paid sick leave, may delay the epidemic peak, giving a window of time that could facilitate emergency mobilization to expand hospital capacity.
  3. An open question in epidemiology is why transmission is often overdispersed, meaning that most new infections are driven by few infected individuals. For example, around 10% of COVID-19 cases cause 80% of new COVID-19 cases. This overdispersion in parasite transmission is likely driven by intrinsic heterogeneity among hosts, i.e. variable SARS-CoV-2 viral loads. However, host heterogeneity could also indirectly increase transmission dispersion by driving parasite adaptation. Specifically, transmission variation among hosts could drive parasite specialization to highly infectious hosts. Adaptation to rare, highly infectious hosts could amplify transmission dispersion by simultaneously decreasing transmission from common, less infectious hosts. This study considers whether increased transmission dispersion can be, in part, an emergent property of parasite adaptation to heterogeneous host populations. We develop a mathematical model using a Price equation framework to address this question that follows the epidemiological and evolutionary dynamics of a general host–parasite system. The results predict that parasite adaptation to heterogeneous host populations drives high transmission dispersion early in epidemics. Furthermore, parasite adaptation can maintain increased transmission dispersion at endemic equilibria if virulence differs between hosts in a heterogeneous population. More broadly, this study provides a framework for predicting how parasite adaptation determines transmission dispersion for emerging and re-emerging infectious diseases. 
  4. Abstract Background The COVID-19 outbreak in Wuhan started in December 2019 and was under control by the end of March 2020, with a total of 50,006 confirmed cases, through the implementation of a series of nonpharmaceutical interventions (NPIs), including an unprecedented lockdown of the city. This study analyzes the complete outbreak data from Wuhan, assesses the impact of these public health interventions, and estimates the asymptomatic, undetected and total cases for the COVID-19 outbreak in Wuhan. Methods By taking different stages of the outbreak into account, we developed a time-dependent compartmental model to describe the dynamics of disease transmission and case detection and reporting. Model coefficients were parameterized by using the reported cases and following key events and escalated control strategies. Then the model was used to calibrate the complete outbreak data by using the Markov chain Monte Carlo (MCMC) method. Finally, we used the model to estimate asymptomatic and undetected cases and approximate the overall antibody prevalence level. Results We found that the transmission rate between Jan 24 and Feb 1, 2020, was twice as large as that before the lockdown on Jan 23, and 67.6% (95% CI [0.584, 0.759]) of detectable infections occurred during this period. Based on the reported estimates that around 20% of infections were asymptomatic and their transmission ability was about 70% of symptomatic ones, we estimated that there were about 14,448 asymptomatic and undetected cases (95% CI [12,364, 23,254]), which yields an estimate of a total of 64,454 infected cases (95% CI [62,370, 73,260]), and the overall antibody prevalence level in the population of Wuhan was 0.745% (95% CI [0.693%, 0.814%]) by March 31, 2020.
     Conclusions We conclude that the control of the COVID-19 outbreak in Wuhan was achieved via the enforcement of a combination of multiple NPIs: the lockdown on Jan 23, the stay-at-home order on Feb 2, the massive isolation of all symptomatic individuals via newly constructed special shelter hospitals on Feb 6, and the large-scale screening process on Feb 18. Our results indicate that the population in Wuhan is far away from establishing herd immunity and provide insights for other affected countries and regions in designing control strategies and planning vaccination programs.
  5. Abstract Despite a number of successful approaches in predicting the spatiotemporal patterns of the novel coronavirus (COVID-19) pandemic and quantifying the effectiveness of non-pharmaceutical interventions starting from data about the initial outbreak location, we lack an intrinsic understanding as outbreak locations shift and evolve. Here, we fill this gap by developing a country distance approach to capture the pandemic’s propagation backbone tree from a complex airline network with multiple and evolving outbreak locations. We apply this approach, which is analogous to the effective resistance in series and parallel circuits, to examine countries’ closeness regarding disease spreading and evaluate the effectiveness of travel restrictions on delaying infections. In particular, we find that 63.2% of travel restrictions implemented as of 1 June 2020 are ineffective. The remaining percentage postponed the disease arrival time by 18.56 days per geographical area and resulted in a total reduction of 13,186,045 infected cases. Our approach enables us to design optimized and coordinated travel restrictions to extend the delay in arrival time and further reduce infected cases while preserving air travel.
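The overdispersion described in item 3, where a small fraction of cases accounts for most new infections, can be explored with a small simulation. The sketch below is not taken from any of the listed studies. It assumes secondary infections follow a negative binomial distribution, sampled as a gamma-Poisson mixture, and the reproduction number of 2.5, the dispersion values 0.1 and 10, and all function names are illustrative assumptions.

```python
import math
import random


def sample_poisson(lam, rng):
    """Knuth's multiplication method; adequate for the modest means used here."""
    limit = math.exp(-lam)
    n = 0
    p = rng.random()
    while p > limit:
        n += 1
        p *= rng.random()
    return n


def simulate_offspring(r, k, n_cases, seed=0):
    """Draw secondary-infection counts with mean r and dispersion k.

    Each case gets an individual reproduction number from a gamma
    distribution (shape k, scale r / k, so the mean is r), then a
    Poisson count with that rate. Smaller k means more heterogeneity.
    """
    rng = random.Random(seed)
    return [
        sample_poisson(rng.gammavariate(k, r / k), rng)
        for _ in range(n_cases)
    ]


def share_from_top(counts, fraction=0.1):
    """Fraction of all secondary infections caused by the top `fraction` of cases."""
    ordered = sorted(counts, reverse=True)
    top_n = max(1, round(fraction * len(ordered)))
    total = sum(ordered)
    return sum(ordered[:top_n]) / total if total else 0.0


# Illustrative values only: r = 2.5 with strong (k = 0.1) and weak (k = 10) dispersion.
overdispersed_share = share_from_top(simulate_offspring(2.5, 0.1, 5000))
homogeneous_share = share_from_top(simulate_offspring(2.5, 10.0, 5000))
```

Comparing `overdispersed_share` with `homogeneous_share` shows how strongly the dispersion parameter concentrates transmission in the most infectious cases.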