skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on October 1, 2026

Title: Estimating the infection fatality rate of emerging diseases using a regression approach applied to global COVID-19 cases
Background: Estimating the infection fatality rate (IFR) for emerging diseases is elusive due to the presence of asymptomatic or mildly symptomatic infections and variable testing capacity. IFR estimates are also affected by region-specific differences in sampling regimes, demographics, and healthcare resources. Methods: Here we present a novel regression approach using population testing and readily available case fatality rates (CFR) to estimate the IFR during an outbreak. The approach is based on few assumptions and can be used for a wide range of emerging diseases. We validate the use of the method using commonly reported COVID-19 testing data. Results: Our new statistical approach reveals a conservative global IFR of 0.90 % (CI: 0.70 %, 1.16 %) for COVID-19 across the 139 countries affected before May 2020. Deviation of countries’ reported CFR from the estimator did not correlate with demography, per capita GDP, or healthcare access and quality, suggesting variation is due to differing testing regimes or reporting guidelines by country. Conclusions: This method can be used retrospectively or for future disease outbreaks when other data are limited.  more » « less
Award ID(s):
2011179
PAR ID:
10626607
Author(s) / Creator(s):
;
Publisher / Repository:
Elsevier
Date Published:
Journal Name:
Journal of Infection and Public Health
Volume:
18
Issue:
10
ISSN:
1876-0341
Page Range / eLocation ID:
102856
Subject(s) / Keyword(s):
Asymptomatic Linear regression Pandemic SARS-COV-2 Seroprevalence Testing capacity
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Introduction: Coronavirus disease 2019 (COVID-19) has had a profound impact globally, causing the death of millions of people and deeply affecting socio-psychological, human health, and economic systems, with some nations bearing a disproportionate burden. Despite obesity having been established as one of the major risk factors of COVID-19 severity and other degenerative diseases, the effects that dietary pattern intake plays in COVID-19 outcomes remain poorly understood. The goal of this study is to look into the connection between eating habits, the number of non-obese and obese people, and COVID-19 outcomes in countries with populations exhibiting normal Body Mass Index (BMI), which is an indicator of obesity. Methods: The analysis includes data from 170 countries. From the 170 countries, we focused on 53 nations where the average, BMI falls within the normal range (18.5 to 24.9). A subset of 20 nations was selected for a more detailed examination, comprising 10 nations with the lowest BMI values within the normal range (18.5-19.8) and 10 nations with the highest BMI values within the normal range (23.5-24.9). We used Artificial Intelligence (AI) and Machine Learning (ML) applications to evaluate key metrics, including dietary patterns (sugar and vegetable intake), obesity prevalence, incidence rate, mortality rate, and Case Fatality Rate (CFR). Results: The results demonstrate a significant correlation between higher obesity prevalence and increased COVID-19 severity, evidenced by elevated incidence, mortality, and CFRs in countries like North Macedonia and Italy. In contrast, nations such as Iceland and New Zealand with well-established healthcare systems revealed low mortality rate and case fatality rate despite variations in dietary habits. The study also revealed that vegetable consumption appears to provide a slight to significant protective effects, suggesting that dietary patters alone do not consistently predict COVID-19 Outcomes. Conclusion: Data generated from this study showed the crucial role of healthcare infrastructure along with the testing capacity and data reporting in influencing the success of pandemic responses. It also highlights the need of integrating public health strategies, which focus on obesity management and improvement of healthcare preparedness. In addition, AI-driven predictive modeling offers valuable insights that may guide pandemic response efforts in the future, thereby enhancing global health crisis management and mitigating the impact of future health emergencies. Keywords: COVID-19; Dietary patterns; Obesity; Artificial intelligence; Machine learning; Public health; Health care systems 
    more » « less
  2. Introduction: Coronavirus disease 2019 (COVID-19) has had a profound impact globally, causing the death of millions of people and deeply affecting socio-psychological, human health, and economic systems, with some nations bearing a disproportionate burden. Despite obesity having been established as one of the major risk factors of COVID-19 severity and other degenerative diseases, the effects that dietary pattern plays in COVID-19 outcomes remain poorly understood. The goal of this study is to look into the connection between eating habits, the number of non-obese and obese people, and COVID-19 outcomes in countries with populations exhibiting normal Body Mass Index (BMI), which is an indicator of obesity. Methods: The analysis includes data from 170 countries. From the 170 countries, we focused on 53 nations where the average, BMI falls within the normal range (18.5 to 24.9). A subset of 20 nations was selected for a more detailed examination, comprising 10 nations with the lowest BMI values within the normal range (18.5-19.8) and 10 nations with the highest BMI values within the normal range (23.5-24.9). We used Artificial Intelligence (AI) and Machine Learning (ML) applications to evaluate key metrics, including dietary patterns (sugar and vegetable intake), obesity prevalence, incidence rate, mortality rate, and Case Fatality Rate (CFR). Results: The results demonstrate a significant correlation between higher obesity prevalence and increased COVID-19 severity, evidenced by elevated incidence, mortality, and CFRs in countries like North Macedonia and Italy. In contrast, nations such as Iceland and New Zealand with well-established healthcare systems revealed low mortality rates and case fatality rates despite variations in dietary habits. The study also revealed that vegetable consumption appears to provide a slight to significant protective effect, suggesting that dietary patterns alone do not consistently predict COVID-19 Outcomes. Conclusion: Data generated from this study showed the crucial role of healthcare infrastructure along with the testing capacity and data reporting in influencing the success of pandemic responses. It also highlights the need to integrate public health strategies, which focus on obesity management and improvement of healthcare preparedness. In addition, AI-driven predictive modeling offers valuable insights that may guide pandemic response efforts in the future, thereby enhancing global health crisis management and mitigating the impact of future health emergencies. 
    more » « less
  3. This study addresses COVID-19 testing as a nonlinear sampling problem, aiming to uncover the dependence of the true infection count in the population on COVID-19 testing metrics such as testing volume and positivity rates. Employing an artificial neural network, we explore the relationship among daily confirmed case counts, testing data, population statistics, and the actual daily case count. The trained artificial neural network undergoes testing in in-sample, out-of-sample, and several hypothetical scenarios. A substantial focus of this paper lies in the estimation of the daily true case count, which serves as the output set of our training process. To achieve this, we implement a regularized backcasting technique that utilizes death counts and the infection fatality ratio (IFR), as the death statistics and serological surveys (providing the IFR) as more reliable COVID-19 data sources. Addressing the impact of factors such as age distribution, vaccination, and emerging variants on the IFR time series is a pivotal aspect of our analysis. We expect our study to enhance our understanding of the genuine implications of the COVID-19 pandemic, subsequently benefiting mitigation strategies. 
    more » « less
  4. The United States struggled exceptionally during the COVID-19 pandemic. For researchers and policymakers, it is of great interest to understand the risk factors associated with COVID-19 when examining data aggregated at a regional level. We examined the county-level association between the reported COVID-19 case fatality rate (CFR) and various demographic, socioeconomic and health factors in two hard-hit US states: New York and Florida. In particular, we examined the changes over time in the association patterns. For each state, we divided the data into three seasonal phases based on observed waves of the COVID-19 outbreak. For each phase, we used tests of correlations to explore the marginal association between each potential covariate and the reported CFR. We used graphical models to further clarify direct or indirect associations in a multivariate setting. We found that during the early phase of the pandemic, the association patterns were complex: the reported CFRs were high, with great variation among counties. As pandemics progressed, especially during the winter phase, socioeconomic factors such as median household income and health-related factors such as the prevalence of adult smokers and mortality rate of respiratory diseases became more significantly associated with the CFR. It is remarkable that common risk factors were identified for both states. 
    more » « less
  5. Yang, Junyuan (Ed.)
    The ongoing COVID-19 pandemic has killed at least 1.1 million people in the United States and over 6.7 million globally. Accurately estimating the age-specific infection fatality rate (IFR) of SARS-CoV-2 for different populations is crucial for assessing and understanding the impact of COVID-19 and for appropriately allocating vaccines and treatments to at-risk groups. We estimated age-specific IFRs of wild-type SARS-CoV-2 using published seroprevalence, case, and death data from New York City (NYC) from March to May 2020, using a Bayesian framework that accounted for delays between key epidemiological events. IFRs increased 3-4-fold with every 20 years of age, from 0.06% in individuals between 18–45 years old to 4.7% in individuals over 75. We then compared IFRs in NYC to several city- and country-wide estimates including England, Switzerland (Geneva), Sweden (Stockholm), Belgium, Mexico, and Brazil, as well as a global estimate. IFRs in NYC were higher for individuals younger than 65 years old than most other populations, but similar for older individuals. IFRs for age groups less than 65 decreased with income and increased with income inequality measured using the Gini index. These results demonstrate that the age-specific fatality of COVID-19 differs among developed countries and raises questions about factors underlying these differences, including underlying health conditions and healthcare access. 
    more » « less