skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Routine Laboratory Blood Tests Predict SARS-CoV-2 Infection Using Machine Learning
Abstract Background Accurate diagnostic strategies to identify SARS-CoV-2 positive individuals rapidly for management of patient care and protection of health care personnel are urgently needed. The predominant diagnostic test is viral RNA detection by RT-PCR from nasopharyngeal swabs specimens, however the results are not promptly obtainable in all patient care locations. Routine laboratory testing, in contrast, is readily available with a turn-around time (TAT) usually within 1-2 hours. Method We developed a machine learning model incorporating patient demographic features (age, sex, race) with 27 routine laboratory tests to predict an individual’s SARS-CoV-2 infection status. Laboratory testing results obtained within 2 days before the release of SARS-CoV-2 RT-PCR result were used to train a gradient boosting decision tree (GBDT) model from 3,356 SARS-CoV-2 RT-PCR tested patients (1,402 positive and 1,954 negative) evaluated at a metropolitan hospital. Results The model achieved an area under the receiver operating characteristic curve (AUC) of 0.854 (95% CI: 0.829-0.878). Application of this model to an independent patient dataset from a separate hospital resulted in a comparable AUC (0.838), validating the generalization of its use. Moreover, our model predicted initial SARS-CoV-2 RT-PCR positivity in 66% individuals whose RT-PCR result changed from negative to positive within 2 days. Conclusion This model employing routine laboratory test results offers opportunities for early and rapid identification of high-risk SARS-CoV-2 infected patients before their RT-PCR results are available. It may play an important role in assisting the identification of SARS-CoV-2 infected patients in areas where RT-PCR testing is not accessible due to financial or supply constraints.  more » « less
Award ID(s):
1716432 1750326
PAR ID:
10201771
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Clinical Chemistry
Volume:
66
Issue:
11
ISSN:
0009-9147
Page Range / eLocation ID:
1396 to 1404
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Background . New York City (NYC) experienced an initial surge and gradual decline in the number of SARS-CoV-2-confirmed cases in 2020. A change in the pattern of laboratory test results in COVID-19 patients over this time has not been reported or correlated with patient outcome. Methods . We performed a retrospective study of routine laboratory and SARS-CoV-2 RT-PCR test results from 5,785 patients evaluated in a NYC hospital emergency department from March to June employing machine learning analysis. Results . A COVID-19 high-risk laboratory test result profile (COVID19-HRP), consisting of 21 routine blood tests, was identified to characterize the SARS-CoV-2 patients. Approximately half of the SARS-CoV-2 positive patients had the distinct COVID19-HRP that separated them from SARS-CoV-2 negative patients. SARS-CoV-2 patients with the COVID19-HRP had higher SARS-CoV-2 viral loads, determined by cycle threshold values from the RT-PCR, and poorer clinical outcome compared to other positive patients without the COVID12-HRP. Furthermore, the percentage of SARS-CoV-2 patients with the COVID19-HRP has significantly decreased from March/April to May/June. Notably, viral load in the SARS-CoV-2 patients declined, and their laboratory profile became less distinguishable from SARS-CoV-2 negative patients in the later phase. Conclusions . Our longitudinal analysis illustrates the temporal change of laboratory test result profile in SARS-CoV-2 patients and the COVID-19 evolvement in a US epicenter. This analysis could become an important tool in COVID-19 population disease severity tracking and prediction. In addition, this analysis may play an important role in prioritizing high-risk patients, assisting in patient triaging and optimizing the usage of resources. 
    more » « less
  2. Faeder, James R. (Ed.)
    The rapid spread of SARS-CoV-2 has placed a significant burden on public health systems to provide swift and accurate diagnostic testing highlighting the critical need for innovative testing approaches for future pandemics. In this study, we present a novel sample pooling procedure based on compressed sensing theory to accurately identify virally infected patients at high prevalence rates utilizing an innovative viral RNA extraction process to minimize sample dilution. At prevalence rates ranging from 0–14.3%, the number of tests required to identify the infection status of all patients was reduced by 69.26% as compared to conventional testing in primary human SARS-CoV-2 nasopharyngeal swabs and a coronavirus model system. Our method provided quantification of individual sample viral load within a pool as well as a binary positive-negative result. Additionally, our modified pooling and RNA extraction process minimized sample dilution which remained constant as pool sizes increased. Compressed sensing can be adapted to a wide variety of diagnostic testing applications to increase throughput for routine laboratory testing as well as a means to increase testing capacity to combat future pandemics. 
    more » « less
  3. null (Ed.)
    Background Conventional diagnosis of COVID-19 with reverse transcription polymerase chain reaction (RT-PCR) testing (hereafter, PCR) is associated with prolonged time to diagnosis and significant costs to run the test. The SARS-CoV-2 virus might lead to characteristic patterns in the results of widely available, routine blood tests that could be identified with machine learning methodologies. Machine learning modalities integrating findings from these common laboratory test results might accelerate ruling out COVID-19 in emergency department patients. Objective We sought to develop (ie, train and internally validate with cross-validation techniques) and externally validate a machine learning model to rule out COVID 19 using only routine blood tests among adults in emergency departments. Methods Using clinical data from emergency departments (EDs) from 66 US hospitals before the pandemic (before the end of December 2019) or during the pandemic (March-July 2020), we included patients aged ≥20 years in the study time frame. We excluded those with missing laboratory results. Model training used 2183 PCR-confirmed cases from 43 hospitals during the pandemic; negative controls were 10,000 prepandemic patients from the same hospitals. External validation used 23 hospitals with 1020 PCR-confirmed cases and 171,734 prepandemic negative controls. The main outcome was COVID 19 status predicted using same-day routine laboratory results. Model performance was assessed with area under the receiver operating characteristic (AUROC) curve as well as sensitivity, specificity, and negative predictive value (NPV). Results Of 192,779 patients included in the training, external validation, and sensitivity data sets (median age decile 50 [IQR 30-60] years, 40.5% male [78,249/192,779]), AUROC for training and external validation was 0.91 (95% CI 0.90-0.92). Using a risk score cutoff of 1.0 (out of 100) in the external validation data set, the model achieved sensitivity of 95.9% and specificity of 41.7%; with a cutoff of 2.0, sensitivity was 92.6% and specificity was 59.9%. At the cutoff of 2.0, the NPVs at a prevalence of 1%, 10%, and 20% were 99.9%, 98.6%, and 97%, respectively. Conclusions A machine learning model developed with multicenter clinical data integrating commonly collected ED laboratory data demonstrated high rule-out accuracy for COVID-19 status, and might inform selective use of PCR-based testing. 
    more » « less
  4. Rapid and ultrasensitive point-of-care RNA detection plays a critical role in the diagnosis and management of various infectious diseases. The gold-standard detection method of reverse transcription-quantitative polymerase chain reaction (RT-qPCR) is ultrasensitive and accurate yet limited by the lengthy turnaround time (1-2 days). On the other hand, antigen test offers rapid at-home detection (15-20 min) but suffers from low sensitivity and high false-negative rates. An ideal point-of-care diagnostic device would combine the merits of PCR-level sensitivity and rapid sample-to-result workflow comparable to antigen testing. However, the existing RNA detection platform typically possesses superior sensitivity or rapid sample-to-result time, but not both. This paper reports a point-of-care microfluidic device that offers ultrasensitive yet rapid detection of viral RNA from clinical samples. The device consists of a microfluidic chip for precisely manipulating small volumes of samples, a miniaturized heater for viral lysis and ribonuclease (RNase) inactivation, a CRISPR Cas13a- electrochemical sensor for target preamplification-free and ultrasensitive RNA detection, and a smartphone-compatible potentiostat for data acquisition. As demonstrations, the devices achieve the detection of heat-inactivated SARS-CoV-2 samples with a limit of detection (LOD) down to 10 aM within 25 minutes, which is comparable to the sensitivity of RT-PCR and rapidness of antigen test. The platform also successfully distinguishes all nine positive unprocessed clinical SARS-CoV-2 nasopharyngeal swab samples from four negative samples within 25 minutes of sample-to-result time. Together, this device provides a point-of-care solution that can be deployed in diverse settings beyond laboratory environments for rapid and accurate detection of RNA from clinical samples. The device can potentially be expandable to detect other viral targets, such as human immunodeficiency virus (HIV) self-testing and Zika virus, where rapid and ultrasensitive point-of-care detection is required. 
    more » « less
  5. null (Ed.)
    The COVID-19 pandemic provides an urgent example where a gap exists between availability of state-of-the-art diagnostics and current needs. As assay protocols and primer sequences become widely known, many laboratories perform diagnostic tests using methods such as RT-PCR or reverse transcription loop mediated isothermal amplification (RT-LAMP). Here, we report an RT-LAMP isothermal assay for the detection of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and demonstrate the assay on clinical samples using a simple and accessible point-of-care (POC) instrument. We characterized the assay by dipping swabs into synthetic nasal fluid spiked with the virus, moving the swab to viral transport medium (VTM), and sampling a volume of the VTM to perform the RT-LAMP assay without an RNA extraction kit. The assay has a limit of detection (LOD) of 50 RNA copies per μL in the VTM solution within 30 min. We further demonstrate our assay by detecting SARS-CoV-2 viruses from 20 clinical samples. Finally, we demonstrate a portable and real-time POC device to detect SARS-CoV-2 from VTM samples using an additively manufactured three-dimensional cartridge and a smartphone-based reader. The POC system was tested using 10 clinical samples, and was able to detect SARS-CoV-2 from these clinical samples by distinguishing positive samples from negative samples after 30 min. The POC tests are in complete agreement with RT-PCR controls. This work demonstrates an alternative pathway for SARS-CoV-2 diagnostics that does not require conventional laboratory infrastructure, in settings where diagnosis is required at the point of sample collection. 
    more » « less