skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on April 16, 2026

Title: Deletion detection in SARS-CoV-2 genomes using multiplex-PCR sequencing from COVID-19 patients: elimination of false positives
Abstract Deletions are prevalent in the genomes of SARS-CoV-2 isolates from COVID-19 patients, but their roles in the severity, transmission, and persistence of disease are poorly understood. Millions of COVID-19 swab samples from patients have been sequenced and made available online, offering an unprecedented opportunity to study such deletions. Multiplex PCR-based amplicon sequencing (amplicon-seq) has been the most widely used method for sequencing clinical COVID-19 samples. However, existing bioinformatics methods applied to negative control samples sequenced by multiplex-PCR sequencing often yield large numbers of false-positive deletions. We found that these false positives commonly occur in short alignments, at low frequency and depth, and near primer-binding sites used for whole-genome amplification. To address this issue, we developed a filtering strategy, validated with positive control samples containing a known deletion. Our strategy accurately detected the known deletion and removed more than 99% of false positives. This method, applied to public COVID-19 swab data, revealed that deletions occurring independently of transcription regulatory sequences were about 20-fold less common than previously reported; however, they remain more frequent in symptomatic patients. Our optimized approach should enhance the reliability of SARS-CoV-2 deletion characterization from surveillance studies. Finally, our approach may guide the development of more reliable bioinformatics pipelines for genome sequence analyses of other viruses.  more » « less
Award ID(s):
2151959
PAR ID:
10607878
Author(s) / Creator(s):
; ;
Publisher / Repository:
medRxiv
Date Published:
Format(s):
Medium: X
Institution:
medRxiv
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Abstract Background Wastewater-based epidemiology (WBE) for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) can be an important source of information for coronavirus disease 2019 (COVID-19) management during and after the pandemic. Currently, governments and transportation industries around the world are developing strategies to minimize SARS-CoV-2 transmission associated with resuming activity. This study investigated the possible use of SARS-CoV-2 RNA wastewater surveillance from airline and cruise ship sanitation systems and its potential use as a COVID-19 public health management tool. Methods Aircraft and cruise ship wastewater samples (n = 21) were tested for SARS-CoV-2 using two virus concentration methods, adsorption–extraction by electronegative membrane (n = 13) and ultrafiltration by Amicon (n = 8), and five assays using reverse-transcription quantitative polymerase chain reaction (RT-qPCR) and RT-droplet digital PCR (RT-ddPCR). Representative qPCR amplicons from positive samples were sequenced to confirm assay specificity. Results SARS-CoV-2 RNA was detected in samples from both aircraft and cruise ship wastewater; however concentrations were near the assay limit of detection. The analysis of multiple replicate samples and use of multiple RT-qPCR and/or RT-ddPCR assays increased detection sensitivity and minimized false-negative results. Representative qPCR amplicons were confirmed for the correct PCR product by sequencing. However, differences in sensitivity were observed among molecular assays and concentration methods. Conclusions The study indicates that surveillance of wastewater from large transport vessels with their own sanitation systems has potential as a complementary data source to prioritize clinical testing and contact tracing among disembarking passengers. Importantly, sampling methods and molecular assays must be further optimized to maximize detection sensitivity. The potential for false negatives by both wastewater testing and clinical swab testing suggests that the two strategies could be employed together to maximize the probability of detecting SARS-CoV-2 infections amongst passengers. 
    more » « less
  2. Abstract Objective: Current guidance states that asymptomatic screening for severe acute respiratory coronavirus virus 2 (SARS-CoV-2) prior to admission to an acute-care setting is at the facility’s discretion. This study’s objective was to estimate the number of undetected cases of SARS-CoV-2 admitted as inpatients under 4 testing approaches and varying assumptions. Design and setting: Individual-based microsimulation of 104 North Carolina acute-care hospitals Patients: All simulated inpatient admissions to acute-care hospitals from December 15, 2021, to January 13, 2022 [ie, during the SARS-COV-2 ο (omicron) variant surge]. Interventions: We simulated (1) only testing symptomatic patients, (2) 1-stage antigen testing with no confirmatory polymerase chain reaction (PCR) test, (3) 1-stage antigen testing with a confirmatory PCR for negative results, and (4) serial antigen screening (ie, repeat antigen test 2 days after a negative result). Results: Over 1 month, there were 77,980 admissions: 13.7% for COVID-19, 4.3% with but not for COVID-19, and 82.0% for non–COVID-19 indications without current infection. Without asymptomatic screening, 1,089 (credible interval [CI], 946–1,253) total SARS-CoV-2 infections (7.72%) went undetected. With 1-stage antigen screening, 734 (CI, 638–845) asymptomatic infections (67.4%) were detected, with 1,277 false positives. With combined antigen and PCR screening, 1,007 (CI, 875–1,159) asymptomatic infections (92.5%) were detected, with 5,578 false positives. A serial antigen testing policy detected 973 (CI, 845–1,120) asymptomatic infections (89.4%), with 2,529 false positives. Conclusions: Serial antigen testing identified >85% of asymptomatic infections and resulted in fewer false positives with less cost per identified infection compared to combined antigen plus PCR testing. 
    more » « less
  3. Giri, Basant (Ed.)
    Early detection of SARS-CoV-2 infection is key to managing the current global pandemic, as evidence shows the virus is most contagious on or before symptom onset. Here, we introduce a low-cost, high-throughput method for diagnosing and studying SARS-CoV-2 infection. Dubbed Pathogen-Oriented Low-Cost Assembly & Re-Sequencing (POLAR), this method amplifies the entirety of the SARS-CoV-2 genome. This contrasts with typical RT-PCR-based diagnostic tests, which amplify only a few loci. To achieve this goal, we combine a SARS-CoV-2 enrichment method developed by the ARTIC Network (https://artic.network/) with short-read DNA sequencing andde novogenome assembly. Using this method, we can reliably (>95% accuracy) detect SARS-CoV-2 at a concentration of 84 genome equivalents per milliliter (GE/mL). The vast majority of diagnostic methods meeting our analytical criteria that are currently authorized for use by the United States Food and Drug Administration with the Coronavirus Disease 2019 (COVID-19) Emergency Use Authorization require higher concentrations of the virus to achieve this degree of sensitivity and specificity. In addition, we can reliably assemble the SARS-CoV-2 genome in the sample, often with no gaps and perfect accuracy given sufficient viral load. The genotypic data in these genome assemblies enable the more effective analysis of disease spread than is possible with an ordinary binary diagnostic. These data can also help identify vaccine and drug targets. Finally, we show that the diagnoses obtained using POLAR of positive and negative clinical nasal mid-turbinate swab samples 100% match those obtained in a clinical diagnostic lab using the Center for Disease Control’s 2019-Novel Coronavirus test. Using POLAR, a single person can manually process 192 samples over an 8-hour experiment at the cost of ~$36 per patient (as of December 7th, 2022), enabling a 24-hour turnaround with sequencing and data analysis time. We anticipate that further testing and refinement will allow greater sensitivity using this approach. 
    more » « less
  4. null (Ed.)
    Background . New York City (NYC) experienced an initial surge and gradual decline in the number of SARS-CoV-2-confirmed cases in 2020. A change in the pattern of laboratory test results in COVID-19 patients over this time has not been reported or correlated with patient outcome. Methods . We performed a retrospective study of routine laboratory and SARS-CoV-2 RT-PCR test results from 5,785 patients evaluated in a NYC hospital emergency department from March to June employing machine learning analysis. Results . A COVID-19 high-risk laboratory test result profile (COVID19-HRP), consisting of 21 routine blood tests, was identified to characterize the SARS-CoV-2 patients. Approximately half of the SARS-CoV-2 positive patients had the distinct COVID19-HRP that separated them from SARS-CoV-2 negative patients. SARS-CoV-2 patients with the COVID19-HRP had higher SARS-CoV-2 viral loads, determined by cycle threshold values from the RT-PCR, and poorer clinical outcome compared to other positive patients without the COVID12-HRP. Furthermore, the percentage of SARS-CoV-2 patients with the COVID19-HRP has significantly decreased from March/April to May/June. Notably, viral load in the SARS-CoV-2 patients declined, and their laboratory profile became less distinguishable from SARS-CoV-2 negative patients in the later phase. Conclusions . Our longitudinal analysis illustrates the temporal change of laboratory test result profile in SARS-CoV-2 patients and the COVID-19 evolvement in a US epicenter. This analysis could become an important tool in COVID-19 population disease severity tracking and prediction. In addition, this analysis may play an important role in prioritizing high-risk patients, assisting in patient triaging and optimizing the usage of resources. 
    more » « less
  5. Early detection of the COVID-19 virus, SARS-CoV-2, is key to mitigating the spread of new outbreaks. Data from individual testing is increasingly difficult to obtain as people conduct non-reported home tests, defer tests due to logistics or attitudes, or ignore testing altogether. Wastewater based epidemiology is an alternative method for surveilling a community while maintaining individual anonymity; however, a problem is that SARS-CoV-2 markers in wastewater vary throughout the day. Collecting grab samples at a single time may miss marker presence, while autosampling throughout a day is technically challenging and expensive. This study investigates a passive sampling method that would be expected to accumulate greater amounts of viral material from sewers over a period of time. Tampons were tested as passive swab sampling devices from which viral markers could be eluted with a Tween-20 surfactant wash. Six sewersheds in Detroit were sampled 16–22 times by paired swab (4 h immersion before retrieval) and grab methods over a five-month period and enumerated for N1 and N2 SARS-CoV-2 markers using ddPCR. Swabs detected SARS-CoV-2 markers significantly more frequently (P < 0.001) than grab samples, averaging two to three-fold more copies of SARS-CoV-2 markers than their paired grab samples (p < 0.0001) in the assayed volume (10 mL) of wastewater or swab eluate. No significant difference was observed in the recovery of a spiked-in control (Phi6), indicating that the improved sensitivity is not due to improvements in nucleic acid recovery or reduction of PCR inhibition. The outcomes of swab-based sampling varied significantly between sites, with swab samples providing the greatest improvements in counts for smaller sewersheds that otherwise tend to have greater variation in grab sample counts. Swab-sampling with tampons provides significant advantages in detection of SARS-CoV-2 wastewater markers and are expected to provide earlier detection of new outbreaks than grab samples, with consequent public health benefits. 
    more » « less