Title: Genome Sequencing of Sewage Detects Regionally Prevalent SARS-CoV-2 Variants
ABSTRACT Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and nearly complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the United States or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside California, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context.
Editors:
Pettigrew, Melinda M.
Award ID(s):
1633740
NSF-PAR ID:
10285816
Journal Name:
mBio
Volume:
12
Issue:
1
ISSN:
2161-2129
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background SARS-CoV-2 Delta variant has caused a dramatic resurgence in infections in the United States, raising questions regarding potential transmissibility among vaccinated individuals. Methods Between October 2020 and July 2021, we sequenced 4,439 SARS-CoV-2 full genomes, 23% of all known infections in Alachua County, Florida, including 109 vaccine breakthrough cases. Univariate and multivariate regression analyses were conducted to evaluate associations between viral RNA burden and patient characteristics. Contact tracing and phylogenetic analysis were used to investigate direct transmissions involving vaccinated individuals. Results The majority of breakthrough sequences with lineage assignment were classified as Delta variants (74.6%) and occurred, on average, about three months (104 ± 57.5 days) after full vaccination, at the same time (June-July 2021) of Delta variant exponential spread within the county. Six Delta variant transmission pairs between fully vaccinated individuals were identified through contact tracing, three of which were confirmed by phylogenetic analysis. Delta breakthroughs exhibited broad viral RNA copy number values during acute infection (IQR 1.2 – 8.64 Log copies/ml), on average 38% lower than matched unvaccinated patients (3.29 – 10.81 Log copies/ml, p<0.00001). Nevertheless, 49-50% of all breakthroughs, and 56-60% of Delta-infected breakthroughs exhibited viral RNA levels above the transmissibility threshold (4 Log copies/ml) irrespective of time post vaccination. Conclusions Delta infection transmissibility and general viral RNA quantification patterns in vaccinated individuals suggest limited levels of sterilizing immunity that need to be considered by public health policies. In particular, ongoing evaluation of vaccine boosters should specifically address whether extra vaccine doses curb breakthrough contribution to epidemic spread.
  2. Lowen, Anice C. (Ed.)
    Genetic diversity is the fuel of evolution and facilitates adaptation to novel environments. However, our understanding of what drives differences in the genetic diversity during the early stages of viral infection is somewhat limited. Here, we use ultra-deep sequencing to interrogate 43 clinical samples taken from early infections of the human-infecting viruses HIV, RSV and CMV. Hundreds to thousands of virus templates were sequenced per sample, allowing us to reveal dramatic differences in within-host genetic diversity among virus populations. We found that increased diversity was mostly driven by presence of multiple divergent genotypes in HIV and CMV samples, which we suggest reflect multiple transmitted/founder viruses. Conversely, we detected an abundance of low frequency hyper-edited genomes in RSV samples, presumably reflecting defective virus genomes (DVGs). We suggest that RSV is characterized by higher levels of cellular co-infection, which allow for complementation and hence elevated levels of DVGs.
  3. Abstract Background Wastewater-based epidemiology (WBE) for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) can be an important source of information for coronavirus disease 2019 (COVID-19) management during and after the pandemic. Currently, governments and transportation industries around the world are developing strategies to minimize SARS-CoV-2 transmission associated with resuming activity. This study investigated the possible use of SARS-CoV-2 RNA wastewater surveillance from airline and cruise ship sanitation systems and its potential use as a COVID-19 public health management tool. Methods Aircraft and cruise ship wastewater samples (n = 21) were tested for SARS-CoV-2 using two virus concentration methods, adsorption–extraction by electronegative membrane (n = 13) and ultrafiltration by Amicon (n = 8), and five assays using reverse-transcription quantitative polymerase chain reaction (RT-qPCR) and RT-droplet digital PCR (RT-ddPCR). Representative qPCR amplicons from positive samples were sequenced to confirm assay specificity. Results SARS-CoV-2 RNA was detected in samples from both aircraft and cruise ship wastewater; however concentrations were near the assay limit of detection. The analysis of multiple replicate samples and use of multiple RT-qPCR and/or RT-ddPCR assays increased detection sensitivity and minimized false-negative results. Representative qPCR amplicons were confirmed for the correct PCR product by sequencing. However, differences in sensitivity were observed among molecular assays and concentration methods. Conclusions The study indicates that surveillance of wastewater from large transport vessels with their own sanitation systems has potential as a complementary data source to prioritize clinical testing and contact tracing among disembarking passengers. Importantly, sampling methods and molecular assays must be further optimized to maximize detection sensitivity. The potential for false negatives by both wastewater testing and clinical swab testing suggests that the two strategies could be employed together to maximize the probability of detecting SARS-CoV-2 infections amongst passengers.
  4. Yeager, Meredith (Ed.)
    Abstract Global sequencing of genomes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has continued to reveal new genetic variants that are the key to unraveling its early evolutionary history and tracking its global spread over time. Here we present the heretofore cryptic mutational history and spatiotemporal dynamics of SARS-CoV-2 from an analysis of thousands of high-quality genomes. We report the likely most recent common ancestor of SARS-CoV-2, reconstructed through a novel application and advancement of computational methods initially developed to infer the mutational history of tumor cells in a patient. This progenitor genome differs from genomes of the first coronaviruses sampled in China by three variants, implying that none of the earliest patients represent the index case or gave rise to all the human infections. However, multiple coronavirus infections in China and the United States harbored the progenitor genetic fingerprint in January 2020 and later, suggesting that the progenitor was spreading worldwide months before and after the first reported cases of COVID-19 in China. Mutations of the progenitor and its offshoots have produced many dominant coronavirus strains that have spread episodically over time. Fingerprinting based on common mutations reveals that the same coronavirus lineage has dominated North America for most of the pandemic in 2020. There have been multiple replacements of predominant coronavirus strains in Europe and Asia as well as continued presence of multiple high-frequency strains in Asia and North America. We have developed a continually updating dashboard of global evolution and spatiotemporal trends of SARS-CoV-2 spread (http://sars2evo.datamonkey.org/).
  5. INTRODUCTION One of the central applications of the human reference genome has been to serve as a baseline for comparison in nearly all human genomic studies. Unfortunately, many difficult regions of the reference genome have remained unresolved for decades and are affected by collapsed duplications, missing sequences, and other issues. Relative to the current human reference genome, GRCh38, the Telomere-to-Telomere CHM13 (T2T-CHM13) genome closes all remaining gaps, adds nearly 200 million base pairs (Mbp) of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for scientific inquiry. RATIONALE We demonstrate how the T2T-CHM13 reference genome universally improves read mapping and variant identification in a globally diverse cohort. This cohort includes all 3202 samples from the expanded 1000 Genomes Project (1KGP), sequenced with short reads, as well as 17 globally diverse samples sequenced with long reads. By applying state-of-the-art methods for calling single-nucleotide variants (SNVs) and structural variants (SVs), we document the strengths and limitations of T2T-CHM13 relative to its predecessors and highlight its promise for revealing new biological insights within technically challenging regions of the genome. RESULTS Across the 1KGP samples, we found more than 1 million additional high-quality variants genome-wide using T2T-CHM13 than with GRCh38. Within previously unresolved regions of the genome, we identified hundreds of thousands of variants per sample—a promising opportunity for evolutionary and biomedical discovery. T2T-CHM13 improves the Mendelian concordance rate among trios and eliminates tens of thousands of spurious SNVs per sample, including a reduction of false positives in 269 challenging, medically relevant genes by up to a factor of 12. These corrections are in large part due to improvements to 70 protein-coding genes in >9 Mbp of inaccurate sequence caused by falsely collapsed or duplicated regions in GRCh38. Using the T2T-CHM13 genome also yields a more comprehensive view of SVs genome-wide, with a greatly improved balance of insertions and deletions. Finally, by providing numerous resources for T2T-CHM13 (including 1KGP genotypes, accessibility masks, and prominent annotation databases), our work will facilitate the transition to T2T-CHM13 from the current reference genome. CONCLUSION The vast improvements in variant discovery across samples of diverse ancestries position T2T-CHM13 to succeed as the next prevailing reference for human genetics. T2T-CHM13 thus offers a model for the construction and study of high-quality reference genomes from globally diverse individuals, such as is now being pursued through collaboration with the Human Pangenome Reference Consortium. As a foundation, our work underscores the benefits of an accurate and complete reference genome for revealing diversity across human populations. Genomic features and resources available for T2T-CHM13. Comparisons to GRCh38 reveal broad improvements in SNVs, indels, and SVs discovered across diverse human populations by means of short-read (1KGP) and long-read sequencing (LRS). These improvements are due to resolution of complex genomic loci (nonsyntenic and previously unresolved), duplication errors, and discordant haplotypes, including those in medically relevant genes.