skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Jumper enables discontinuous transcript assembly in coronaviruses
Abstract Genes in SARS-CoV-2 and other viruses in the order ofNidoviralesare expressed by a process of discontinuous transcription which is distinct from alternative splicing in eukaryotes and is mediated by the viral RNA-dependent RNA polymerase. Here, we introduce the DISCONTINUOUS TRANSCRIPT ASSEMBLYproblem of finding transcripts and their abundances given an alignment of paired-end short reads under a maximum likelihood model that accounts for varying transcript lengths. We show, using simulations, that our method, JUMPER, outperforms existing methods for classical transcript assembly. On short-read data of SARS-CoV-1, SARS-CoV-2 and MERS-CoV samples, we find that JUMPER not only identifies canonical transcripts that are part of the reference transcriptome, but also predicts expression of non-canonical transcripts that are supported by subsequent orthogonal analyses. Moreover, application of JUMPER on samples with and without treatment reveals viral drug response at the transcript level. As such, JUMPER enables detailed analyses ofNidoviralestranscriptomes under varying conditions.  more » « less
Award ID(s):
2027669 2046488 1850502 1652815
PAR ID:
10305467
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
12
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Elkins, Christopher A (Ed.)
    ABSTRACT Municipal wastewater harbors diverse RNA viruses, which are responsible for many emerging and reemerging diseases in humans, animals, and plants. Although genomic sequencing can be a high-throughput approach for profiling the RNA virome in wastewater, wastewater processing methods often influence sequencing outcomes. Here, we systematically evaluated two wastewater processing methods, tangential-flow ultrafiltration (TFF) and Nanotrap Microbiome A Particles, for detecting the target RNA virus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) via amplicon sequencing and characterizing the RNA virome using whole-transcriptome shotgun sequencing. Our results from paired comparison tests showed that the TFF and Nanotrap methods recovered similar SARS-CoV-2 variants at the lineage level (analysis of similarity [ANOSIM]R= −0.012,P= 0.874). Optimizing automated procedures for the Nanotrap method and concentration factors for the TFF method was critical for achieving high-depth and high-breadth coverage of the target virus genome. Notably, the two methods enriched distinct RNA viromes from the same wastewater samples (ANOSIMR= 0.260,P= 0.002), with TFF samples showing 22-fold and 7-fold higher relative abundances ofReoviridaeandCoronaviridae, respectively. These differences are likely due to the distinct virus concentration mechanisms employed by each method, which are influenced by liquid-solid partitioning of virus particles and interactions of viral surface proteins with ligands. Our findings underscore the importance of optimizing wastewater processing methods for genomic monitoring and have implications for broader environmental applications.IMPORTANCEWastewater genomic sequencing is an emerging technology for tracking viral infections within communities. However, different methods for concentrating viruses and extracting nucleic acids can influence the recoveries of RNA virome from wastewater. An in-depth understanding of virus concentration mechanisms and their impact on sequencing data quality and bioinformatic output would be critical to guide method selection and optimization. Specifically, this study systematically evaluated tangential-flow ultrafiltration and Nanotrap microbiome particles for their application to sequence SARS-CoV-2 and whole RNA virome from wastewater. Both methods yielded high-quality sequencing data for amplicon sequencing of SARS-CoV-2, but their outcomes diverged in the recovered RNA virome. We identified RNA viruses that are preferentially recovered by each of these two methods and proposed considerations of method selection for future studies of wastewater RNA virome. 
    more » « less
  2. null (Ed.)
    Microbes and viruses are known to alter host transcriptomes by means of infection. In light of recent challenges posed by the COVID-19 pandemic, a deeper understanding of the disease at the transcriptome level is needed. However, research about transcriptome reprogramming by post-transcriptional regulation is very limited. In this study, computational methods developed by our lab were applied to RNA-seq data to detect transcript variants (i.e., alternative splicing (AS) and alternative polyadenylation (APA) events). The RNA-seq data were obtained from a publicly available source, and they consist of mock-treated and SARS-CoV-2 infected (COVID-19) lung alveolar (A549) cells. Data analysis results show that more AS events are found in SARS-CoV-2 infected cells than in mock-treated cells, whereas fewer APA events are detected in SARS-CoV-2 infected cells. A combination of conventional differential gene expression analysis and transcript variants analysis revealed that most of the genes with transcript variants are not differentially expressed. This indicates that no strong correlation exists between differential gene expression and the AS/APA events in the mock-treated or SARS-CoV-2 infected samples. These genes with transcript variants can be applied as another layer of molecular signatures for COVID-19 studies. In addition, the transcript variants are enriched in important biological pathways that were not detected in the studies that only focused on differential gene expression analysis. Therefore, the pathways may lead to new molecular mechanisms of SARS-CoV-2 pathogenesis. 
    more » « less
  3. Abstract The ongoing COVID-19 pandemic highlights the necessity for a more fundamental understanding of the coronavirus life cycle. The causative agent of the disease, SARS-CoV-2, is being studied extensively from a structural standpoint in order to gain insight into key molecular mechanisms required for its survival. Contained within the untranslated regions of the SARS-CoV-2 genome are various conserved stem-loop elements that are believed to function in RNA replication, viral protein translation, and discontinuous transcription. While the majority of these regions are variable in sequence, a 41-nucleotide s2m element within the genome 3′ untranslated region is highly conserved among coronaviruses and three other viral families. In this study, we demonstrate that the SARS-CoV-2 s2m element dimerizes by forming an intermediate homodimeric kissing complex structure that is subsequently converted to a thermodynamically stable duplex conformation. This process is aided by the viral nucleocapsid protein, potentially indicating a role in mediating genome dimerization. Furthermore, we demonstrate that the s2m element interacts with multiple copies of host cellular microRNA (miRNA) 1307-3p. Taken together, our results highlight the potential significance of the dimer structures formed by the s2m element in key biological processes and implicate the motif as a possible therapeutic drug target for COVID-19 and other coronavirus-related diseases. 
    more » « less
  4. Gorbalenya, Alexander E (Ed.)
    Researchers and clinicians often rely on molecular assays like PCR to identify and monitor viral infections, instead of the resource-prohibitive gold standard of viral culture. However, it remains unclear when (if ever) PCR measurements of viral load are reliable indicators of replicating or infectious virus. The recent popularity of PCR protocols targeting subgenomic RNA for SARS-CoV-2 has caused further confusion, as the relationships between subgenomic RNA and standard total RNA assays are incompletely characterized and opinions differ on which RNA type better predicts culture outcomes. Here, we explore these issues by comparing total RNA, subgenomic RNA, and viral culture results from 24 studies of SARS-CoV-2 in non-human primates (including 2167 samples from 174 individuals) using custom-developed Bayesian statistical models. On out-of-sample data, our best models predict subgenomic RNA positivity from total RNA data with 91% accuracy, and they predict culture positivity with 85% accuracy. Further analyses of individual time series indicate that many apparent prediction errors may arise from issues with assay sensitivity or sample processing, suggesting true accuracy may be higher than these estimates. Total RNA and subgenomic RNA showed equivalent performance as predictors of culture positivity. Multiple cofactors (including exposure conditions, host traits, and assay protocols) influence culture predictions, yielding insights into biological and methodological sources of variation in assay outcomes–and indicating that no single threshold value applies across study designs. We also show that our model can accurately predict when an individual is no longer infectious, illustrating the potential for future models trained on human data to guide clinical decisions on case isolation. Our work shows that meta-analysis ofin vivodata can overcome longstanding challenges arising from limited sample sizes and can yield robust insights beyond those attainable from individual studies. Our analytical pipeline offers a framework to develop similar predictive tools in other virus-host systems, including models trained on human data, which could support laboratory analyses, medical decisions, and public health guidelines. 
    more » « less
  5. Monitoring wastewater samples at building-level resolution screens large populations for SARS-CoV-2, prioritizing testing and isolation efforts. Here we perform untargeted metatranscriptomics on virally-enriched wastewater samples from 10 locations on the UC San Diego campus, demonstrating that resulting bacterial taxonomic and functional profiles discriminate SARS-CoV-2 status even without direct detection of viral transcripts. Our proof-of-principle reveals emergent threats through changes in the human microbiome, suggesting new approaches for untargeted wastewater-based epidemiology. 
    more » « less