skip to main content


Title: Jumper enables discontinuous transcript assembly in coronaviruses
Abstract

Genes in SARS-CoV-2 and other viruses in the order ofNidoviralesare expressed by a process of discontinuous transcription which is distinct from alternative splicing in eukaryotes and is mediated by the viral RNA-dependent RNA polymerase. Here, we introduce the DISCONTINUOUS TRANSCRIPT ASSEMBLYproblem of finding transcripts and their abundances given an alignment of paired-end short reads under a maximum likelihood model that accounts for varying transcript lengths. We show, using simulations, that our method, JUMPER, outperforms existing methods for classical transcript assembly. On short-read data of SARS-CoV-1, SARS-CoV-2 and MERS-CoV samples, we find that JUMPER not only identifies canonical transcripts that are part of the reference transcriptome, but also predicts expression of non-canonical transcripts that are supported by subsequent orthogonal analyses. Moreover, application of JUMPER on samples with and without treatment reveals viral drug response at the transcript level. As such, JUMPER enables detailed analyses ofNidoviralestranscriptomes under varying conditions.

 
more » « less
Award ID(s):
2027669 2046488 1850502 1652815
NSF-PAR ID:
10305467
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
12
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    SARS-CoV-2 is an RNA virus responsible for the coronavirus disease 2019 (COVID-19) pandemic. Viruses exist in complex microbial environments, and recent studies have revealed both synergistic and antagonistic effects of specific bacterial taxa on viral prevalence and infectivity. We set out to test whether specific bacterial communities predict SARS-CoV-2 occurrence in a hospital setting.

    Methods

    We collected 972 samples from hospitalized patients with COVID-19, their health care providers, and hospital surfaces before, during, and after admission. We screened for SARS-CoV-2 using RT-qPCR, characterized microbial communities using 16S rRNA gene amplicon sequencing, and used these bacterial profiles to classify SARS-CoV-2 RNA detection with a random forest model.

    Results

    Sixteen percent of surfaces from COVID-19 patient rooms had detectable SARS-CoV-2 RNA, although infectivity was not assessed. The highest prevalence was in floor samples next to patient beds (39%) and directly outside their rooms (29%). Although bed rail samples more closely resembled the patient microbiome compared to floor samples, SARS-CoV-2 RNA was detected less often in bed rail samples (11%). SARS-CoV-2 positive samples had higher bacterial phylogenetic diversity in both human and surface samples and higher biomass in floor samples. 16S microbial community profiles enabled high classifier accuracy for SARS-CoV-2 status in not only nares, but also forehead, stool, and floor samples. Across these distinct microbial profiles, a single amplicon sequence variant from the genusRothiastrongly predicted SARS-CoV-2 presence across sample types, with greater prevalence in positive surface and human samples, even when compared to samples from patients in other intensive care units prior to the COVID-19 pandemic.

    Conclusions

    These results contextualize the vast diversity of microbial niches where SARS-CoV-2 RNA is detected and identify specific bacterial taxa that associate with the viral RNA prevalence both in the host and hospital environment.

     
    more » « less
  2. Abstract

    Wastewater surveillance has proven to be an effective tool to monitor the transmission and emergence of infectious agents at a community scale. Workflows for wastewater surveillance generally rely on concentration steps to increase the probability of detection of low-abundance targets, but preconcentration can substantially increase the time and cost of analyses while also introducing additional loss of target during processing. To address some of these issues, we conducted a longitudinal study implementing a simplified workflow for SARS-CoV-2 detection from wastewater, using a direct column-based extraction approach. Composite influent wastewater samples were collected weekly for 1 year between June 2020 and June 2021 in Athens-Clarke County, Georgia, USA. Bypassing any concentration step, low volumes (280 µl) of influent wastewater were extracted using a commercial kit, and immediately analyzed by RT-qPCR for the SARS-CoV-2 N1 and N2 gene targets. SARS-CoV-2 viral RNA was detected in 76% (193/254) of influent samples, and the recovery of the surrogate bovine coronavirus was 42% (IQR: 28%, 59%). N1 and N2 assay positivity, viral concentration, and flow-adjusted daily viral load correlated significantly with per-capita case reports of COVID-19 at the county-level (ρ = 0.69–0.82). To compensate for the method’s high limit of detection (approximately 106–107 copies l−1 in wastewater), we extracted multiple small-volume replicates of each wastewater sample. With this approach, we detected as few as five cases of COVID-19 per 100 000 individuals. These results indicate that a direct-extraction-based workflow for SARS-CoV-2 wastewater surveillance can provide informative and actionable results.

     
    more » « less
  3. Abstract

    SARS‐CoV‐2 causes individualized symptoms. Many reasons have been given. We propose that an individual's epitranscriptomic system could be responsible as well. The viral RNA genome can be subject to epitranscriptomic modifications, which can be different for different individuals, and thus epitranscriptomics can affect many events including RNA replication differently. In this context, we studied the effects of modifications including pseudouridine (Ψ), 5‐methylcytosine (m5C),N6‐methyladenosine (m6A),N1‐methyladenosine (m1A) andN3‐methylcytosine (m3C) on the activity of SARS‐CoV‐2 replication complex (SC2RC). We found that Ψ, m5C, m6A and m3C had little effect, whereas m1A inhibited the enzyme. Both m1A and m3C disrupt canonical base pairing, but they had different effects. The fact that m1A inhibits SC2RC implies that the modification can be difficult to detect. This fact also implies that individuals with upregulated m1A including cancer, obesity and diabetes patients might have milder symptoms. However, this contradicts clinical observations. Relevant discussions are provided.

     
    more » « less
  4. Abstract

    The SARS‐CoV‐2 pandemic caused a public health crisis throughout the world and highlighted the need for rapid and sensitive testing as a countermeasure. A sensitive and specific biosensor platform is developed for the detection of antigen and RNA of SARS‐CoV‐2, and its variant (B1.1.529). The demonstrated biosensor platform combines unique protein catalyzed capture bioreceptors (PCCs) for antigen capture and a chimeric (RNA‐DNA) probe for RNA detection using LwaCas13a collateral cleavage activity atop graphene field effect transistors (gFETs). The reported biosensor is able to differentiate unprocessed 104pfu m−1samples of SARS‐CoV‐2 from Influenza and Rhinovirus. The limit of detection (LOD) calculated for SARS‐CoV‐2 antigen is 103in buffer and 104PFU mL−1in 10% saliva, while LOD of ≈65 amcalculated for viral RNA isolate without amplification. To provide a high reliability of detection, the role of internal and external factors with respect to gate voltage is further analyzed by Principal Component Analysis (PCA). Based on PCA analysis, the authors are able to classify the samples as pathogen positive or negative (Y> 0: Positive for pathogen,Y< 0: Negative for pathogen). The reported platform can be quickly adapted for multi‐omics and multiplexed diagnosis of continuously evolving biothreats and global pandemics.

     
    more » « less
  5. Abstract

    The ongoing COVID-19 pandemic highlights the necessity for a more fundamental understanding of the coronavirus life cycle. The causative agent of the disease, SARS-CoV-2, is being studied extensively from a structural standpoint in order to gain insight into key molecular mechanisms required for its survival. Contained within the untranslated regions of the SARS-CoV-2 genome are various conserved stem-loop elements that are believed to function in RNA replication, viral protein translation, and discontinuous transcription. While the majority of these regions are variable in sequence, a 41-nucleotide s2m element within the genome 3′ untranslated region is highly conserved among coronaviruses and three other viral families. In this study, we demonstrate that the SARS-CoV-2 s2m element dimerizes by forming an intermediate homodimeric kissing complex structure that is subsequently converted to a thermodynamically stable duplex conformation. This process is aided by the viral nucleocapsid protein, potentially indicating a role in mediating genome dimerization. Furthermore, we demonstrate that the s2m element interacts with multiple copies of host cellular microRNA (miRNA) 1307-3p. Taken together, our results highlight the potential significance of the dimer structures formed by the s2m element in key biological processes and implicate the motif as a possible therapeutic drug target for COVID-19 and other coronavirus-related diseases.

     
    more » « less