Title: A computational approach for the identification of distant homologs of bacterial riboswitches based on inverse RNA folding
Abstract

Riboswitches are conserved structural ribonucleic acid (RNA) sensors found mainly in bacteria, where they regulate a large number of genes and operons. More than 50 bacterial riboswitch classes have been discovered to date, but only the thiamine pyrophosphate riboswitch class has been detected in eukaryotes, specifically in a few fungi, plants and algae. One of the most important challenges in riboswitch research is to discover existing riboswitch classes in eukaryotes and to understand the evolution of bacterial riboswitches. However, traditional search methods for riboswitch detection have failed to detect eukaryotic riboswitches beyond this one class, or any distant structural homologs of riboswitches. We developed a novel approach based on inverse RNA folding that attempts to find sequences that match the shape of the target structure with minimal sequence conservation, limited to the key nucleotides that interact directly with the ligand. Then, to support our matched candidates, we expanded the results into a covariance model representing similar sequences that preserve the structure. Our method transforms a structure-based search into a sequence-based search that considers the conservation of secondary structure shape and ligand-binding residues. This method enabled us to identify a potential structural candidate in fungi that could be a distant homolog of bacterial purine riboswitches. Further, phylogenomic analysis and the evolutionary distribution of this structural candidate indicate that its most likely point of origin in these organisms is associated with the loss of traditional purine riboswitches. The computational approach could be applicable to other domains and problems in RNA research.
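
To make the idea of a search constrained by both secondary structure shape and ligand-binding residues concrete, the following is a minimal sketch and not the authors' pipeline. It assumes a target structure in dot-bracket notation and a per-position IUPAC constraint string, both hypothetical toy inputs, and it only checks necessary conditions: that a candidate window respects the fixed nucleotides and can form every required base pair. It performs no energy-based folding, which an actual inverse RNA folding tool would do, and all function names are illustrative.

```python
# Toy filter: can a sequence window adopt a target shape while keeping key residues?
# All names and inputs here are illustrative assumptions, not taken from the paper.

# IUPAC nucleotide codes mapped to the RNA bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "R": "AG", "Y": "CU", "S": "CG", "W": "AU",
    "K": "GU", "M": "AC", "N": "ACGU",
}

# Watson-Crick pairs plus the G-U wobble pair.
CANONICAL_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
                   ("C", "G"), ("G", "U"), ("U", "G")}


def parse_dot_bracket(structure):
    """Return sorted (i, j) index pairs for a pseudoknot-free dot-bracket string."""
    stack, pairs = [], []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            pairs.append((stack.pop(), i))
        elif ch != ".":
            raise ValueError("unsupported symbol %r" % ch)
    if stack:
        raise ValueError("unbalanced '(' in structure")
    return sorted(pairs)


def is_compatible(seq, structure, constraint):
    """True if seq satisfies the IUPAC constraint and can form every target pair."""
    if not (len(seq) == len(structure) == len(constraint)):
        return False
    seq = seq.upper().replace("T", "U")
    constraint = constraint.upper()
    for base, code in zip(seq, constraint):
        if base not in IUPAC.get(code, ""):
            return False
    return all((seq[i], seq[j]) in CANONICAL_PAIRS
               for i, j in parse_dot_bracket(structure))


def scan(genome, structure, constraint):
    """Return start offsets of all windows compatible with the target."""
    n = len(structure)
    return [i for i in range(len(genome) - n + 1)
            if is_compatible(genome[i:i + n], structure, constraint)]


if __name__ == "__main__":
    target = "(((....)))"          # toy hairpin, not a real riboswitch aptamer
    key_residues = "NNNNUNNNNN"    # hypothetical: one fixed loop nucleotide
    print(scan("AAGGCAUAAGCCAA", target, key_residues))
```

Windows that pass such a filter would still need to be folded with a thermodynamic model and, as the abstract describes, supported by a covariance model before being treated as candidates.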

 
NSF-PAR ID:
10402796
Author(s) / Creator(s):
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Briefings in Bioinformatics
Volume:
24
Issue:
3
ISSN:
1467-5463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    RNA folds cotranscriptionally to traverse out-of-equilibrium intermediate structures that are important for RNA function in the context of gene regulation. To investigate this process, here we study the structure and function of the Bacillus subtilis yxjA purine riboswitch, a transcriptional riboswitch that downregulates a nucleoside transporter in response to binding guanine. Although the aptamer and expression platform domain sequences of the yxjA riboswitch do not completely overlap, we hypothesized that a strand exchange process triggers its structural switching in response to ligand binding. In vivo fluorescence assays, structural chemical probing data and experimentally informed secondary structure modeling suggest the presence of a nascent intermediate central helix. The formation of this central helix in the absence of ligand appears to compete with both the aptamer’s P1 helix and the expression platform’s transcriptional terminator. All-atom molecular dynamics simulations support the hypothesis that ligand binding stabilizes the aptamer P1 helix against central helix strand invasion, thus allowing the terminator to form. These results present a potential model mechanism to explain how ligand binding can induce downstream conformational changes by influencing local strand displacement processes of intermediate folds that could be at play in multiple riboswitch classes.

     
  2. Abstract

    RNAs begin to fold and function during transcription. Riboswitches undergo cotranscriptional switching in the context of transcription elongation, RNA folding, and ligand binding. To investigate how these processes jointly modulate the function of the folate stress-sensing Fusobacterium ulcerans ZTP riboswitch, we apply a single-molecule vectorial folding (VF) assay in which an engineered superhelicase Rep-X sequentially releases fluorescently labeled riboswitch RNA from a heteroduplex in a 5′-to-3′ direction, at ~60 nt s⁻¹ [comparable to the speed of bacterial RNA polymerase (RNAP)]. We demonstrate that the ZTP riboswitch is kinetically controlled and that its activation is favored by slower unwinding, strategic pausing between but not before key folding elements, or a weakened transcription terminator. Real-time single-molecule monitoring captures folding riboswitches in multiple states, including an intermediate responsible for delayed terminator formation. These results show how individual nascent RNAs occupy distinct channels within the folding landscape that controls the fate of the riboswitch.

     
  3. RNA motif classification is important for understanding structure/function connections and building phylogenetic relationships. Using our coarse-grained RNA-As-Graphs (RAG) representations, we identify recurrent dual graph motifs in experimentally solved RNA structures based on an improved search algorithm that finds and ranks independent RNA substructures. Our expanded list of 183 existing dual graph motifs reveals five common motifs found in transfer RNA, riboswitch, and ribosomal 5S RNA components. Moreover, we identify three motifs for available viral frameshifting RNA elements, suggesting a correlation between viral structural complexity and frameshifting efficiency. We further partition the RNA substructures into 1844 distinct submotifs, with pseudoknots and junctions retained intact. Common modules are internal loops and three-way junctions, and three submotifs are associated with riboswitches that bind nucleotides, ions, and signaling molecules. Together, our library of existing RNA motifs and submotifs adds to the growing universe of RNA modules, and provides a resource of structures and substructures for novel RNA design. 
  4. Abstract

    A central question in biology is how RNA sequence changes influence dynamic conformational changes during cotranscriptional folding. Here we investigated this question through the study of transcriptional fluoride riboswitches, non-coding RNAs that sense the fluoride anion through the coordinated folding and rearrangement of a pseudoknotted aptamer domain and a downstream intrinsic terminator expression platform. Using a combination of Escherichia coli RNA polymerase in vitro transcription and cellular gene expression assays, we characterized the function of mesophilic and thermophilic fluoride riboswitch variants. We showed that only variants containing the mesophilic pseudoknot function at 37°C. We next systematically varied the pseudoknot sequence and found that a single wobble base pair is critical for function. Characterizing thermophilic variants at 65°C through Thermus aquaticus RNA polymerase in vitro transcription showed the importance of this wobble pair for function even at elevated temperatures. Finally, we performed all-atom molecular dynamics simulations which supported the experimental findings, visualized the RNA structure switching process, and provided insight into the important role of magnesium ions. Together these studies provide deeper insights into the role of riboswitch sequence in influencing folding and function that will be important for understanding of RNA-based gene regulation and for synthetic biology applications.

     
  5. We report the biological and structural characterization of umbravirus-like associated RNAs (ulaRNAs), a new category of coat protein-dependent subviral RNA replicons that infect plants. These RNAs encode an RNA-dependent RNA polymerase (RdRp) following a −1 ribosomal frameshift event, are 2.7–4.6 kb in length, and are related to umbraviruses, unlike similar RNA replicons that are related to tombusviruses. Three classes of ulaRNAs are proposed, with citrus yellow vein associated virus (CYVaV) placed in Class 2. With the exception of CYVaV, Class 2 and Class 3 ulaRNAs encode an additional open reading frame (ORF) with movement protein-like motifs made possible by additional sequences just past the RdRp termination codon. The full-length secondary structure of CYVaV was determined using Selective 2′-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) structure probing and phylogenetic comparisons, and was used as a template for determining the putative structures of the other Class 2 ulaRNAs, revealing a number of distinctive structural features. The ribosome recoding sites of nearly all ulaRNAs, which differ significantly from those of umbraviruses, may exist in two conformations and are highly efficient. The 3′ regions of Class 2 and Class 3 ulaRNAs have structural elements similar to those of nearly all umbraviruses, and all Class 2 ulaRNAs have a unique, conserved 3′ cap-independent translation enhancer. CYVaV replicates independently in protoplasts, demonstrating that the reported sequence is full-length. Additionally, CYVaV contains a sequence in its 3′ UTR that confers protection against nonsense-mediated decay (NMD), thus likely obviating the need for umbravirus ORF3, a known suppressor of NMD. This initial characterization lays down a road map for future investigations into these novel virus-like RNAs.