Title: RNA sequence to structure analysis from comprehensive pairwise mutagenesis of multiple self-cleaving ribozymes
Self-cleaving ribozymes are RNA molecules that catalyze the cleavage of their own phosphodiester backbones. These ribozymes are found in all domains of life and are also a tool for biotechnical and synthetic biology applications. Self-cleaving ribozymes are also an important model of sequence-to-function relationships for RNA because their small size simplifies synthesis of genetic variants and self-cleaving activity is an accessible readout of the functional consequence of a mutation. Here, we used a high-throughput experimental approach to determine the relative activity for every possible single and double mutant of five self-cleaving ribozymes. From these data, we comprehensively identified non-additive effects between pairs of mutations (epistasis) for all five ribozymes. We analyzed how changes in activity and trends in epistasis map to the ribozyme structures. The variety of structures studied provided opportunities to observe several examples of common structural elements, and the data were collected under identical experimental conditions to enable direct comparison. Heatmap-based visualization of the data revealed patterns indicating structural features of the ribozymes including paired regions, unpaired loops, non-canonical structures, and tertiary structural contacts. The data also revealed signatures of functionally critical nucleotides involved in catalysis. The results demonstrate that the data sets provide structural information similar to chemical or enzymatic probing experiments, but with additional quantitative functional information. The large-scale data sets can be used for models predicting structure and function and for efforts to engineer self-cleaving ribozymes.
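To make "non-additive effects between pairs of mutations" concrete, the following is a minimal sketch of one common way to score pairwise epistasis. It assumes a multiplicative null model on activities measured relative to wild type (wild type = 1.0); the abstract does not state the exact formula used in the study, and the function name `pairwise_epistasis` and the example values are hypothetical.

```python
import math


def pairwise_epistasis(w_a, w_b, w_ab):
    """Return log10 of the observed double-mutant activity over its expectation.

    All activities are relative to wild type (wild type = 1.0) and must be > 0.
    """
    if min(w_a, w_b, w_ab) <= 0:
        raise ValueError("relative activities must be positive")
    # Assumed null model: independent mutations combine multiplicatively.
    expected = w_a * w_b
    return math.log10(w_ab / expected)


# Hypothetical values, not taken from the study's data.
print(pairwise_epistasis(0.5, 0.5, 0.25))  # double mutant matches the expectation
print(pairwise_epistasis(0.1, 0.1, 0.9))   # double mutant far more active than expected
```

Under this assumed model, a score near zero means the two mutations act independently, a positive score means the double mutant is more active than expected (as when a second mutation restores a disrupted base pair), and a negative score means it is less active than expected.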
Award ID(s):
1826801
NSF-PAR ID:
10408020
Journal Name:
eLife
Volume:
12
ISSN:
2050-084X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ribozymes are RNA molecules that catalyze biochemical reactions. Self-cleaving ribozymes are a common naturally occurring class of ribozymes that catalyze site-specific cleavage of their own phosphodiester backbone. In addition to their natural functions, self-cleaving ribozymes have been used to engineer control of gene expression because they can be designed to alter RNA processing and stability. However, the rational design of ribozyme activity remains challenging, and many ribozyme-based systems are engineered or improved by random mutagenesis and selection (in vitro evolution). Improving a ribozyme-based system often requires several mutations to achieve the desired function, but extensive pairwise and higher-order epistasis prevents a simple prediction of the effect of multiple mutations that is needed for rational design. Recently, high-throughput sequencing-based approaches have produced data sets on the effects of numerous mutations in different ribozymes (RNA fitness landscapes). Here we used such high-throughput experimental data from variants of the CPEB3 self-cleaving ribozyme to train a predictive model through machine learning approaches. We trained models using either a random forest or long short-term memory (LSTM) recurrent neural network approach. We found that models trained on a comprehensive set of pairwise mutant data could predict active sequences at higher mutational distances, but the correlation between predicted and experimentally observed self-cleavage activity decreased with increasing mutational distance. Adding sequences with increasingly higher numbers of mutations to the training data improved the correlation at increasing mutational distances. Systematically reducing the size of the training data set suggests that a wide distribution of ribozyme activity may be the key to accurate predictions. 
Because the model predictions are based only on sequence and activity data, the results demonstrate that this machine learning approach allows readily obtainable experimental data to be used for RNA design efforts even for RNA molecules with unknown structures. The accurate prediction of RNA functions will enable a more comprehensive understanding of RNA fitness landscapes for studying evolution and for guiding RNA-based engineering efforts. 
  2. Various self-cleaving ribozymes appearing in nature catalyze the sequence-specific intramolecular cleavage of RNA and can be engineered to catalyze cleavage of appropriate substrates in an intermolecular fashion, thus acting as true catalysts. The mechanisms of the small, self-cleaving ribozymes have been extensively studied and reviewed previously. Self-cleaving ribozymes can possess high catalytic activity and high substrate specificity; however, substrate specificity is also engineerable within the constraints of the ribozyme structure. While these ribozymes share a common fundamental catalytic mechanism, each ribozyme family has a unique overall architecture and active site organization, indicating that several distinct structures yield this chemical activity. The multitude of catalytic structures, combined with some flexibility in substrate specificity within each family, suggests that such catalytic RNAs, taken together, could access a wide variety of substrates. Here, we give an overview of 10 classes of self-cleaving ribozymes and capture what is understood about their substrate specificity and synthetic applications. Evolution of these ribozymes in an RNA world might be characterized by the emergence of a new ribozyme family followed by rapid adaptation or diversification for specific substrates.
  3. Zhang, Jianzhi (Ed.)
    Fitness landscapes of protein and RNA molecules can be studied experimentally using high-throughput techniques to measure the functional effects of numerous combinations of mutations. The rugged topography of these molecular fitness landscapes is important for understanding and predicting natural and experimental evolution. Mutational effects are also dependent upon environmental conditions, but the effects of environmental changes on fitness landscapes remain poorly understood. Here, we investigate the changes to the fitness landscape of a catalytic RNA molecule while changing a single environmental variable that is critical for RNA structure and function. Using high-throughput sequencing of in vitro selections, we mapped a fitness landscape of the Azoarcus group I ribozyme under eight different concentrations of magnesium ions (1–48 mM MgCl2). The data revealed the magnesium dependence of 16,384 mutational neighbors, and from this, we investigated the magnesium-induced changes to the topography of the fitness landscape. The results showed that increasing magnesium concentration improved the relative fitness of sequences at higher mutational distances while also reducing the ruggedness of the mutational trajectories on the landscape. As a result, as magnesium concentration was increased, simulated populations evolved toward higher fitness faster. Curve-fitting of the magnesium dependence of individual ribozymes demonstrated that deep sequencing of in vitro reactions can be used to evaluate the structural stability of thousands of sequences in parallel. Overall, the results highlight how environmental changes that stabilize structures can also alter the ruggedness of fitness landscapes and alter evolutionary processes.
  4. Harris, Kelley (Ed.)
    Self-cleaving ribozymes are genetic elements found in all domains of life, but their evolution remains poorly understood. A ribozyme located in the second intron of the cytoplasmic polyadenylation element-binding protein 3 gene (CPEB3) shows high sequence conservation in mammals, but little is known about the functional conservation of self-cleaving ribozyme activity across the mammalian tree of life or during the course of mammalian evolution. Here, we use a phylogenetic approach to design a mutational library and a deep sequencing assay to evaluate the in vitro self-cleavage activity of numerous extant and resurrected CPEB3 ribozymes that span over 100 My of mammalian evolution. We found that the predicted sequence at the divergence of placentals and marsupials is highly active, and this activity has been conserved in most lineages. A reduction in ribozyme activity appears to have occurred multiple times throughout the mammalian tree of life. The in vitro activity data allow an evaluation of the predicted mutational pathways leading to extant ribozymes as well as the mutational landscape surrounding these ribozymes. The results demonstrate that in addition to sequence conservation, the self-cleavage activity of the CPEB3 ribozyme has persisted over millions of years of mammalian evolution.
  5. 2938 Using a Human Liver Tissue Equivalent (hLTE) Platform to Define the Functional Impact of Liver-Directed AAV Gene Therapy
    63rd ASH Annual Meeting and Exposition, December 11-14, 2021, Georgia World Congress Center, Atlanta, GA
    Program: Oral and Poster Abstracts
    Session: 801. Gene Therapies: Poster II
    Hematology Disease Topics & Pathways: Bleeding and Clotting, Biological, Translational Research, Hemophilia, Genetic Disorders, Clinically Relevant, Diseases, Gene Therapy, Therapies
    Sunday, December 12, 2021, 6:00 PM-8:00 PM
    Ritu M Ramamurthy¹*, Wen Ting Zheng²*, Sunil George, PhD¹*, Meimei Wan¹*, Yu Zhou, PhD¹*, Baisong Lu, PhD¹*, Colin E Bishop, PhD¹*, Anthony Atala, M.D.¹*, Christopher D Porada, PhD¹* and M. Graca Almeida-Porada, MD³
    ¹Fetal Research and Therapy Program, Wake Forest Institute for Regenerative Medicine, Winston-Salem, NC; ²Massachusetts Institute of Technology, Cambridge, MA; ³Fetal Research and Therapy Program, Wake Forest Institute for Regenerative Medicine, Winston-Salem, NC
    Clinical trials employing AAV vectors for hemophilia A have been hindered by unanticipated immunological and/or inflammatory responses in some of the patients. Also, these trials have often yielded lower levels of transgene expression than were expected based upon preclinical studies, highlighting the poor correlation between the transduction efficiency observed in traditional 2D cultures of primary cells in vitro, and that observed in those same cell types in vivo. It has also been recognized that there are marked species-specific differences in AAV-vector tropism, raising the critical question of the accuracy with which various animal models will likely predict tropism/vector transduction efficiency, and eventual treatment success in humans. Human liver tissue equivalents (hLTEs) are composed of the major cell types in the liver in physiologically relevant frequencies and possess the ability to recapitulate the biology and function of native human liver. 
Here, we hypothesize that hLTEs can be used as a better model to predict the efficacy and safety of AAV gene therapy in humans. We fabricated hLTEs using 75% hepatocytes, 10% stellate cells, 10% Kupffer cells, and 5% liver sinusoid-derived endothelial cells in 96-well Elplasia plates with 79 microwells per well. hLTEs were transduced at an MOI of 10⁵ vg/cell, on the day of fabrication, with the clinically relevant serotypes AAV5 (hLTE-5) or AAV3b (hLTE-3b), both encoding a GFP reporter. After 4 days of self-aggregation, a live/dead assay was performed to confirm viability. Non-transduced hLTEs served as negative controls (hLTE(-)), and hLTEs exposed to 20 mM acetaminophen were used as positive controls for liver inflammation/damage. The Incucyte® Live-Cell Imaging system was used to track the aggregation and GFP expression of hLTEs. Over the course of the next 5 days, media was collected to determine hepatic functionality, RNA was isolated to assess dysregulation of genes involved in inflammation and fibrosis, DNA was isolated to determine whether AAV vectors integrate into the genome of human hepatocytes and, if so, to define the frequency at which this occurs and the genomic loci of integration, and hLTEs were fixed and processed at appropriate times for histological analyses and transmission electron microscopy (TEM). TEM analysis revealed that all groups exhibited microvilli and bile-canaliculus-like structures, demonstrating the formation of a rudimentary biliary system and, more importantly, proving that hLTEs resemble native liver structure. Incucyte® imaging showed that AAV5 and AAV3b transduction impaired formation of hLTEs (57.57 ± 2.42 and 24.57 ± 4.01 spheroids/well, respectively) in comparison with hLTE(-) (74.86 ± 3.8 spheroids/well). Quantification of GFP expression demonstrated that AAV5 yielded the most efficient transduction of hLTEs (fold change in GFP expression compared to control: 2.73 ± 0.09 and 1.19 ± 0.03 for hLTE-5 and hLTE-3b, respectively). 
Chromogenic assays showed decreased urea production in cell culture supernatants of AAV-transduced groups compared to the non-transduced hLTEs on days 6 and 10 of culture, demonstrating decreased hepatocyte functionality. However, ALT and AST levels were similar in all groups. On day 10, hLTEs were either used for RNA isolation or fixed in 4% PFA and processed for histology. Masson’s Trichrome and Alcian Blue/Sirius Red staining was performed to detect fibrosis, which was then quantified using ImageJ. These analyses showed no significant increase in fibrosis in either hLTE-5 or hLTE-3b compared to hLTE(-). Nevertheless, the RT² PCR Array for Human Fibrosis detected dysregulation of several genes involved in fibrosis/inflammation in both hLTE-5 and hLTE-3b (16/84 and 26/84, respectively). In conclusion, data collected thus far show successful recapitulation of native liver biology and demonstrate that AAV5 transduces hLTEs more efficiently than AAV3b. However, impaired self-aggregation and decreased hepatocyte functionality were observed in both AAV-transduced groups. Studies to address the incidence and location(s) of AAV integration are ongoing. We have thus shown that the hLTE system can provide critical new knowledge regarding the efficacy and safety of AAV gene therapy in the human liver. Disclosures: No relevant conflicts of interest to declare.