Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
The relationship between genotype and phenotype, or the fitness landscape, is the foundation of genetic engineering and evolution. However, mapping fitness landscapes poses a major technical challenge due to the amount of quantifiable data that is required. Catalytic RNA is a special topic in the study of fitness landscapes due to its relatively small sequence space combined with its importance in synthetic biology. The combination of in vitro selection and high-throughput sequencing has recently provided empirical maps of both complete and local RNA fitness landscapes, but the astronomical size of sequence space limits purely experimental investigations. Next steps are likely to involve data-driven interpolation and extrapolation over sequence space using various machine learning techniques. We discuss recent progress in understanding RNA fitness landscapes, particularly with respect to protocells and machine representations of RNA. The confluence of technical advances may significantly impact synthetic biology in the near future.more » « less
-
Abstract Modern life is essentially homochiral, containing D-sugars in nucleic acid backbones and L-amino acids in proteins. Since coded proteins are theorized to have developed from a prebiotic RNA World, the homochirality of L-amino acids observed in all known life presumably resulted from chiral transfer from a homochiral D-RNA World. This transfer would have been mediated by aminoacyl-RNAs defining the genetic code. Previous work on aminoacyl transfer using tRNA mimics has suggested that aminoacylation using D-RNA may be inherently biased toward reactivity with L-amino acids, implying a deterministic path from a D-RNA World to L-proteins. Using a model system of self-aminoacylating D-ribozymes and epimerizable activated amino acid analogs, we test the chiral selectivity of 15 ribozymes derived from an exhaustive search of sequence space. All of the ribozymes exhibit detectable selectivity, and a substantial fraction react preferentially to produce the D-enantiomer of the product. Furthermore, chiral preference is conserved within sequence families. These results are consistent with the transfer of chiral information from RNA to proteins but do not support an intrinsic bias of D-RNA for L-amino acids. Different aminoacylation structures result in different directions of chiral selectivity, such that L-proteins need not emerge from a D-RNA World.more » « less
-
The identification of catalytic RNAs is typically achieved through primarily experimental means. However, only a small fraction of sequence space can be analyzed even with high-throughput techniques. Methods to extrapolate from a limited data set to predict additional ribozyme sequences, particularly in a human-interpretable fashion, could be useful both for designing new functional RNAs and for generating greater understanding about a ribozyme fitness landscape. Using information theory, we express the effects of epistasis (i.e., deviations from additivity) on a ribozyme. This representation was incorporated into a simple model of the epistatic fitness landscape, which identified potentially exploitable combinations of mutations. We used this model to theoretically predict mutants of high activity for a self-aminoacylating ribozyme, identifying potentially active triple and quadruple mutants beyond the experimental data set of single and double mutants. The predictions were validated experimentally, with nine out of nine sequences being accurately predicted to have high activity. This set of sequences included mutants that form a previously unknown evolutionary ‘bridge’ between two ribozyme families that share a common motif. Individual steps in the method could be examined, understood, and guided by a human, combining interpretability and performance in a simple model to predict ribozyme sequences by extrapolation.more » « less
An official website of the United States government
