skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The Role of Orthogonality in Genetic Code Expansion
The genetic code defines how information in the genome is translated into protein. Aside from a handful of isolated exceptions, this code is universal. Researchers have developed techniques to artificially expand the genetic code, repurposing codons and translational machinery to incorporate nonstandard amino acids (nsAAs) into proteins. A key challenge for robust genetic code expansion is orthogonality; the engineered machinery used to introduce nsAAs into proteins must co-exist with native translation and gene expression without cross-reactivity or pleiotropy. The issue of orthogonality manifests at several levels, including those of codons, ribosomes, aminoacyl-tRNA synthetases, tRNAs, and elongation factors. In this concept paper, we describe advances in genome recoding, translational engineering and associated challenges rooted in establishing orthogonality needed to expand the genetic code.  more » « less
Award ID(s):
1714860 1716766
PAR ID:
10112261
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Life
Volume:
9
Issue:
3
ISSN:
2075-1729
Page Range / eLocation ID:
58
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Multiple genetic codes developed during the evolution of eukaryotes and bacteria, yet no alternative genetic code is known for archaea. We used proteomics to confirm our prediction that certain archaea consistently incorporate pyrrolysine (Pyl) at TAG codons, supporting an alternative archaeal genetic code that we designate the Pyl code. This genetic code has 62 sense codons encoding 21 amino acids. In contrast to monophyletic genetic code distributions in bacteria, the archaeal Pyl code occurs sporadically, indicating that it arose independently in multiple lineages. We discovered that more than 1800 archaeal proteins contain Pyl, increasing the number of such proteins by two orders of magnitude. Additionally, five Pyl transfer RNA (tRNA) pyrrolysyl–tRNA synthetase pairs from Pyl-code archaea were used to introduce Pyl analogs into proteins inEscherichia coli. 
    more » « less
  2. Abstract Bacillus subtilis is a model gram-positive bacterium, commonly used to explore questions across bacterial cell biology and for industrial uses. To enable greater understanding and control of proteins in B. subtilis , here we report broad and efficient genetic code expansion in B. subtilis by incorporating 20 distinct non-standard amino acids within proteins using 3 different families of genetic code expansion systems and two choices of codons. We use these systems to achieve click-labelling, photo-crosslinking, and translational titration. These tools allow us to demonstrate differences between E. coli and B. subtilis stop codon suppression, validate a predicted protein-protein binding interface, and begin to interrogate properties underlying bacterial cytokinesis by precisely modulating cell division dynamics in vivo. We expect that the establishment of this simple and easily accessible chemical biology system in B. subtilis will help uncover an abundance of biological insights and aid genetic code expansion in other organisms. 
    more » « less
  3. Generating protein conjugates using the bioorthogonal ligation between tetrazines and trans-cyclooctene groups avoids the need to manipulate cysteine amino acids, and the ligation is rapid, site-specific, stoichiometric and allows for labeling of proteins in complex biological environments. Here, we provide a protocol for the expression of conjugation-ready proteins at high yields in Escherichia coli with greater than 95% encoding and labeling fidelity. This protocol focuses on installing the “Tet2” tetrazine amino acid using an optimized genetic code expansion (GCE) machinery system, Tet2 “pAJE-E7”, to direct Tet2 encoding at TAG stop codons in BL21 E. coli strains, enabling reproducible expression of Tet2-proteins that quantitatively react with trans-cyclooctene (TCO) groups within 5 minutes at room temperature and physiological pH. Use of the BL21 derivative B95(DE3) minimizes premature truncation byproducts caused by incomplete suppression of TAG stop codons and this makes it possible to use more diverse protein construct designs. Here, using a superfolder green fluorescent protein construct as an example protein, we describe in detail a four-day process for encoding Tet2 with yields of ~200 mg per liter culture. Additionally, a simple and fast diagnostic gel electrophoretic mobility shift assay to confirm Tet2-Et encoding, and reactivity is described. Finally, strategies to adapt the protocol to alternative proteins of interest and optimize expression yields and reactivity for that protein are discussed. 
    more » « less
  4. Abstract Seed dormancy and germination represent a critical developmental transition that determines plant fitness, yet the contribution of translational regulation to this process remains poorly understood. Here, we used genome-wide ribosome profiling (Ribo-seq) combined with RNA sequencing (RNA-seq) to investigate how translational control shapes the transition from dormancy to germination inArabidopsis thalianaseeds. We analyzed dry dormant seeds, stratified non-dormant seeds, and seeds during early imbibition, enabling simultaneous assessment of transcript abundance and ribosome occupancy. Our analyses reveal that dry seeds harbor an unexpectedly organized translational machinery, with ribosomes pre-positioned at start codons and within coding regions of thousands of stored mRNAs, indicating a poised translational state. Dormancy release and early imbibition triggered extensive gene-specific changes in translational efficiency that were largely uncoupled from transcript abundance, highlighting selective translation as a key regulatory layer. Genes involved in ribosome biogenesis, protein folding, and hormone signaling were preferentially translated during dormancy maintenance, whereas germination-promoting factors showed increased ribosome occupancy following stratification. Global ribosome profiling further uncovered dynamic ribosome pausing at stop codons and pronounced modulation of translation initiation during imbibition.We also identified widespread translation of upstream open reading frames (uORFs) and demonstrated that uORF-mediated repression constitutes a major translational checkpoint during seed imbibition. Functional assays confirmed that uORFs fromMARD1andPAO4repress downstream translationin vivo. Together, our results establish translational regulation as a central mechanism governing seed dormancy and germination, revealing how ribosome positioning and uORF activity fine-tune protein synthesis to control developmental transitions in response to environmental cues. 
    more » « less
  5. Xu, Jianping (Ed.)
    ABSTRACT Mitochondria originated from an ancient bacterial endosymbiont that underwent reductive evolution by gene loss and endosymbiont gene transfer to the nuclear genome. The diversity of mitochondrial genomes published to date has revealed that gene loss and transfer processes are ongoing in many lineages. Most well-studied eukaryotic lineages are represented in mitochondrial genome databases, except for the superphylum Retaria—the lineage comprising Foraminifera and Radiolaria. Using single-cell approaches, we determined two complete mitochondrial genomes of Foraminifera and two nearly complete mitochondrial genomes of radiolarians. We report the complete coding content of an additional 14 foram species. We show that foraminiferan and radiolarian mitochondrial genomes contain a nearly fully overlapping but reduced mitochondrial gene complement compared to other sequenced rhizarians. In contrast to animals and fungi, many protists encode a diverse set of proteins on their mitochondrial genomes, including several ribosomal genes; however, some aerobic eukaryotic lineages (euglenids, myzozoans, and chlamydomonas-like algae) have reduced mitochondrial gene content and lack all ribosomal genes. Similar to these reduced outliers, we show that retarian mitochondrial genomes lack ribosomal protein and tRNA genes, contain truncated and divergent small and large rRNA genes, and contain only 14 or 15 protein-coding genes, including nad1 , - 3 , - 4 , - 4L , - 5 , and - 7 , cob , cox1 , - 2 , and - 3 , and atp1 , - 6 , and - 9 , with forams and radiolarians additionally carrying nad2 and nad6 , respectively. In radiolarian mitogenomes, a noncanonical genetic code was identified in which all three stop codons encode amino acids. Collectively, these results add to our understanding of mitochondrial genome evolution and fill in one of the last major gaps in mitochondrial sequence databases. IMPORTANCE We present the reduced mitochondrial genomes of Retaria, the rhizarian lineage comprising the phyla Foraminifera and Radiolaria. By applying single-cell genomic approaches, we found that foraminiferan and radiolarian mitochondrial genomes contain an overlapping but reduced mitochondrial gene complement compared to other sequenced rhizarians. An alternative genetic code was identified in radiolarian mitogenomes in which all three stop codons encode amino acids. Collectively, these results shed light on the divergent nature of the mitochondrial genomes from an ecologically important group, warranting further questions into the biological underpinnings of gene content variability and genetic code variation between mitochondrial genomes. 
    more » « less