skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on December 24, 2025

Title: Order of amino acid recruitment into the genetic code resolved by last universal common ancestor’s protein domains
The current “consensus” order in which amino acids were added to the genetic code is based on potentially biased criteria, such as the absence of sulfur-containing amino acids from the Urey–Miller experiment which lacked sulfur. More broadly, abiotic abundance might not reflect biotic abundance in the organisms in which the genetic code evolved. Here, we instead identify which protein domains date to the last universal common ancestor (LUCA) and then infer the order of recruitment from deviations of their ancestrally reconstructed amino acid frequencies from the still-ancient post-LUCA controls. We find that smaller amino acids were added to the code earlier, with no additional predictive power in the previous consensus order. Metal-binding (cysteine and histidine) and sulfur-containing (cysteine and methionine) amino acids were added to the genetic code much earlier than previously thought. Methionine and histidine were added to the code earlier than expected from their molecular weights and glutamine later. Early methionine availability is compatible with inferred early use of S-adenosylmethionine and early histidine with its purine-like structure and the demand for metal binding. Even more ancient protein sequences—those that had already diversified into multiple distinct copies prior to LUCA—have significantly higher frequencies of aromatic amino acids (tryptophan, tyrosine, phenylalanine, and histidine) and lower frequencies of valine and glutamic acid than single-copy LUCA sequences. If at least some of these sequences predate the current code, then their distinct enrichment patterns provide hints about earlier, alternative genetic codes.  more » « less
Award ID(s):
2333243
PAR ID:
10621340
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
National Academy of Sciences
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
121
Issue:
52
ISSN:
0027-8424
Page Range / eLocation ID:
e2410311121
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. RationaleSulfur isotope analysis of organic sulfur‐containing molecules has previously been hindered by challenging preparatory chemistry and analytical requirements for large sample sizes. The natural‐abundance sulfur isotopic compositions of the sulfur‐containing amino acids, cysteine and methionine, have therefore not yet been investigated despite potential utility in biomedicine, ecology, oceanography, biogeochemistry, and other fields. MethodsCysteine and methionine were subjected to hot acid hydrolysis followed by quantitative oxidation in performic acid to yield cysteic acid and methionine sulfone. These stable, oxidized products were then separated by reversed‐phase high‐performance liquid chromatography (HPLC) and verified via offline liquid chromatography/mass spectrometry (LC/MS). The sulfur isotope ratios (δ34S values) of purified analytes were then measured via combustion elemental analyzer coupled to isotope ratio mass spectrometry (EA/IRMS). The EA was equipped with a temperature‐ramped chromatographic column and programmable helium carrier flow rates. ResultsOn‐column focusing of SO2in the EA/IRMS system, combined with reduced He carrier flow during elution, greatly improved sensitivity, allowing precise (0.1–0.3‰ 1 s.d.) δ34S measurements of 1 to 10 μg sulfur. We validated that our method for purification of cysteine and methionine was negligibly fractionating using amino acid and protein standards. Proof‐of‐concept measurements of fish muscle tissue and bacteria demonstrated differences up to 4‰ between the δ34S values of cysteine and methionine that can be connected to biosynthetic pathways. ConclusionsWe have developed a sensitive, precise method for measuring the natural‐abundance sulfur isotopic compositions of cysteine and methionine isolated from biological samples. This capability opens up diverse applications of sulfur isotopes in amino acids and proteins, from use as a tracer in organisms and the environment, to fundamental aspects of metabolism and biosynthesis. 
    more » « less
  2. Zinc finger (ZF) proteins are proteins that use zinc as a structural cofactor. The common feature among all ZFs is that they contain repeats of four cysteine and/or histidine residues within their primary amino acid sequence. With the explosion of genome sequencing in the early 2000s, a large number of proteins were annotated as ZFs based solely upon amino acid sequence. As these proteins began to be characterizedexperimentally, it was discovered that some of these proteins contain iron–sulfur sites either in place of or in addition to zinc. Here, we describe methods to isolate and characterize one such ZF protein, cleavage and polyadenylation specificity factor 30 (CPSF3O) with respect to its metal-loading and RNA-binding activity. 
    more » « less
  3. Abstract In this procedure, we describe a high‐throughput absolute quantification protocol for the protein‐bound sulfur amino acids, cysteine (Cys) and methionine (Met), from plant seeds. This procedure consists of performic acid oxidation that transforms bound Cys into cysteic acid (CysA) and bound Met into methionine sulfone (MetS) followed by acid hydrolysis. The absolute quantification step is performed by multiple reaction monitoring tandem mass spectrometry (LC‐MS/MS). The approach facilitates the analysis of a few hundred samples per week by using a 96‐well plate extraction setup. Importantly, the method uses only ∼4 mg of tissue per sample and uses the common acid hydrolysis protocol, followed by water extraction that includes DL‐Ser‐d3 and L‐Met‐d3 as internal standards to enable the quantification of the absolute levels of the protein‐bound Cys and Met with high precision, accuracy, and reproducibility. The protocol described herein has been optimized for seed samples from Arabidopsis thaliana , Glycine max , and Zea mays but could be applied to other plant tissues. © 2023 Wiley Periodicals LLC. Basic Protocol : Analysis of protein‐bound cysteine and methionine from seeds 
    more » « less
  4. Peracetic acid (PAA) is a sanitizer with increasing use in food, medical and water treatment industries. Amino acids are important components in targeted foods for PAA treatment and ubiquitous in natural waterbodies and wastewater effluents as the primary form of dissolved organic nitrogen. To better understand the possible reactions, this work investigated the reaction kinetics and transformation pathways of selected amino acids towards PAA. Experimental results demonstrated that most amino acids showed sluggish reactivity to PAA except cysteine (CYS), methionine (MET), and histidine (HIS). CYS showed the highest reactivity with a very rapid reaction rate. Reactions of MET and HIS with PAA followed second-order kinetics with rate constants of 4.6 ± 0.2, and 1.8 ± 0.1 M−1s−1 at pH 7, respectively. The reactions were faster at pH 5 and 7 than at pH 9 due to PAA speciation. Low concentrations of H2O2 coexistent with PAA contributed little to the oxidation of amino acids. The primary oxidation products of amino acids with PAA were [O] addition compounds on the reactive sites at thiol, thioether and imidazole groups. Theoretical calculations were applied to predict the reactivity and regioselectivity of PAA electrophilic attacks on amino acids and improved mechanistic understanding. As an oxidative disinfectant, the reaction of PAA with organics to form byproducts is inevitable; however, this study shows that PAA exhibits lower and more selective reactivity towards biomolecules such as amino acids than other common disinfectants, causing less concern of toxic disinfection byproducts. This attribute may allow greater stability and more targeted actions of PAA in various applications. 
    more » « less
  5. Thermoflexus hugenholtzii JAD2 T , the only cultured representative of the Chloroflexota order Thermoflexales , is abundant in Great Boiling Spring (GBS), NV, United States, and close relatives inhabit geothermal systems globally. However, no defined medium exists for T. hugenholtzii JAD2 T and no single carbon source is known to support its growth, leaving key knowledge gaps in its metabolism and nutritional needs. Here, we report comparative genomic analysis of the draft genome of T. hugenholtzii JAD2 T and eight closely related metagenome-assembled genomes (MAGs) from geothermal sites in China, Japan, and the United States, representing “ Candidatus Thermoflexus japonica,” “ Candidatus Thermoflexus tengchongensis,” and “ Candidatus Thermoflexus sinensis.” Genomics was integrated with targeted exometabolomics and 13 C metabolic probing of T. hugenholtzii . The Thermoflexus genomes each code for complete central carbon metabolic pathways and an unusually high abundance and diversity of peptidases, particularly Metallo- and Serine peptidase families, along with ABC transporters for peptides and some amino acids. The T. hugenholtzii JAD2 T exometabolome provided evidence of extracellular proteolytic activity based on the accumulation of free amino acids. However, several neutral and polar amino acids appear not to be utilized, based on their accumulation in the medium and the lack of annotated transporters. Adenine and adenosine were scavenged, and thymine and nicotinic acid were released, suggesting interdependency with other organisms in situ . Metabolic probing of T. hugenholtzii JAD2 T using 13 C-labeled compounds provided evidence of oxidation of glucose, pyruvate, cysteine, and citrate, and functioning glycolytic, tricarboxylic acid (TCA), and oxidative pentose-phosphate pathways (PPPs). However, differential use of position-specific 13 C-labeled compounds showed that glycolysis and the TCA cycle were uncoupled. Thus, despite the high abundance of Thermoflexus in sediments of some geothermal systems, they appear to be highly focused on chemoorganotrophy, particularly protein degradation, and may interact extensively with other microorganisms in situ . 
    more » « less