skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Evidence for the emergence of β-trefoils by ‘Peptide Budding’ from an IgG-like β-sandwich
As sequence and structure comparison algorithms gain sensitivity, the intrinsic interconnectedness of the protein universe has become increasingly apparent. Despite this general trend, β-trefoils have emerged as an uncommon counterexample: They are an isolated protein lineage for which few, if any, sequence or structure associations to other lineages have been identified. If β-trefoils are, in fact, remote islands in sequence-structure space, it implies that the oligomerizing peptide that founded the β-trefoil lineage itself arose de novo . To better understand β-trefoil evolution, and to probe the limits of fragment sharing across the protein universe, we identified both ‘β-trefoil bridging themes’ (evolutionarily-related sequence segments) and ‘β-trefoil-like motifs’ (structure motifs with a hallmark feature of the β-trefoil architecture) in multiple, ostensibly unrelated, protein lineages. The success of the present approach stems, in part, from considering β-trefoil sequence segments or structure motifs rather than the β-trefoil architecture as a whole, as has been done previously. The newly uncovered inter-lineage connections presented here suggest a novel hypothesis about the origins of the β-trefoil fold itself–namely, that it is a derived fold formed by ‘budding’ from an Immunoglobulin-like β-sandwich protein. These results demonstrate how the evolution of a folded domain from a peptide need not be a signature of antiquity and underpin an emerging truth: few protein lineages escape nature’s sewing table.  more » « less
Award ID(s):
1724300
PAR ID:
10328826
Author(s) / Creator(s):
; ;
Editor(s):
Elofsson, Arne
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
18
Issue:
2
ISSN:
1553-7358
Page Range / eLocation ID:
e1009833
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ubiquitin and ubiquitin like proteins (UBLs) play key roles in eukaryotes. These proteins are attached to their target proteins through an E1-E2-E3 cascade and modify the functions of these proteins. Since the discovery of ubiquitin, several UBLs have been identified, including Nedd8, SUMO, ISG15, and Atg8. Ubiquitin and UBLs share a similar three-dimensional structure: β -grasp fold and an X-X-[R/A/E/K]-X-X-[G/X]-G motif at the C-terminus. We have previously reported that ubiquitin, Nedd8, and SUMO mimicking peptides which all contain the conserved motif X-X-[R/A/E/K]-X-X-[G/X]-G still retained their reactivity toward their corresponding E1, E2, and E3 enzymes. In our current study, we investigated whether such C-terminal peptides could still be transferred onto related pathway enzymes to probe the function of these enzymes when they are fused with a protein. By bioinformatic search of protein databases, we selected eight proteins carrying the X-X-[R/A/E/K]-X-X-[G/X]-G motif at the C-terminus of the β -grasp fold. We synthesized the C-terminal sequences of these candidates as short peptides and found that three of them showed significant reactivity with the ubiquitin E1 enzyme Ube1. We next fused the three reactive short peptides to three different protein frames, including their respective native protein frames, a ubiquitin frame and a peptidyl carrier protein (PCP) frame, and measured the reactivities of these peptide-fused proteins with Ube1. Peptide-fused proteins on ubiquitin and PCP frames showed obvious reactivity with Ube1. However, when we measured E2 UbcH7 transfer, we found that the PCP-peptide fusions lost their reactivity with UbcH7. Taken together, these results suggested that the recognition of E2 enzymes with peptide-fused proteins depended not only on the C-terminal sequences of the ubiquitin-mimicking peptides, but also on the overall structures of the protein frames. 
    more » « less
  2. Peptide backbone amide substitution can dramatically alter the conformational and physiochemical properties of native sequences. Although uncommon relative to N -alkyl substituents, peptides harboring main-chain N -hydroxy groups exhibit unique conformational preferences and biological activities. Here, we describe a versatile method to prepare N -hydroxy peptide on solid support and evaluate the impact of backbone N -hydroxylation on secondary structure stability. Based on previous work demonstrating the β-sheet-stabilizing effect of α-hydrazino acids, we carried out an analogous study with N -hydroxy-α-amino acids using a model β-hairpin fold. In contrast to N -methyl substituents, backbone N -hydroxy groups are accommodated in the β-strand region of the hairpin without energetic penalty. An enhancement in β-hairpin stability was observed for a di- N -hydroxylated variant. Our results facilitate access to this class of peptide derivatives and inform the use of backbone N -hydroxylation as a tool in the design of constrained peptidomimetics. 
    more » « less
  3. Marshall, Christopher W. (Ed.)
    ABSTRACT Identification of genes encoding β-lactamases (BLs) from short-read sequences remains challenging due to the high frequency of shared amino acid functional domains and motifs in proteins encoded by BL genes and related non-BL gene sequences. Divergent BL homologs can be frequently missed during similarity searches, which has important practical consequences for monitoring antibiotic resistance. To address this limitation, we built ROCker models that targeted broad classes (e.g., class A, B, C, and D) and individual families (e.g., TEM) of BLs and challenged them with mock 150-bp- and 250-bp-read data sets of known composition. ROCker identifies most-discriminant bit score thresholds in sliding windows along the sequence of the target protein sequence and hence can account for nondiscriminative domains shared by unrelated proteins. BL ROCker models showed a 0% false-positive rate (FPR), a 0% to 4% false-negative rate (FNR), and an up-to-50-fold-higher F1 score [2 × precision × recall/(precision + recall)] compared to alternative methods, such as similarity searches using BLASTx with various e-value thresholds and BL hidden Markov models, or tools like DeepARG, ShortBRED, and AMRFinder. The ROCker models and the underlying protein sequence reference data sets and phylogenetic trees for read placement are freely available through http://enve-omics.ce.gatech.edu/data/rocker-bla . Application of these BL ROCker models to metagenomics, metatranscriptomics, and high-throughput PCR gene amplicon data should facilitate the reliable detection and quantification of BL variants encoded by environmental or clinical isolates and microbiomes and more accurate assessment of the associated public health risk, compared to the current practice. IMPORTANCE Resistance genes encoding β-lactamases (BLs) confer resistance to the widely prescribed antibiotic class β-lactams. Therefore, it is important to assess the prevalence of BL genes in clinical or environmental samples for monitoring the spreading of these genes into pathogens and estimating public health risk. However, detecting BLs in short-read sequence data is technically challenging. Our ROCker model-based bioinformatics approach showcases the reliable detection and typing of BLs in complex data sets and thus contributes toward solving an important problem in antibiotic resistance surveillance. The ROCker models developed substantially expand the toolbox for monitoring antibiotic resistance in clinical or environmental settings. 
    more » « less
  4. The genes that encode the α- and β-chain subunits of vertebrate hemoglobin have served as a model system for elucidating general principles of gene family evolution, but little is known about patterns of evolution in amniotes other than mammals and birds. Here, we report a comparative genomic analysis of the α- and β-globin gene clusters in sauropsids (archosaurs and nonavian reptiles). The objectives were to characterize changes in the size and membership composition of the α- and β-globin gene families within and among the major sauropsid lineages, to reconstruct the evolutionary history of the sauropsid α- and β-globin genes, to resolve orthologous relationships, and to reconstruct evolutionary changes in the developmental regulation of gene expression. Our comparisons revealed contrasting patterns of evolution in the unlinked α- and β-globin gene clusters. In the α-globin gene cluster, which has remained in the ancestral chromosomal location, evolutionary changes in gene content are attributable to the differential retention of paralogous gene copies that were present in the common ancestor of tetrapods. In the β-globin gene cluster, which was translocated to a new chromosomal location, evolutionary changes in gene content are attributable to differential gene gains (via lineage-specific duplication events) and gene losses (via lineage-specific deletions and inactivations). Consequently, all major groups of amniotes possess unique repertoires of embryonic and postnatally expressed β-type globin genes that diversified independently in each lineage. These independently derived β-type globins descend from a pair of tandemly linked paralogs in the most recent common ancestor of sauropsids. 
    more » « less
  5. Munson, Mary (Ed.)
    Prior work has identified signal sequences and motifs that are necessary and sufficient to target proteins to specific subcellular regions and organelles such as the plasma membrane, nucleus, endoplasmic reticulum, and mitochondria. In contrast, minimal sequence motifs that are sufficient for Golgi localization remain largely elusive. In this work, we identified a 37–amino acid alternative open reading frame (altORF) within the mRNA of the centromere protein CENP-R. This altORF peptide localizes specifically to the cytoplasmic surface of the Golgi apparatus. Through mutational analysis, we identify a minimal 10–amino acid sequence and a critical cysteine residue that are necessary and sufficient for Golgi localization. Pharmacological perturbations suggest that this peptide undergoes lipid modification to promote its localization. Together, our work defines a minimal sequence that is sufficient for Golgi targeting and provide a valuable Golgi marker for live cell imaging. 
    more » « less