skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Comparative Analysis and Ancestral Sequence Reconstruction of Bacterial Sortase Family Proteins Generates Functional Ancestral Mutants with Different Sequence Specificities
Gram-positive bacteria are some of the earliest known life forms, diverging from gram-negative bacteria 2 billion years ago. These organisms utilize sortase enzymes to attach proteins to their peptidoglycan cell wall, a structural feature that distinguishes the two types of bacteria. The transpeptidase activity of sortases make them an important tool in protein engineering applications, e.g., in sortase-mediated ligations or sortagging. However, due to relatively low catalytic efficiency, there are ongoing efforts to create better sortase variants for these uses. Here, we use bioinformatics tools, principal component analysis and ancestral sequence reconstruction, in combination with protein biochemistry, to analyze natural sequence variation in these enzymes. Principal component analysis on the sortase superfamily distinguishes previously described classes and identifies regions of relatively high sequence variation in structurally-conserved loops within each sortase family, including those near the active site. Using ancestral sequence reconstruction, we determined sequences of ancestral Staphylococcus and Streptococcus Class A sortase proteins. Enzyme assays revealed that the ancestral Streptococcus enzyme is relatively active and shares similar sequence variation with other Class A Streptococcus sortases. Taken together, we highlight how natural sequence variation can be utilized to investigate this important protein family, arguing that these and similar techniques may be used to discover or design sortases with increased catalytic efficiency and/or selectivity for sortase-mediated ligation experiments.  more » « less
Award ID(s):
2044958
PAR ID:
10348985
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Bacteria
Volume:
1
Issue:
2
ISSN:
2674-1334
Page Range / eLocation ID:
121 to 135
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Bacterial sortases are a family of cysteine transpeptidases in Gram‐positive bacteria of which sortase A (SrtA) enzymes are responsible for ligating proteins to the peptidoglycan layer of the cell surface. Engineered versions of sortases are also used in sortase‐mediated ligation (SML) strategies for a variety of protein engineering applications. Although a versatile tool, substrate recognition byStaphylococcus aureusSrtA (saSrtA), the most commonly utilized enzyme in SML, is stringent and relies on an LPXTG pentapeptide motif. Previous structural studies revealed that the requirement of a glycine in the binding motif may be due to potential steric hindrance of amino acids possessing a β‐carbon by W194, a tryptophan located in the β7‐β8 loop of the enzyme. Here, we measured the effect of seven single point mutants of W194 (A, D, F, G, N, S, Y) saSrtA using a FRET‐based activity assay. We found that while the LPXTG motif remains a requirement for initial proteolytic cleavage, the nucleophile specificity of our variants is altered. In particular, W194A and W194S saSrtA recognize a D‐Ala nucleophile and are able to perform ligation reactions. Notably, an LPXT(D‐Ala) peptide was not cleaved by either mutant enzyme. We hypothesize that these variants may potentially be utilized to develop an irreversible sortase‐mediated reaction. Taken together, this experiment reveals new insight into sortase specificity and possible future SML strategies. 
    more » « less
  2. Studies of enzymes in modern-day plants have documented the diversity of metabolic activities retained by species today but only provide limited insight into how those properties evolved. Ancestral sequence reconstruction (ASR) is an approach that provides statistical estimates of ancient plant enzyme sequences which can then be resurrected to test hypotheses about the evolution of catalytic activities and pathway assembly. Here, I review the insights that have been obtained using ASR to study plant metabolism and highlight important methodological aspects. Overall, studies of resurrected plant enzymes show that (i) exaptation is widespread such that even low or undetectable levels of ancestral activity with a substrate can later become the apparent primary activity of descendant enzymes, (ii) intramolecular epistasis may or may not limit evolutionary paths towards catalytic or substrate preference switches, and (iii) ancient pathway flux often differs from modern-day metabolic networks. These and other insights gained from ASR would not have been possible using only modern-day sequences. Future ASR studies characterizing entire ancestral metabolic networks as well as those that link ancient structures with enzymatic properties should continue to provide novel insights into how the chemical diversity of plants evolved. This article is part of the theme issue ‘The evolution of plant metabolism’. 
    more » « less
  3. Abstract TEM-1 β-lactamase degrades β-lactam antibiotics with a strong preference for penicillins. Sequence reconstruction studies indicate that it evolved from ancestral enzymes that degraded a variety of β-lactam antibiotics with moderate efficiency. This generalist to specialist conversion involved more than 100 mutational changes, but conserved fold and catalytic residues, suggesting a role for dynamics in enzyme evolution. Here, we develop a conformational dynamics computational approach to rationally mold a protein flexibility profile on the basis of a hinge-shift mechanism. By deliberately weighting and altering the conformational dynamics of a putative Precambrian β-lactamase, we engineer enzyme specificity that mimics the modern TEM-1 β-lactamase with only 21 amino acid replacements. Our conformational dynamics design thus re-enacts the evolutionary process and provides a rational allosteric approach for manipulating function while conserving the enzyme active site. 
    more » « less
  4. dos Reis, Mario (Ed.)
    Abstract Ancestral sequence reconstruction (ASR) uses an alignment of extant protein sequences, a phylogeny describing the history of the protein family and a model of the molecular-evolutionary process to infer the sequences of ancient proteins, allowing researchers to directly investigate the impact of sequence evolution on protein structure and function. Like all statistical inferences, ASR can be sensitive to violations of its underlying assumptions. Previous studies have shown that, whereas phylogenetic uncertainty has only a very weak impact on ASR accuracy, uncertainty in the protein sequence alignment can more strongly affect inferred ancestral sequences. Here, we show that errors in sequence alignment can produce errors in ASR across a range of realistic and simplified evolutionary scenarios. Importantly, sequence reconstruction errors can lead to errors in estimates of structural and functional properties of ancestral proteins, potentially undermining the reliability of analyses relying on ASR. We introduce an alignment-integrated ASR approach that combines information from many different sequence alignments. We show that integrating alignment uncertainty improves ASR accuracy and the accuracy of downstream structural and functional inferences, often performing as well as highly accurate structure-guided alignment. Given the growing evidence that sequence alignment errors can impact the reliability of ASR studies, we recommend that future studies incorporate approaches to mitigate the impact of alignment uncertainty. Probabilistic modeling of insertion and deletion events has the potential to radically improve ASR accuracy when the model reflects the true underlying evolutionary history, but further studies are required to thoroughly evaluate the reliability of these approaches under realistic conditions. 
    more » « less
  5. Abstract Ancestral sequence reconstruction (ASR) is a powerful tool to study the evolution of proteins and thus gain deep insight into the relationships among protein sequence, structure, and function. A major barrier to its broad use is the complexity of the task: it requires multiple software packages, complex file manipulations, and expert phylogenetic knowledge. Here we introducetopiary, a software pipeline that aims to overcome this barrier. To use topiary, users prepare a spreadsheet with a handful of sequences. Topiary then: (1) Infers the taxonomic scope for the ASR study and finds relevant sequences by BLAST; (2) Does taxonomically informed sequence quality control and redundancy reduction; (3) Constructs a multiple sequence alignment; (4) Generates a maximum‐likelihood gene tree; (5) Reconciles the gene tree to the species tree; (6) Reconstructs ancestral amino acid sequences; and (7) Determines branch supports. The pipeline returns annotated evolutionary trees, spreadsheets with sequences, and graphical summaries of ancestor quality. This is achieved by integrating modern phylogenetics software (Muscle5, RAxML‐NG, GeneRax, and PastML) with online databases (NCBI and the Open Tree of Life). In this paper, we introduce non‐expert readers to the steps required for ASR, describe the specific design choices made intopiary, provide a detailed protocol for users, and then validate the pipeline using datasets from a broad collection of protein families. Topiary is freely available for download:https://github.com/harmslab/topiary. 
    more » « less