skip to main content

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Friday, September 13 until 2:00 AM ET on Saturday, September 14 due to maintenance. We apologize for the inconvenience.


Title: Substrate multiplexed protein engineering facilitates promiscuous biocatalytic synthesis
Abstract

Enzymes with high activity are readily produced through protein engineering, but intentionally and efficiently engineering enzymes for an expanded substrate scope is a contemporary challenge. One approach to address this challenge is Substrate Multiplexed Screening (SUMS), where enzyme activity is measured on competing substrates. SUMS has long been used to rigorously quantitate native enzyme specificity, primarily for in vivo settings. SUMS has more recently found sporadic use as a protein engineering approach but has not been widely adopted by the field, despite its potential utility. Here, we develop principles of how to design and interpret SUMS assays to guide protein engineering. This rich information enables improving activity with multiple substrates simultaneously, identifies enzyme variants with altered scope, and indicates potential mutational hot-spots as sites for further engineering. These advances leverage common laboratory equipment and represent a highly accessible and customizable method for enzyme engineering.

 
more » « less
NSF-PAR ID:
10373023
Author(s) / Creator(s):
; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
13
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The ability to site-selectively modify equivalent functional groups in a molecule has the potential to streamline syntheses and increase product yields by lowering step counts. Enzymes catalyze site-selective transformations throughout primary and secondary metabolism, but leveraging this capability for non-native substrates and reactions requires a detailed understanding of the potential and limitations of enzyme catalysis and how these bounds can be extended by protein engineering. In this review, we discuss representative examples of site-selective enzyme catalysis involving functional group manipulation and C–H bond functionalization. We include illustrative examples of native catalysis, but our focus is on cases involving non-native substrates and reactions often using engineered enzymes. We then discuss the use of these enzymes for chemoenzymatic transformations and target-oriented synthesis and conclude with a survey of tools and techniques that could expand the scope of non-native site-selective enzyme catalysis. 
    more » « less
  2. Abstract

    Predicting the range of substrates accepted by an enzyme from its amino acid sequence is challenging. Although sequence‐ and structure‐based annotation approaches are often accurate for predicting broad categories of substrate specificity, they generally cannot predict which specific molecules will be accepted as substrates for a given enzyme, particularly within a class of closely related molecules. Combining targeted experimental activity data with structural modeling, ligand docking, and physicochemical properties of proteins and ligands with various machine learning models provides complementary information that can lead to accurate predictions of substrate scope for related enzymes. Here we describe such an approach that can predict the substrate scope of bacterial nitrilases, which catalyze the hydrolysis of nitrile compounds to the corresponding carboxylic acids and ammonia. Each of the four machine learning models (logistic regression, random forest, gradient‐boosted decision trees, and support vector machines) performed similarly (average ROC = 0.9, average accuracy = ~82%) for predicting substrate scope for this dataset, although random forest offers some advantages. This approach is intended to be highly modular with respect to physicochemical property calculations and software used for structural modeling and docking.

     
    more » « less
  3. Abstract

    Biocatalytic cascades are uniquely powerful for the efficient, asymmetric synthesis of bioactive compounds. However, high substrate specificity can hinder the scope of biocatalytic cascades because the constituent enzymes may have non‐complementary activity. In this study, we implemented a substrate multiplexed screening (SUMS) based directed evolution approach to improve the substrate scope overlap between a transaldolase (ObiH) and a decarboxylase for the production of chiral 1,2‐amino alcohols. To generate a promiscuous cascade, we engineered a tryptophan decarboxylase to act efficiently on β‐OH amino acids while avoiding activity onl‐threonine, which is needed for ObiH activity. We leveraged this exquisite selectivity with matched substrate scope to produce a variety of enantiopure 1,2‐amino alcohols in a one‐pot cascade from aldehydes or styrene oxides. This demonstration shows how SUMS can be used to guide the development of promiscuous, C−C bond forming cascades.

     
    more » « less
  4. Abstract

    Biocatalytic cascades are uniquely powerful for the efficient, asymmetric synthesis of bioactive compounds. However, high substrate specificity can hinder the scope of biocatalytic cascades because the constituent enzymes may have non‐complementary activity. In this study, we implemented a substrate multiplexed screening (SUMS) based directed evolution approach to improve the substrate scope overlap between a transaldolase (ObiH) and a decarboxylase for the production of chiral 1,2‐amino alcohols. To generate a promiscuous cascade, we engineered a tryptophan decarboxylase to act efficiently on β‐OH amino acids while avoiding activity onl‐threonine, which is needed for ObiH activity. We leveraged this exquisite selectivity with matched substrate scope to produce a variety of enantiopure 1,2‐amino alcohols in a one‐pot cascade from aldehydes or styrene oxides. This demonstration shows how SUMS can be used to guide the development of promiscuous, C−C bond forming cascades.

     
    more » « less
  5. Abstract

    De novo enzyme design has sought to introduce active sites and substrate-binding pockets that are predicted to catalyse a reaction of interest into geometrically compatible native scaffolds1,2, but has been limited by a lack of suitable protein structures and the complexity of native protein sequence–structure relationships. Here we describe a deep-learning-based ‘family-wide hallucination’ approach that generates large numbers of idealized protein structures containing diverse pocket shapes and designed sequences that encode them. We use these scaffolds to design artificial luciferases that selectively catalyse the oxidative chemiluminescence of the synthetic luciferin substrates diphenylterazine3and 2-deoxycoelenterazine. The designed active sites position an arginine guanidinium group adjacent to an anion that develops during the reaction in a binding pocket with high shape complementarity. For both luciferin substrates, we obtain designed luciferases with high selectivity; the most active of these is a small (13.9 kDa) and thermostable (with a melting temperature higher than 95 °C) enzyme that has a catalytic efficiency on diphenylterazine (kcat/Km = 106 M−1 s−1) comparable to that of native luciferases, but a much higher substrate specificity. The creation of highly active and specific biocatalysts from scratch with broad applications in biomedicine is a key milestone for computational enzyme design, and our approach should enable generation of a wide range of luciferases and other enzymes.

     
    more » « less