skip to main content

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Friday, December 13 until 2:00 AM ET on Saturday, December 14 due to maintenance. We apologize for the inconvenience.


Title: Enhancing computational enzyme design by a maximum entropy strategy
Although computational enzyme design is of great importance, the advances utilizing physics-based approaches have been slow, and further progress is urgently needed. One promising direction is using machine learning, but such strategies have not been established as effective tools for predicting the catalytic power of enzymes. Here, we show that the statistical energy inferred from homologous sequences with the maximum entropy (MaxEnt) principle significantly correlates with enzyme catalysis and stability at the active site region and the more distant region, respectively. This finding decodes enzyme architecture and offers a connection between enzyme evolution and the physical chemistry of enzyme catalysis, and it deepens our understanding of the stability–activity trade-off hypothesis for enzymes. Overall, the strong correlations found here provide a powerful way of guiding enzyme design.  more » « less
Award ID(s):
2142727
PAR ID:
10408002
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
119
Issue:
7
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Laboratory evolution combined with computational enzyme design provides the opportunity to generate novel biocatalysts. Nevertheless, it has been challenging to understand how laboratory evolution optimizes designer enzymes by introducing seemingly random mutations. A typical enzyme optimized with laboratory evolution is the abiological Kemp eliminase, initially designed by grafting active site residues into a natural protein scaffold. Here, we relate the catalytic power of laboratory-evolved Kemp eliminases to the statistical energy ( E MaxEnt ) inferred from their natural homologous sequences using the maximum entropy model. The E MaxEnt of designs generated by directed evolution is correlated with enhanced activity and reduced stability, thus displaying a stability-activity trade-off. In contrast, the E MaxEnt for mutants in catalytic-active remote regions (in which remote residues are important for catalysis) is strongly anticorrelated with the activity. These findings provide an insight into the role of protein scaffolds in the adaption to new enzymatic functions. It also indicates that the valley in the E MaxEnt landscape can guide enzyme design for abiological catalysis. Overall, the connection between laboratory and natural evolution contributes to understanding what is optimized in the laboratory and how new enzymatic function emerges in nature, and provides guidance for computational enzyme design. 
    more » « less
  2. Abstract

    In recent decades, biocatalysis has emerged as an important alternative to chemical catalysis in pharmaceutical manufacturing. Biocatalysis is attractive because enzymatic cascades can synthesize complex molecules with incredible selectivity, yield, and in an environmentally benign manner. Enzymes for pharmaceutical biocatalysis are typically used in their unpurified state, since it is time‐consuming and cost‐prohibitive to purify enzymes using conventional chromatographic processes at scale. However, impurities present in crude enzyme preparations can consume substrate, generate unwanted byproducts, as well as make the isolation of desired products more cumbersome. Hence, a facile, nonchromatographic purification method would greatly benefit pharmaceutical biocatalysis. To address this issue, here we have captured enzymes into membraneless compartments by fusing enzymes with an intrinsically disordered protein region, the RGG domain from LAF‐1. The RGG domain can undergo liquid–liquid phase separation, forming liquid condensates triggered by changes in temperature or salt concentration. By centrifuging these liquid condensates, we have successfully purified enzyme‐RGG fusions, resulting in significantly enhanced purity compared to cell lysate. Furthermore, we performed enzymatic reactions utilizing purified fusion proteins to assay enzyme activity. Results from the enzyme assays indicate that enzyme‐RGG fusions purified by the centrifugation method retain enzymatic activity, with greatly reduced background activity compared to crude enzyme preparations. Our work focused on three different enzymes—a kinase, a phosphorylase, and an ATP‐dependent ligase. The kinase and phosphorylase are components of the biocatalytic cascade for manufacturing molnupiravir, and we demonstrated facile co‐purification of these two enzymes by co‐phase separation. To conclude, enzyme capture by RGG tagging promises to overcome difficulties in bioseparations and biocatalysis for pharmaceutical synthesis.

     
    more » « less
  3. Cold temperature is prevalent across the biosphere and slows the rates of chemical reactions. Increased catalysis has been predicted to be a dominant adaptive trait of enzymes to reduced temperature, and this expectation has informed physical models for enzyme catalysis and influenced bioprospecting strategies. To systematically test rate enhancement as an adaptive trait to cold, we paired kinetic constants of 2223 enzyme reactions with their organism’s optimal growth temperature ( T Growth ) and analyzed trends of rate constants as a function of T Growth . These data do not support a general increase in rate enhancement in cold adaptation. In the model enzyme ketosteroid isomerase (KSI), there is prior evidence for temperature adaptation from a change in an active site residue that results in a tradeoff between activity and stability. Nevertheless, we found that little of the rate constant variation for 20 KSI variants was accounted for by T Growth . In contrast, and consistent with prior expectations, we observed a correlation between stability and T Growth across 433 proteins. These results suggest that temperature exerts a weaker selection pressure on enzyme rate constants than stability and that evolutionary forces other than temperature are responsible for the majority of enzymatic rate constant variation. 
    more » « less
  4. Enzyme immobilization offers a number of advantages that improve biocatalysis; however, finding a proper way to immobilize enzymes is often a challenging task. Implanting enzymes in metal–organic frameworks (MOFs) via co-crystallization, also known as biomineralization, provides enhanced reusability and stability with minimal perturbation and substrate selectivity to the enzyme. Currently, there are limited metal–ligand combinations with a proper protocol guiding the experimental procedures. We have recently explored 10 combinations that allow custom immobilization of enzymes according to enzyme stability and activity in different metals/ligands. Here, as a follow-up of that work, we present a protocol for how to carry out custom immobilization of enzymes using the available combinations of metal ions and ligands. Detailed procedures to prepare metal ions, ligands, and enzymes for their co-crystallization, together with characterization and assessment, are discussed. Precautions for each experimental step and result analysis are highlighted as well. This protocol is important for enzyme immobilization in various research and industrial fields. 
    more » « less
  5. ABSTRACT

    Enzymes, as paramount protein catalysts, occupy a central role in fostering remarkable progress across numerous fields. However, the intricacy of sequence-function relationships continues to obscure our grasp of enzyme behaviors and curtails our capabilities in rational enzyme engineering. Generative artificial intelligence (AI), known for its proficiency in handling intricate data distributions, holds the potential to offer novel perspectives in enzyme research. Generative models could discern elusive patterns within the vast sequence space and uncover new functional enzyme sequences. This review highlights the recent advancements in employing generative AI for enzyme sequence analysis. We delve into the impact of generative AI in predicting mutation effects on enzyme fitness, catalytic activity and stability, rationalizing the laboratory evolution of de novo enzymes, and decoding protein sequence semantics and their application in enzyme engineering. Notably, the prediction of catalytic activity and stability of enzymes using natural protein sequences serves as a vital link, indicating how enzyme catalysis shapes enzyme evolution. Overall, we foresee that the integration of generative AI into enzyme studies will remarkably enhance our knowledge of enzymes and expedite the creation of superior biocatalysts.

     
    more » « less