skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Validation of DBFOLD: An efficient algorithm for computing folding pathways of complex proteins
Atomistic simulations can provide valuable, experimentally-verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein’s folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically-reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online , can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease.  more » « less
Award ID(s):
1764269
PAR ID:
10328969
Author(s) / Creator(s):
; ;
Editor(s):
Kasson, Peter M.
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
16
Issue:
11
ISSN:
1553-7358
Page Range / eLocation ID:
e1008323
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Proteins are constantly undergoing folding and unfolding transitions, with rates that determine their homeostasis in vivo and modulate their biological function. The ability to optimize these rates without affecting overall native stability is hence highly desirable for protein engineering and design. The great challenge is, however, that mutations generally affect folding and unfolding rates with inversely complementary fractions of the net free energy change they inflict on the native state. Here we address this challenge by targeting the folding transition state (FTS) of chymotrypsin inhibitor 2 (CI2), a very slow and stable two‐state folding protein with an FTS known to be refractory to change by mutation. We first discovered that the CI2's FTS is energetically taxed by the desolvation of several, highly conserved, charges that form a buried salt bridge network in the native structure. Based on these findings, we designed a CI2 variant that bears just four mutations and aims to selectively stabilize the FTS. This variant has >250‐fold faster rates in both directions and hence identical native stability, demonstrating the success of our FTS‐centric design strategy. With an optimized FTS, CI2 also becomes 250‐fold more sensitive to proteolytic degradation by its natural substrate chymotrypsin, and completely loses its activity as inhibitor. These results indicate that CI2 has been selected through evolution to have a very unstable FTS in order to attain the kinetic stability needed to effectively function as protease inhibitor. Moreover, the CI2 case showcases that protein (un)folding rates can critically pivot around a few key residues‐interactions, which can strongly modify the general effects of known structural factors such as domain size and fold topology. From a practical standpoint, our results suggest that future efforts should perhaps focus on identifying such critical residues‐interactions in proteins as best strategy to significantly improve our ability to predict and engineer protein (un)folding rates. 
    more » « less
  2. Abstract Ultrafast folding proteins have become an important paradigm in the study of protein folding dynamics. Due to their low energetic barriers and fast kinetics, they are amenable for study by both experiment and simulation. However, single molecule force spectroscopy experiments on these systems are challenging as these proteins do not provide the mechanical fingerprints characteristic of more mechanically stable proteins, which makes it difficult to extract information about the folding dynamics of the molecule. Here, we investigate the unfolding of the ultrafast protein Engrailed Homeodomain (EnHD) by single-molecule atomic force microscopy experiments. Constant speed experiments on EnHD result in featureless transitions typical of compliant proteins. However, in the force-ramp mode we recover sigmoidal curves that we interpret as a very compliant protein that folds and unfolds many times over a marginal barrier. This is supported by a simple theoretical model and coarse-grained molecular simulations. Our results show the ability of force to modulate the unfolding dynamics of ultrafast folding proteins. 
    more » « less
  3. Stretched-exponential protein refolding kinetics, first observed decades ago, were attributed to a nonnative ensemble of structures with parallel, non-interconverting folding pathways. However, the structural origin of the large energy barriers preventing interconversion between these folding pathways is unknown. Here, we combine simulations with limited proteolysis (LiP) and cross-linking (XL) mass spectrometry (MS) to study the protein phosphoglycerate kinase (PGK). Simulations recapitulate its stretched-exponential folding kinetics and reveal that misfolded states involving changes of entanglement underlie this behavior: either formation of a nonnative, noncovalent lasso entanglement or failure to form a native entanglement. These misfolded states act as kinetic traps, requiring extensive unfolding to escape, which results in a distribution of free energy barriers and pathway partitioning. Using LiP-MS and XL-MS, we propose heterogeneous structural ensembles consistent with these data that represent the potential long-lived misfolded states PGK populates. This structural and energetic heterogeneity creates a hierarchy of refolding timescales, explaining stretched-exponential kinetics. 
    more » « less
  4. Abstract The forces that stabilize membrane proteins remain elusive to precise quantification. Particularly important, but poorly resolved, are the forces present during the initial unfolding of a membrane protein, where the most native set of interactions is present. A high‐precision, atomic force microscopy assay was developed to study the initial unfolding of bacteriorhodopsin. A rapid near‐equilibrium folding between the first three unfolding states was discovered, the two transitions corresponded to the unfolding of five and three amino acids, respectively, when using a cantilever optimized for 2 μs resolution. The third of these states was retinal‐stabilized and previously undetected, despite being the most mechanically stable state in the whole unfolding pathway, supporting 150 pN for more than 1 min. This ability to measure the dynamics of the initial unfolding of bacteriorhodopsin provides a platform for quantifying the energetics of membrane proteins under native‐like conditions. 
    more » « less
  5. Abstract Many proteins must interact with molecular chaperones to achieve their native state in the cell. Yet, how chaperone binding‐site characteristics affect the folding process is poorly understood. The ubiquitous Hsp70 chaperone system prevents client‐protein aggregation by holding unfolded conformations and by unfolding misfolded states. Hsp70 binding sites of client proteins comprise a nonpolar core surrounded by positively charged residues. However, a detailed analysis of Hsp70 binding sites on a proteome‐wide scale is still lacking. Further, it is not known whether proteins undergo some degree of folding while chaperone bound. Here, we begin to address the above questions by identifying Hsp70 binding sites in 2258Escherichia coli(E. coli) proteins. We find that most proteins bear at least one Hsp70 binding site and that the number of Hsp70 binding sites is directly proportional to protein size. Aggregation propensity upon release from the ribosome correlates with number of Hsp70 binding sites only in the case of large proteins. Interestingly, Hsp70 binding sites are more solvent‐exposed than other nonpolar sites, in protein native states. Our findings show that the majority ofE. coliproteins are systematically enabled to interact with Hsp70 even if this interaction only takes place during a fraction of the protein lifetime. In addition, our data suggest that some conformational sampling may take place within Hsp70‐bound states, due to the solvent exposure of some chaperone binding sites in native proteins. In all, we propose that Hsp70‐chaperone‐binding traits have evolved to favor Hsp70‐assisted protein folding devoid of aggregation. 
    more » « less