Title: A Topological Selection of Folding Pathways from Native States of Knotted Proteins
Understanding how knotted proteins fold is a challenging problem in biology. Researchers have proposed several models for their folding pathways, based on theory, simulations and experiments. The geometry of proteins with the same knot type can vary substantially, and recent simulations reveal different folding behaviour for deeply and shallowly knotted proteins. We analyse proteins forming open-ended trefoil knots by introducing a topologically inspired statistical metric that measures their entanglement. By looking directly at the geometry and topology of their native states, we are able to probe different folding pathways for such proteins. In particular, the folding pathway of shallow knotted carbonic anhydrases involves the creation of a double-looped structure, contrary to what has been observed for other knotted trefoil proteins. We validate this with Molecular Dynamics simulations. By leveraging the geometry and local symmetries of knotted proteins’ native states, we provide the first numerical evidence of a double-loop folding mechanism in trefoil proteins.
Award ID(s):
2019745 1427654
PAR ID:
10320590
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Symmetry
Volume:
13
Issue:
9
ISSN:
2073-8994
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. How knotted proteins fold has remained controversial since the identification of deeply knotted proteins nearly two decades ago. Both computational and experimental approaches have been used to investigate protein knot formation. Motivated by the computer simulations of Bölinger et al. [Bölinger D, et al. (2010) PLoS Comput Biol 6:e1000731] for the folding of the 6₁-knotted α-haloacid dehalogenase (DehI) protein, we introduce a topological description of knot folding that could describe pathways for the formation of all currently known protein knot types and predicts knot types that might be identified in the future. We analyze fingerprint data from crystal structures of protein knots as evidence that particular protein knots may fold according to specific pathways from our theory. Our results confirm Taylor’s twisted hairpin theory of knot folding for the 3₁-knotted proteins and the 4₁-knotted ketol-acid reductoisomerases and present alternative folding mechanisms for the 4₁-knotted phytochromes and the 5₂- and 6₁-knotted proteins.
  2. Proteins fold into three-dimensional conformations that are important for their function. Characterizing the global conformation of proteins rigorously and separating secondary structure effects from topological effects is a challenge. New developments in applied knot theory make it possible to characterize the topological features of proteins (knotted or not). By analyzing a small set of two-state and multi-state proteins with no knots or slipknots, our results show that 95.4% of the analyzed proteins have non-trivial topological characteristics, as reflected by the second Vassiliev measure, and that the logarithm of the experimental protein folding rate depends on both the local geometry and the topology of the protein’s native state.
  3. Kasson, Peter M. (Ed.)
    Atomistic simulations can provide valuable, experimentally verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein’s folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online, can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease.
  4. Stretched-exponential protein refolding kinetics, first observed decades ago, were attributed to a nonnative ensemble of structures with parallel, non-interconverting folding pathways. However, the structural origin of the large energy barriers preventing interconversion between these folding pathways is unknown. Here, we combine simulations with limited proteolysis (LiP) and cross-linking (XL) mass spectrometry (MS) to study the protein phosphoglycerate kinase (PGK). Simulations recapitulate its stretched-exponential folding kinetics and reveal that misfolded states involving changes of entanglement underlie this behavior: either formation of a nonnative, noncovalent lasso entanglement or failure to form a native entanglement. These misfolded states act as kinetic traps, requiring extensive unfolding to escape, which results in a distribution of free energy barriers and pathway partitioning. Using LiP-MS and XL-MS, we propose heterogeneous structural ensembles consistent with these data that represent the potential long-lived misfolded states PGK populates. This structural and energetic heterogeneity creates a hierarchy of refolding timescales, explaining stretched-exponential kinetics. 
  5. Folding of ribozymes into well-defined tertiary structures usually requires divalent cations. How Mg²⁺ ions direct the folding kinetics has been a long-standing unsolved problem because experiments cannot detect the positions and dynamics of ions. To address this problem, we used molecular simulations to dissect the folding kinetics of the Azoarcus ribozyme by monitoring the path each molecule takes to reach the folded state. We quantitatively establish that Mg²⁺ binding to specific sites, coupled with counter-ion release of monovalent cations, stimulates the formation of secondary and tertiary structures, leading to diverse pathways that include direct rapid folding and trapping in misfolded structures. In some molecules, key tertiary structural elements form when Mg²⁺ ions bind to specific RNA sites at the earliest stages of folding, leading to specific collapse and rapid folding. In others, the formation of non-native base pairs, whose rearrangement is needed to reach the folded state, is the rate-limiting step. Escape from energetic traps, driven by thermal fluctuations, occurs readily. In contrast, the transition to the native state from long-lived topologically trapped native-like metastable states is extremely slow. Specific collapse and formation of energetically or topologically frustrated states occur early in the assembly process.