skip to main content


Title: A Topological Selection of Folding Pathways from Native States of Knotted Proteins
Understanding how knotted proteins fold is a challenging problem in biology. Researchers have proposed several models for their folding pathways, based on theory, simulations and experiments. The geometry of proteins with the same knot type can vary substantially and recent simulations reveal different folding behaviour for deeply and shallow knotted proteins. We analyse proteins forming open-ended trefoil knots by introducing a topologically inspired statistical metric that measures their entanglement. By looking directly at the geometry and topology of their native states, we are able to probe different folding pathways for such proteins. In particular, the folding pathway of shallow knotted carbonic anhydrases involves the creation of a double-looped structure, contrary to what has been observed for other knotted trefoil proteins. We validate this with Molecular Dynamics simulations. By leveraging the geometry and local symmetries of knotted proteins’ native states, we provide the first numerical evidence of a double-loop folding mechanism in trefoil proteins.  more » « less
Award ID(s):
2019745 1427654
NSF-PAR ID:
10320590
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Symmetry
Volume:
13
Issue:
9
ISSN:
2073-8994
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. How knotted proteins fold has remained controversial since the identification of deeply knotted proteins nearly two decades ago. Both computational and experimental approaches have been used to investigate protein knot formation. Motivated by the computer simulations of Bölinger et al. [Bölinger D, et al. (2010)PLoS Comput Biol6:e1000731] for the folding of the61-knotted α-haloacid dehalogenase (DehI) protein, we introduce a topological description of knot folding that could describe pathways for the formation of all currently known protein knot types and predicts knot types that might be identified in the future. We analyze fingerprint data from crystal structures of protein knots as evidence that particular protein knots may fold according to specific pathways from our theory. Our results confirm Taylor’s twisted hairpin theory of knot folding for the31-knotted proteins and the41-knotted ketol-acid reductoisomerases and present alternative folding mechanisms for the41-knotted phytochromes and the52- and61-knotted proteins.

     
    more » « less
  2. Kasson, Peter M. (Ed.)
    Atomistic simulations can provide valuable, experimentally-verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein’s folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically-reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online , can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease. 
    more » « less
  3. Abstract

    Proteins fold in 3-dimensional conformations which are important for their function. Characterizing the global conformation of proteins rigorously and separating secondary structure effects from topological effects is a challenge. New developments in applied knot theory allow to characterize the topological characteristics of proteins (knotted or not). By analyzing a small set of two-state and multi-state proteins with no knots or slipknots, our results show that 95.4% of the analyzed proteins have non-trivial topological characteristics, as reflected by the second Vassiliev measure, and that the logarithm of the experimental protein folding rate depends on both the local geometry and the topology of the protein’s native state.

     
    more » « less
  4. Abstract

    We examine the influence of cellular interactions in all‐atom models of a section of theHomo sapienscytoplasm on the early folding events of the three‐helix bundle protein B (PB). While genetically engineered PB is known to fold in dilute water box simulations in three microseconds, the three initially unfolded PB copies in our two cytoplasm models using a similar force field did not reach the native state during 30‐microsecond simulations. We did however capture the formation of all three helices in a compact native‐like topology. Folding in vivo is delayed because intramolecular contact formation within PB is in direct competition with intermolecular contacts between PB and surrounding macromolecules. In extreme cases, intermolecular beta‐sheets are formed. Interactions with other macromolecules are also observed to promote structure formation, for example when a PB helix in our simulations is shielded from solvent by macromolecular crowding. Sticking and crowding in our models initiate sampling of helix/sheet structural plasticity of PB. Relatedly, in past in vitro experiments, similar GA domains were shown to switch between two different folds. Finally, we also observed that stickiness between PB and the cellular environment can be modulated in our simulations through the reduction in protein hydrophobicity when we reversed PB back to the wild‐type sequence. This study demonstrates that even fast‐folding proteins can get stuck in non‐native states in the cell, making them useful models for protein–chaperone interactions and early stages of aggregate formation relevant to cellular disease.

     
    more » « less
  5. Chaperonins are biological nanomachines that help newly translated proteins to fold by rescuing them from kinetically trapped misfolded states. Protein folding assistance by the chaperonin machinery is obligatory in vivo for a subset of proteins in the bacterial proteome. Chaperonins are large oligomeric complexes, with unusual seven fold symmetry (group I) or eight/nine fold symmetry (group II), that form double-ring constructs, enclosing a central cavity that serves as the folding chamber. Dramatic large-scale conformational changes, that take place during ATP-driven cycles, allow chaperonins to bind misfolded proteins, encapsulate them into the expanded cavity and release them back into the cellular environment, regardless of whether they are folded or not. The theory associated with the iterative annealing mechanism, which incorporated the conformational free energy landscape description of protein folding, quantitatively explains most, if not all, the available data. Misfolded conformations are associated with low energy minima in a rugged energy landscape. Random disruptions of these low energy conformations result in higher free energy, less folded, conformations that can stochastically partition into the native state. Two distinct mechanisms of annealing action have been described. Group I chaperonins (GroEL homologues in eubacteria and endosymbiotic organelles), recognize a large number of misfolded proteins non-specifically and operate through highly coordinated cooperative motions. By contrast, the less well understood group II chaperonins (CCT in Eukarya and thermosome/TF55 in Archaea), assist a selected set of substrate proteins. Sequential conformational changes within a CCT ring are observed, perhaps promoting domain-by-domain substrate folding. Chaperonins are implicated in bacterial infection, autoimmune disease, as well as protein aggregation and degradation diseases. Understanding the chaperonin mechanism and the specific proteins they rescue during the cell cycle is important not only for the fundamental aspect of protein folding in the cellular environment, but also for effective therapeutic strategies. 
    more » « less