Title: Quantitative prediction of ensemble dynamics, shapes and contact propensities of intrinsically disordered proteins
Intrinsically disordered proteins (IDPs) are highly dynamic systems that play an important role in cell signaling processes, and their malfunction often causes human disease. Proper understanding of IDP function requires realistic characterization not only of their three-dimensional conformational ensembles at atomic-level resolution but also of the time scales of interconversion between their conformational substates. Large sets of experimental data are often used in combination with molecular modeling to restrain or bias models to improve agreement with experiment. It is shown here for the N-terminal transactivation domain of p53 (p53TAD) and Pup, two IDPs that fold upon binding to their targets, how the latest advancements in molecular dynamics (MD) simulation methodology produce native conformational ensembles by combining replica exchange with series of microsecond MD simulations. The resulting ensembles closely reproduce experimental data at the global conformational ensemble level, in terms of the distribution properties of the radius of gyration tensor, and at the local level, in terms of NMR properties including 15N spin relaxation, without the need for reweighting. Further inspection revealed that 10–20% of the individual MD trajectories display the formation of secondary structures not observed in the experimental NMR data. The IDP ensembles were analyzed by graph theory to identify dominant inter-residue contact clusters and characteristic amino-acid contact propensities. These findings indicate that modern MD force fields with residue-specific backbone potentials can produce highly realistic IDP ensembles sampling a hierarchy of nano- and picosecond time scales, providing new insights into their biological function.
Award ID(s):
2103637
PAR ID:
10414931
Author(s) / Creator(s):
;
Editor(s):
de Groot, Bert L.
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
18
Issue:
9
ISSN:
1553-7358
Page Range / eLocation ID:
e1010036
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Intrinsically disordered proteins (IDPs) are highly prevalent and play important roles in biology and human diseases. It is now also recognized that many IDPs remain dynamic even in specific complexes and functional assemblies. Computer simulations are essential for deriving a molecular description of the disordered protein ensembles and dynamic interactions for a mechanistic understanding of IDPs in biology, diseases, and therapeutics. Here, we provide an in-depth review of recent advances in the multi-scale simulation of disordered protein states, with a particular emphasis on the development and application of advanced sampling techniques for studying IDPs. These techniques are critical for adequate sampling of the manifold functionally relevant conformational spaces of IDPs. Together with dramatically improved protein force fields, these advanced simulation approaches have achieved substantial success and demonstrated significant promise towards the quantitative and predictive modeling of IDPs and their dynamic interactions. We will also discuss important challenges remaining in the atomistic simulation of larger systems and how various coarse-grained approaches may help to bridge the remaining gaps in the accessible time- and length-scales of IDP simulations. 
  2. Abstract Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure–function relationships, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand the regulation and disease-related aggregation of IDPs is challenging both experimentally and computationally. In this paper we first introduced an unsupervised deep learning-based model, termed Internal Coordinate Net (ICoN), which learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, we selected interpolating data points in the learned latent space that rapidly identify novel synthetic conformations with sophisticated and large-scale sidechain and backbone arrangements. Third, with the highly dynamic amyloid-β1-42 (Aβ42) monomer, our deep learning model provided a comprehensive sampling of Aβ42’s conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations with important interactions in atomistic detail that are not included in the training data. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. This approach is highly transferable and can be used for any available data for training. The work also demonstrated the ability of deep learning to utilize learned natural atomistic motions in protein conformation sampling. 
  3. Nuclear magnetic resonance (NMR) spectroscopy is widely recognized for its ability to provide atomic-level resolution of structures and interactions in intrinsically disordered proteins (IDPs). However, its application is often limited when studying large proteins that contain both structured and disordered regions. This challenge arises due to the broad peaks exhibited by structured regions in such proteins, which result from local compaction and restricted motions, complicating spectral analysis. Additionally, broadening in IDP complexes, caused by exchange between free and bound states and/or the large size of the bound state, further obscures NMR signals and hinders the mapping of interaction sites. Moreover, IDPs are highly sensitive to proteolytic cleavage, necessitating careful handling and optimization during expression, purification, and data collection. In this study, we demonstrate how we successfully overcame these hurdles using examples from our work on the N-terminal region of the dynein intermediate chain (IC), which contains both α-helical and intrinsically disordered regions. By employing paramagnetic relaxation enhancement (PRE) NMR to probe conformational dynamics, water-amide chemical exchange to measure solvent accessibility, and saturation transfer difference (STD) NMR to map specific interactions with p150Glued and Nudel, we identified novel transient structures and interaction networks within IC. Our findings highlight the utility of these advanced NMR techniques in elucidating the dynamic behavior of IDPs and their complexes, providing valuable insights into their structural and functional roles. 
  4. Abstract The preponderance of intrinsically disordered proteins (IDPs) in the eukaryotic proteome, and their ability to interact with each other, and with folded proteins, RNA, and DNA for functional purposes, have made it important to quantitatively characterize their biophysical properties. Toward this end, we developed the transferable self‐organized polymer (SOP‐IDP) model to calculate the properties of several IDPs. The values of the radius of gyration ($R_g$) obtained from SOP‐IDP simulations are in excellent agreement (correlation coefficient of 0.96) with those estimated from SAXS experiments. For AP180 and Epsin, the predicted values of the hydrodynamic radii ($R_h$) are in nearly quantitative agreement with those from fluorescence correlation spectroscopy (FCS) experiments. Strikingly, the calculated SAXS profiles for 36 IDPs are also nearly superimposable on the experimental profiles. The dependence of $R_g$ and the mean end‐to‐end distance ($R_{ee}$) on chain length, $N$, follows Flory's scaling law, $R_\alpha \approx a_\alpha N^{0.588}$ ($\alpha = g$ and $e$), suggesting that globally IDPs behave as synthetic polymers in a good solvent. This finding depends on the solvent quality, which can be altered by changing variables such as pH and salt concentration. The values of $a_g$ and $a_e$ are 0.20 and 0.48 nm, respectively. Surprisingly, finite size corrections to scaling, expected on theoretical grounds, are negligible for $R_g$ and $R_{ee}$. In contrast, only by accounting for the finite sizes of the IDPs can the dependence of the experimentally measurable $R_h$ on $N$ be quantitatively explained using $\nu = 0.588$. Although the Flory scaling law captures the estimates for $R_g$, $R_{ee}$, and $R_h$ accurately, the spread of the simulated data around the theoretical curve is suggestive of sequence‐specific features that emerge through a fine‐grained analysis of the conformational ensembles using hierarchical clustering. Typically, the ensemble of conformations partitions into three distinct clusters, having different equilibrium populations and structural properties. 
Without any further readjustments to the parameters of the SOP‐IDP model, we also obtained nearly quantitative agreement with paramagnetic relaxation enhancement (PRE) measurements for α‐synuclein. The transferable SOP‐IDP model sets the stage for several applications, including the study of phase separation in IDPs and interactions with nucleic acids. 
  5. Dr. Sudipta Maiti (Ed.)
    IA3 is a 68 amino acid natural peptide/protein inhibitor of yeast aspartic proteinase A (YPRA) that is intrinsically disordered in solution with induced N-terminal helicity when in the protein complex with YPRA. Based upon the intrinsically disordered protein (IDP) parameters of fractional net charge (FNC), net charge density per residue (NCPR), and charge patterning (κ), the two domains of IA3 are defined to occupy different domains within conformationally based subclasses of IDPs, thus making IA3 a bimodal-domain IDP. Site-directed spin-labeling (SDSL) electron paramagnetic resonance (EPR) spectroscopy and low-field Overhauser dynamic nuclear polarization (ODNP) spectroscopy results show that these two domains possess different degrees of compaction and hydration diffusivity behavior. This work suggests that SDSL EPR line shapes, analyzed in terms of their local tumbling volume (VL), provide insight into the compaction of the unstructured IDP ensemble in solution and that protein sequence and net charge distribution patterns within a conformational subclass can impact bound water hydration dynamics, thus possibly offering an alternative thermodynamic property that can encode conformational binding and behavior of IDPs and liquid-liquid phase separations. 