skip to main content


Title: Modeling pKas of unfolded proteins to probe structural models of unfolded state
Modeling unfolded states of proteins has implications for protein folding and stability. Since in unfolded state proteins adopt multiple conformations, any experimentally measured quantity is ensemble averaged, therefore the computed quantity should be ensemble averaged as well. Here, we investigate the possibility that one can model an unfolded state ensemble with the coil model approach, algorithm such as “flexible-meccano” [Ozenne V et al., Flexible-meccano: A tool for the generation of explicit ensemle descriptions of intrinsically disordered proteins and their associated experimental observables, Bioinformatics 28:1463–1470, 2012], developed to generate structures for intrinsically disordered proteins. We probe such a possibility by using generated structures to calculate pKas of titratable groups and compare with experimental data. It is demonstrated that even with a small number of representative structures of unfolded state, the average calculated pKas are in very good agreement with experimentally measured pKas. Also, predictions are made for titratable groups for which there is no experimental data available. This suggests that the coil model approach is suitable for generating 3D structures of unfolded state of proteins. To make the approach suitable for large-scale modeling, which requires limited number of structures, we ranked the structures according to their solvent accessible surface area (SASA). It is shown that in the majority of cases, the top structures with smallest SASA are enough to represent unfolded state.  more » « less
Award ID(s):
1725573
NSF-PAR ID:
10201301
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Journal of Theoretical and Computational Chemistry
Volume:
18
Issue:
04
ISSN:
0219-6336
Page Range / eLocation ID:
1950020
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Outer membrane proteins (OMPs) must exist as an unfolded ensemble while interacting with a chaperone network in the periplasm of Gram-negative bacteria. Here, we developed a method to model unfolded OMP (uOMP) conformational ensembles using the experimental properties of two well-studied OMPs. The overall sizes and shapes of the unfolded ensembles in the absence of a denaturant were experimentally defined by measuring the sedimentation coefficient as a function of urea concentration. We used these data to model a full range of unfolded conformations by parameterizing a targeted coarse-grained simulation protocol. The ensemble members were further refined by short molecular dynamics simulations to reflect proper torsion angles. The final conformational ensembles have polymer properties different from unfolded soluble and intrinsically disordered proteins and reveal inherent differences in the unfolded states that necessitate further investigation. Building these uOMP ensembles advances the understanding of OMP biogenesis and provides essential information for interpreting structures of uOMP-chaperone complexes. 
    more » « less
  2. Over the last thirty years the unfolded state of proteins has attracted considerable interest owing to the discovery of intrinsically disordered proteins which perform a plethora of functions despite resembling unfolded proteins to a significant extent. Research on both, unfolded and disordered proteins has revealed that their conformational properties can deviate locally from random coil behavior. In this context results from work on short oligopeptides suggest that individual amino acid residues sample the sterically allowed fraction of the Ramachandran plot to a different extent. Alanine has been found to exhibit a peculiarity in that it has a very high propensity for adopting polyproline II like conformations. This Perspectives article reviews work on short peptides aimed at exploring the Ramachandran distributions of amino acid residues in different contexts with experimental and computational means. Based on the thus provided overview the article discussed to what extent short peptides can serve as tools for exploring unfolded and disordered proteins and as benchmarks for the development of a molecular dynamics force field. 
    more » « less
  3. Denatured, unfolded, and intrinsically disordered proteins (collectively referred to here as unfolded proteins) can be described using analytical polymer models. These models capture various polymeric properties and can be fit to simulation results or experimental data. However, the model parameters commonly require users’ decisions, making them useful for data interpretation but less clearly applicable as stand-alone reference models. Here we use all-atom simulations of polypeptides in conjunction with polymer scaling theory to parameterize an analytical model of unfolded polypeptides that behave as ideal chains (ν = 0.50). The model, which we call the analytical Flory random coil (AFRC), requires only the amino acid sequence as input and provides direct access to probability distributions of global and local conformational order parameters. The model defines a specific reference state to which experimental and computational results can be compared and normalized. As a proof-of-concept, we use the AFRC to identify sequence-specific intramolecular interactions in simulations of disordered proteins. We also use the AFRC to contextualize a curated set of 145 different radii of gyration obtained from previously published small-angle X-ray scattering experiments of disordered proteins. The AFRC is implemented as a stand-alone software package and is also available via a Google Colab notebook. In summary, the AFRC provides a simple-to-use reference polymer model that can guide intuition and aid in interpreting experimental or simulation results. 
    more » « less
  4. The Flory isolated pair hypothesis (IPH) is one of the corner stones of the random coil model, which is generally invoked to describe the conformational dynamics of unfolded and intrinsically disordered proteins (IDPs). It stipulates, that individual residues sample the entire sterically allowed space of the Ramachandran plot without exhibiting any correlations with the conformational dynamics of its neighbors. However, multiple lines of computational, bioinformatic and experimental evidence suggest that nearest neighbors have a significant influence on the conformational sampling of amino acid residues. This implies that the conformational entropy of unfolded polypeptides and proteins is much less than one would expect based on the Ramachandran plots of individual residues. A further implication is that the Gibbs energies of residues in unfolded proteins or polypeptides are not additive. This review provides an overview of what is currently known and what has yet to be explored regarding nearest neighbor interactions in unfolded proteins. 
    more » « less
  5. Much attention is being paid to conformational biases in the ensembles of intrinsically disordered proteins. However, it is currently unknown whether or how conformational biases within the disordered ensembles of foldable proteins affect function in vivo. Recently, we demonstrated that water can be a good solvent for unfolded polypeptide chains, even those with a hydrophobic and charged sequence composition typical of folded proteins. These results run counter to the generally accepted model that protein folding begins with hydrophobicity-driven chain collapse. Here we investigate what other features, beyond amino acid composition, govern chain collapse. We found that local clustering of hydrophobic and/or charged residues leads to significant collapse of the unfolded ensemble of pertactin, a secreted autotransporter virulence protein fromBordetella pertussis, as measured by small angle X-ray scattering (SAXS). Sequence patterns that lead to collapse also correlate with increased intermolecular polypeptide chain association and aggregation. Crucially, sequence patterns that support an expanded conformational ensemble enhance pertactin secretion to the bacterial cell surface. Similar sequence pattern features are enriched across the large and diverse family of autotransporter virulence proteins, suggesting sequence patterns that favor an expanded conformational ensemble are under selection for efficient autotransporter protein secretion, a necessary prerequisite for virulence. More broadly, we found that sequence patterns that lead to more expanded conformational ensembles are enriched across water-soluble proteins in general, suggesting protein sequences are under selection to regulate collapse and minimize protein aggregation, in addition to their roles in stabilizing folded protein structures.

     
    more » « less