

Title: Properties of protein unfolded states suggest broad selection for expanded conformational ensembles

Much attention is being paid to conformational biases in the ensembles of intrinsically disordered proteins. However, it is currently unknown whether or how conformational biases within the disordered ensembles of foldable proteins affect function in vivo. Recently, we demonstrated that water can be a good solvent for unfolded polypeptide chains, even those with a hydrophobic and charged sequence composition typical of folded proteins. These results run counter to the generally accepted model that protein folding begins with hydrophobicity-driven chain collapse. Here we investigate what other features, beyond amino acid composition, govern chain collapse. We found that local clustering of hydrophobic and/or charged residues leads to significant collapse of the unfolded ensemble of pertactin, a secreted autotransporter virulence protein from Bordetella pertussis, as measured by small angle X-ray scattering (SAXS). Sequence patterns that lead to collapse also correlate with increased intermolecular polypeptide chain association and aggregation. Crucially, sequence patterns that support an expanded conformational ensemble enhance pertactin secretion to the bacterial cell surface. Similar sequence pattern features are enriched across the large and diverse family of autotransporter virulence proteins, suggesting sequence patterns that favor an expanded conformational ensemble are under selection for efficient autotransporter protein secretion, a necessary prerequisite for virulence. More broadly, we found that sequence patterns that lead to more expanded conformational ensembles are enriched across water-soluble proteins in general, suggesting protein sequences are under selection to regulate collapse and minimize protein aggregation, in addition to their roles in stabilizing folded protein structures.
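
As a rough illustration of what "local clustering" of hydrophobic residues means at the sequence level, the sketch below scores how unevenly hydrophobic residues are distributed along a chain. It is not the metric used in the study: the hydrophobic alphabet, the window size, and the function names `window_fractions` and `clustering_score` are all assumptions made for this example.

```python
from statistics import pvariance

# Assumption: a simple hydrophobic alphabet chosen for illustration only.
HYDROPHOBIC = set("AILMFVWY")


def window_fractions(seq, window=5):
    """Return the hydrophobic fraction in each sliding window of the sequence."""
    seq = seq.upper()
    if window <= 0 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    flags = [1 if aa in HYDROPHOBIC else 0 for aa in seq]
    return [
        sum(flags[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def clustering_score(seq, window=5):
    """Population variance of the window fractions.

    A higher value means the hydrophobic residues are more unevenly
    distributed (more clustered) along the chain.
    """
    return pvariance(window_fractions(seq, window))


# Two hypothetical sequences with identical composition (6 A, 6 G)
# but different patterning.
clustered = "AAAAAAGGGGGG"
dispersed = "AGAGAGAGAGAG"

if __name__ == "__main__":
    print("clustered:", clustering_score(clustered, window=4))
    print("dispersed:", clustering_score(dispersed, window=4))
```

Both toy sequences have the same amino acid composition, so any difference in score comes from patterning alone, which is the distinction the abstract draws between composition and sequence pattern.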

 
NSF-PAR ID:
10190154
Publisher / Repository:
Proceedings of the National Academy of Sciences
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
117
Issue:
38
ISSN:
0027-8424
Page Range / eLocation ID:
p. 23356-23364
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Intracellular compartmentalization plays a pivotal role in cellular function, with membrane-bound organelles and membrane-less biomolecular 'condensates' playing key roles. These condensates, formed through liquid-liquid phase separation (LLPS), enable selective compartmentalization without the barrier of a lipid bilayer, thereby facilitating rapid formation/dissolution in response to stimuli. Intrinsically disordered proteins (IDPs) and/or proteins with intrinsically disordered regions (IDRs), which are often rich in charged and polar amino acid sequences, scaffold many condensates, often in conjunction with RNA. Comprehending the impact of IDP/IDR sequences on phase separation poses a challenge due to the extensive chemical diversity resulting from the myriad amino acids and post-translational modifications. To tackle this hurdle, one approach has been to investigate LLPS in simplified polypeptide systems, which offer a narrower scope within the chemical space for exploration. This strategy is supported by studies that have demonstrated how IDP function can largely be understood based on general chemical features, such as clusters or patterns of charged amino acids, rather than residue-level effects, and the ways in which these kinds of motifs give rise to an ensemble of conformations. Our lab has utilized complex coacervates assembled from oppositely-charged polypeptides as a simplified material analogue to the complexity of liquid-liquid phase separated biological condensates. Complex coacervation is an associative LLPS that occurs due to the electrostatic complexation of oppositely-charged macro-ions. This process is believed to be driven by the entropic gains resulting from the release of bound counterions and the reorganization of water upon complex formation. 
Apart from their direct applicability to IDPs, polypeptides also serve as excellent model polymers for investigating molecular interactions due to the wide range of available side-chain functionalities and the capacity to finely regulate their sequence, thus enabling precise control over interactions with guest molecules. Here, we discuss fundamental studies examining how charge patterning, hydrophobicity, chirality, and architecture affect the phase separation of polypeptide-based complex coacervates. These efforts have leveraged a combination of experimental and computational approaches that provide insight into the molecular level interactions. We also examine how these parameters affect the ability of complex coacervates to incorporate globular proteins and viruses. These efforts couple directly with our fundamental studies into coacervate formation, as such ‘guest’ molecules should not be considered as experiencing simple encapsulation and are instead active participants in the electrostatic assembly of coacervate materials. Interestingly, we observed trends in the incorporation of proteins and viruses into coacervates formed using different chain length polypeptides that are not well explained by simple electrostatic arguments and may be the result of more complex interactions between globular and polymeric species. Additionally, we describe experimental evidence supporting the potential for complex coacervates to improve the thermal stability of embedded biomolecules such as viral vaccines. Ultimately, peptide-based coacervates have the potential to help unravel the physics behind biological condensates while paving the way for innovative methods in compartmentalization, purification, and biomolecule stabilization. These advancements could have implications spanning from medicine to biocatalysis. 
  2. Disordered proline-rich motifs are common across the proteomes of many species and are often involved in protein-protein interactions. Proline is a unique amino acid due to the covalent bond between the backbone nitrogen and the proline side chain. The resulting five-membered ring allows proline to sample the cis state about its peptide bond, which other residues cannot do as readily. Because proline-rich disordered sequences exist as ensembles that likely include structures with the proline peptide bond in cis, a robust methodology to accurately account for these conformations in the overall ensemble is crucial. Observing the cis conformations of proline in a disordered sequence is challenging both experimentally and computationally. Nitrogen-hydrogen NMR spectroscopy cannot directly observe proline residues, which lack an amide proton, and computational methods struggle to overcome the large kinetic barrier between the cis and trans states, since isomerization usually occurs on the order of seconds. In the current work, Gaussian accelerated molecular dynamics was used to overcome this free energy barrier and simulate proline isomerization in a tetrapeptide (KPTP) and in the 12-residue proline-rich SH3 binding peptide, ArkA. We found that Gaussian accelerated molecular dynamics, when combined with a lowered peptide bond dihedral angle potential energy barrier (15 kcal/mol), allowed sufficient sampling of the proline cis and trans states on a microsecond timescale. All ArkA prolines spend a significant fraction of time in cis, leading to a more compact ensemble with less polyproline II helix structure than an ArkA ensemble with all peptide bonds in trans. The ensemble containing cis prolines also matches more closely to in vitro circular dichroism data than the all-trans ensemble. The ability of the ArkA prolines to isomerize likely affects the peptide’s ability to bind its partner SH3 domain, and should be studied further. 
This is the first molecular dynamics simulation study of proline isomerization in a biologically relevant proline-rich sequence that we know of, and a similar protocol could be applied to study multi-proline isomerization in other proline-containing proteins to improve conformational diversity and agreement with in vitro data. 
  3. Protein aggregation is associated with a growing list of human diseases. A substantial fraction of proteins in eukaryotic proteomes constitutes a proteostasis network—a collection of proteins that work together to maintain properly folded proteins. One of the overarching functions of the proteostasis network is the prevention or reversal of protein aggregation. How proteins aggregate in spite of the anti-aggregation activity of the proteostasis machinery is incompletely understood. Exposed hydrophobic patches can trigger degradation by the ubiquitin-proteasome system, a key branch of the proteostasis network. However, in a recent study, we found that model glycine (G)-rich or glutamine/asparagine (Q/N)-rich prion-like domains differ in their susceptibility to detection and degradation by this system. Here, we expand upon this work by examining whether the features controlling the degradation of our model prion-like domains generalize broadly to G-rich and Q/N-rich domains. Experimentally, native yeast G-rich domains in isolation are sensitive to the degradation-promoting effects of hydrophobic residues, whereas native Q/N-rich domains completely resist these effects and tend to aggregate instead. Bioinformatic analyses indicate that native G-rich domains from yeast and humans tend to avoid degradation-promoting features, suggesting that the proteostasis network may act as a form of selection at the molecular level that constrains the sequence space accessible to G-rich domains. However, the sensitivity or resistance of G-rich and Q/N-rich domains, respectively, was not always preserved in their native protein contexts, highlighting that proteins can evolve other sequence features to overcome the intrinsic sensitivity of some LCDs to degradation. 
  4. The dimensions that unfolded proteins, including intrinsically disordered proteins (IDPs), adopt in the absence of denaturant remain controversial. We developed an analysis procedure for small-angle X-ray scattering (SAXS) profiles and used it to demonstrate that even relatively hydrophobic IDPs remain nearly as expanded in water as they are in high denaturant concentrations. In contrast, as demonstrated here, most fluorescence resonance energy transfer (FRET) measurements have indicated that relatively hydrophobic IDPs contract significantly in the absence of denaturant. We use two independent approaches to further explore this controversy. First, using SAXS we show that fluorophores employed in FRET can contribute to the observed discrepancy. Specifically, we find that addition of Alexa-488 to a normally expanded IDP causes contraction by an additional 15%, a value in reasonable accord with the contraction reported in FRET-based studies. Second, using our simulations and analysis procedure to accurately extract both the radius of gyration (Rg) and end-to-end distance (Ree) from SAXS profiles, we tested the recent suggestion that FRET and SAXS results can be reconciled if the Rg and Ree are “uncoupled” (i.e., no longer simply proportional), in contrast to the case for random walk homopolymers. We find, however, that even for unfolded proteins, these two measures of unfolded state dimensions remain proportional. Together, these results suggest that improved analysis procedures and a correction for significant, fluorophore-driven interactions are sufficient to reconcile prior SAXS and FRET studies, thus providing a unified picture of the nature of unfolded polypeptide chains in the absence of denaturant.

     
  5. Outer membrane proteins (OMPs) must exist as an unfolded ensemble while interacting with a chaperone network in the periplasm of Gram-negative bacteria. Here, we developed a method to model unfolded OMP (uOMP) conformational ensembles using the experimental properties of two well-studied OMPs. The overall sizes and shapes of the unfolded ensembles in the absence of a denaturant were experimentally defined by measuring the sedimentation coefficient as a function of urea concentration. We used these data to model a full range of unfolded conformations by parameterizing a targeted coarse-grained simulation protocol. The ensemble members were further refined by short molecular dynamics simulations to reflect proper torsion angles. The final conformational ensembles have polymer properties different from unfolded soluble and intrinsically disordered proteins and reveal inherent differences in the unfolded states that necessitate further investigation. Building these uOMP ensembles advances the understanding of OMP biogenesis and provides essential information for interpreting structures of uOMP-chaperone complexes. 