Title: Identifying sequence perturbations to an intrinsically disordered protein that determine its phase-separation behavior

Phase separation of intrinsically disordered proteins (IDPs) commonly underlies the formation of membraneless organelles, which compartmentalize molecules intracellularly in the absence of a lipid membrane. Identifying the protein sequence features responsible for IDP phase separation is critical for understanding physiological roles and pathological consequences of biomolecular condensation, as well as for harnessing phase separation for applications in bioinspired materials design. To expand our knowledge of sequence determinants of IDP phase separation, we characterized variants of the intrinsically disordered RGG domain from LAF-1, a model protein involved in phase separation and a key component of P granules. Based on a predictive coarse-grained IDP model, we identified a region of the RGG domain that has high contact probability and is highly conserved between species; deletion of this region significantly disrupts phase separation in vitro and in vivo. We determined the effects of charge patterning on phase behavior through sequence shuffling. We designed sequences with significantly increased phase separation propensity by shuffling the wild-type sequence, which contains well-mixed charged residues, to increase charge segregation. This result indicates the natural sequence is under negative selection to moderate this mode of interaction. We measured the contributions of tyrosine and arginine residues to phase separation experimentally through mutagenesis studies and computationally through direct interrogation of different modes of interaction using all-atom simulations. Finally, we show that despite these sequence perturbations, the RGG-derived condensates remain liquid-like. Together, these studies advance our fundamental understanding of key biophysical principles and sequence features important to phase separation.
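The charge-shuffling idea in the abstract can be illustrated with a small sketch. It assumes one commonly used patterning metric, sequence charge decoration (SCD), where more negative values indicate stronger segregation of opposite charges; the abstract does not state which metric the authors used. The toy sequences and the greedy swap search below are hypothetical illustrations, not the LAF-1 RGG sequence or the authors' design procedure.

```python
import random

# Simplified charge assignment at neutral pH (assumption: His treated as neutral).
CHARGE = {"R": 1, "K": 1, "D": -1, "E": -1}


def scd(seq: str) -> float:
    """Sequence charge decoration: (1/N) * sum over i<j of q_i * q_j * |j - i|**0.5."""
    q = [CHARGE.get(aa, 0) for aa in seq]
    n = len(q)
    total = 0.0
    for i in range(n):
        if q[i] == 0:
            continue  # neutral residues contribute nothing
        for j in range(i + 1, n):
            total += q[i] * q[j] * (j - i) ** 0.5
    return total / n


def shuffle_toward_segregation(seq: str, steps: int = 2000, seed: int = 0):
    """Greedy search: swap two positions, keep the swap only if SCD decreases.

    Swapping preserves amino-acid composition, so only the patterning changes.
    """
    rng = random.Random(seed)
    current = list(seq)
    best_score = scd("".join(current))
    for _ in range(steps):
        i, j = rng.sample(range(len(current)), 2)
        current[i], current[j] = current[j], current[i]
        score = scd("".join(current))
        if score < best_score:
            best_score = score
        else:
            current[i], current[j] = current[j], current[i]  # revert the swap
    return "".join(current), best_score


# Hypothetical toy sequences with identical composition.
mixed = "GRGDGRGDGRGDGRGD"       # charges well mixed
segregated = "GRGRGRGRGDGDGDGD"  # charges grouped into two blocks

shuffled, shuffled_score = shuffle_toward_segregation(mixed)
print(scd(mixed), scd(segregated), shuffled_score)
```

In this toy case the block-segregated sequence has a more negative SCD than the well-mixed one, and the shuffled variant keeps the original composition while scoring no higher than its starting point.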

 
Award ID(s):
1845734
NSF-PAR ID:
10149664
Author(s) / Creator(s):
Publisher / Repository:
Proceedings of the National Academy of Sciences
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
117
Issue:
21
ISSN:
0027-8424
Page Range / eLocation ID:
p. 11421-11431
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Intracellular compartmentalization plays a pivotal role in cellular function, with membrane-bound organelles and membrane-less biomolecular 'condensates' playing key roles. These condensates, formed through liquid-liquid phase separation (LLPS), enable selective compartmentalization without the barrier of a lipid bilayer, thereby facilitating rapid formation/dissolution in response to stimuli. Intrinsically disordered proteins (IDPs) and/or proteins with intrinsically disordered regions (IDRs), which are often rich in charged and polar amino acid sequences, scaffold many condensates, often in conjunction with RNA. Comprehending the impact of IDP/IDR sequences on phase separation poses a challenge due to the extensive chemical diversity resulting from the myriad amino acids and post-translational modifications. To tackle this hurdle, one approach has been to investigate LLPS in simplified polypeptide systems, which offer a narrower scope within the chemical space for exploration. This strategy is supported by studies that have demonstrated how IDP function can largely be understood based on general chemical features, such as clusters or patterns of charged amino acids, rather than residue-level effects, and the ways in which these kinds of motifs give rise to an ensemble of conformations. Our lab has utilized complex coacervates assembled from oppositely-charged polypeptides as a simplified material analogue to the complexity of liquid-liquid phase separated biological condensates. Complex coacervation is an associative LLPS that occurs due to the electrostatic complexation of oppositely-charged macro-ions. This process is believed to be driven by the entropic gains resulting from the release of bound counterions and the reorganization of water upon complex formation. 
Apart from their direct applicability to IDPs, polypeptides also serve as excellent model polymers for investigating molecular interactions due to the wide range of available side-chain functionalities and the capacity to finely regulate their sequence, thus enabling precise control over interactions with guest molecules. Here, we discuss fundamental studies examining how charge patterning, hydrophobicity, chirality, and architecture affect the phase separation of polypeptide-based complex coacervates. These efforts have leveraged a combination of experimental and computational approaches that provide insight into the molecular level interactions. We also examine how these parameters affect the ability of complex coacervates to incorporate globular proteins and viruses. These efforts couple directly with our fundamental studies into coacervate formation, as such ‘guest’ molecules should not be considered as experiencing simple encapsulation and are instead active participants in the electrostatic assembly of coacervate materials. Interestingly, we observed trends in the incorporation of proteins and viruses into coacervates formed using different chain length polypeptides that are not well explained by simple electrostatic arguments and may be the result of more complex interactions between globular and polymeric species. Additionally, we describe experimental evidence supporting the potential for complex coacervates to improve the thermal stability of embedded biomolecules such as viral vaccines. Ultimately, peptide-based coacervates have the potential to help unravel the physics behind biological condensates while paving the way for innovative methods in compartmentalization, purification, and biomolecule stabilization. These advancements could have implications spanning from medicine to biocatalysis. 
  2. Ribonucleoprotein (RNP) granules are membraneless organelles (MLOs) that consist mainly of RNA and RNA-binding proteins and form via liquid–liquid phase separation (LLPS). Experimental studies investigating the drivers of LLPS have shown that intrinsically disordered proteins (IDPs) and nucleic acids such as RNA and other polynucleotides play a key role in modulating protein phase separation. Few computational modelling techniques currently allow one to examine how polynucleotides modulate or promote LLPS in cells. Here, we present a coarse-grained polynucleotide model developed to fill this gap, which, together with our recently developed HPS model for protein LLPS, allows us to capture the factors driving protein-polynucleotide phase separation. We explore the capabilities of the modelling framework with the LAF-1 RGG system, which has been well studied in experiments and previously with the HPS model. Taking further advantage of the fact that the HPS model maintains sequence specificity, we explore the role of charge patterning in controlling polynucleotide incorporation into condensates. With increased charge patterning, we observe the formation of structured or patterned condensates, which suggests possible roles for polynucleotides not only in shifting the phase boundaries but also in introducing microscopic organization in MLOs.
  3. The intrinsically disordered RG/RGG repeat domain is found in several nucleolar and P-granule proteins, but how it influences their phase separation into biomolecular condensates is unclear. We survey all RG/RGG repeats in C. elegans and uncover nucleolar and P-granule-specific RG/RGG motifs. An uncharacterized protein, K07H8.10, contains the longest nucleolar-like RG/RGG domain in C. elegans. Domain and sequence similarity, as well as nucleolar localization, reveals K07H8.10 (NUCL-1) to be the homolog of Nucleolin, a protein conserved across animals, plants, and fungi, but previously thought to be absent in nematodes. Deleting the RG/RGG repeats within endogenous NUCL-1 and a second nucleolar protein, GARR-1 (GAR1), demonstrates these domains are dispensable for nucleolar accumulation. Instead, their RG/RGG repeats contribute to the phase separation of proteins into nucleolar sub-compartments. Despite this common RG/RGG repeat function, only removal of the GARR-1 RG/RGG domain affects worm fertility and development, decoupling precise sub-nucleolar structure from nucleolar function.
  4. Across bacteria, protein-based organelles called bacterial microcompartments (BMCs) encapsulate key enzymes to regulate their activities. The model BMC is the carboxysome that encapsulates enzymes for CO2 fixation to increase efficiency and is found in many autotrophic bacteria, such as cyanobacteria. Despite their importance in the global carbon cycle, little is known about how carboxysomes are spatially regulated. We recently identified the two-factor system required for the maintenance of carboxysome distribution (McdAB). McdA drives the equal spacing of carboxysomes via interactions with McdB, which associates with carboxysomes. McdA is a ParA/MinD ATPase, a protein family well studied in positioning diverse cellular structures in bacteria. However, the adaptor proteins like McdB that connect these ATPases to their cargos are extremely diverse. In fact, McdB represents a completely unstudied class of proteins. Despite the diversity, many adaptor proteins undergo phase separation, but functional roles remain unclear. Here, we define the domain architecture of McdB from the model cyanobacterium Synechococcus elongatus PCC 7942, and dissect its mode of biomolecular condensate formation. We identify an N-terminal intrinsically disordered region (IDR) that modulates condensate solubility, a central coiled-coil dimerizing domain that drives condensate formation, and a C-terminal domain that trimerizes McdB dimers and provides increased valency for condensate formation. We then identify critical basic residues in the IDR, which we mutate to glutamines to solubilize condensates. Finally, we find that a condensate-defective mutant of McdB has altered association with carboxysomes and influences carboxysome enzyme content. The results have broad implications for understanding spatial organization of BMCs and the molecular grammar of protein condensates.
  5. Shuttle protein UBQLN2 functions in protein quality control (PQC) by binding to proteasomal receptors and ubiquitinated substrates via its N‐terminal ubiquitin‐like (UBL) and C‐terminal ubiquitin‐associated (UBA) domains, respectively. Between these two folded domains are low‐complexity STI1‐I and STI1‐II regions, connected by disordered linkers. The STI1 regions bind other components, such as HSP70, that are important to the PQC functions of UBQLN2. We recently determined that the STI1‐II region enables UBQLN2 to undergo liquid–liquid phase separation (LLPS) to form liquid droplets in vitro and biomolecular condensates in cells. However, how the interplay between the folded (UBL/UBA) domains and the intrinsically disordered regions mediates phase separation is largely unknown. Using engineered domain deletion constructs, we found that removing the UBA domain inhibits UBQLN2 LLPS while removing the UBL domain enhances LLPS, suggesting that UBA and UBL domains contribute asymmetrically in modulating UBQLN2 LLPS. To explain these differential effects, we interrogated the interactions that involve the UBA and UBL domains across the entire UBQLN2 molecule using nuclear magnetic resonance spectroscopy. To our surprise, aside from well‐studied canonical UBL:UBA interactions, there also exist moderate interactions between the UBL and several disordered regions, including STI1‐I and residues 555–570, the latter of which is a known contributor to UBQLN2 LLPS. Our findings are essential for the understanding of both the molecular driving forces of UBQLN2 LLPS and the effects of ligand binding to UBL, UBA, or disordered regions on the phase behavior and physiological functions of UBQLN2.