skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Negatively charged, intrinsically disordered regions can accelerate target search by DNA-binding proteins
Abstract In eukaryotes, many DNA/RNA-binding proteins possess intrinsically disordered regions (IDRs) with large negative charge, some of which involve a consecutive sequence of aspartate (D) or glutamate (E) residues. We refer to them as D/E repeats. The functional role of D/E repeats is not well understood, though some of them are known to cause autoinhibition through intramolecular electrostatic interaction with functional domains. In this work, we investigated the impacts of D/E repeats on the target DNA search kinetics for the high-mobility group box 1 (HMGB1) protein and the artificial protein constructs of the Antp homeodomain fused with D/E repeats of varied lengths. Our experimental data showed that D/E repeats of particular lengths can accelerate the target association in the overwhelming presence of non-functional high-affinity ligands (‘decoys’). Our coarse-grained molecular dynamics (CGMD) simulations showed that the autoinhibited proteins can bind to DNA and transition into the uninhibited complex with DNA through an electrostatically driven induced-fit process. In conjunction with the CGMD simulations, our kinetic model can explain how D/E repeats can accelerate the target association process in the presence of decoys. This study illuminates an unprecedented role of the negatively charged IDRs in the target search process.  more » « less
Award ID(s):
2026805
PAR ID:
10396617
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
51
Issue:
10
ISSN:
0305-1048
Page Range / eLocation ID:
p. 4701-4712
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Intrinsically disordered regions (IDRs) are important components of protein functionality, with their charge distribution serving as a key factor in determining their roles. Notably, many proteins possess IDRs that are highly negatively charged, characterized by sequences rich in aspartate (D) or glutamate (E) residues. Bioinformatic analyses indicate that negatively charged low-complexity IDRs are significantly more common than their positively charged counterparts rich in arginine (R) or lysine (K). For instance, sequences of 10 or more consecutive negatively charged residues (D or E) are present in 268 human proteins. In contrast, corresponding sequences of 10 or more consecutive positively charged residues (K or R) are present in only 12 human proteins. Interestingly, about 50% of proteins containing D/E tracts function as DNA-binding or RNA-binding proteins. Negatively charged IDRs can electrostatically mimic nucleic acids and dynamically compete with them for the DNA-binding domains (DBDs) or RNA-binding domains (RBDs) that are positively charged. This leads to a phenomenon known as autoinhibition, in which the negatively charged IDRs inhibit binding to nucleic acids by occupying the binding interfaces within the proteins through intramolecular interactions. Rather than merely reducing binding activity, negatively charged IDRs offer significant advantages for the functions of DNA/RNA-binding proteins. The dynamic competition between negatively charged IDRs and nucleic acids can accelerate the target search processes for these proteins. When a protein encounters DNA or RNA, the electrostatic repulsion force between the nucleic acids and the negatively charged IDRs can trigger conformational changes that allow the nucleic acids to access DBDs or RBDs. Additionally, when proteins are trapped at high-affinity non-target sites on DNA or RNA ("decoys"), the electrostatic repulsion from the negatively charged IDRs can rescue the proteins from these traps. Negatively charged IDRs act as gatekeepers, rejecting nonspecific ligands while allowing the target to access the molecular interfaces of DBDs or RBDs, which increases binding specificity. These IDRs can also promote proper protein folding, facilitate chromatin remodeling by displacing other proteins bound to DNA, and influence phase separation, affecting local pH. The functions of negatively charged IDRs can be regulated through protein-protein interactions, post-translational modifications, and proteolytic processing. These characteristics can be harnessed as tools for protein engineering. Some frame-shift mutations that convert negatively charged IDRs into positively charged ones are linked to human diseases. Therefore, it is crucial to understand the physicochemical properties and functional roles of negatively charged IDRs that compete with nucleic acids. 
    more » « less
  2. Nucleoli are multicomponent condensates defined by coexisting sub-phases. We identified distinct intrinsically disordered regions (IDRs), including acidic (D/E) tracts and K-blocks interspersed by E-rich regions, as defining features of nucleolar proteins. We show that the localization preferences of nucleolar proteins are determined by their IDRs and the types of RNA or DNA binding domains they encompass. In vitro reconstitutions and studies in cells showed how condensation, which combines binding and complex coacervation of nucleolar components, contributes to nucleolar organization. D/E tracts of nucleolar proteins contribute to lowering the pH of co-condensates formed with nucleolar RNAs in vitro. In cells, this sets up a pH gradient between nucleoli and the nucleoplasm. By contrast, juxta-nucleolar bodies, which have different macromolecular compositions, featuring protein IDRs with very different charge profiles, have pH values that are equivalent to or higher than the nucleoplasm. Our findings show that distinct compositional specificities generate distinct physicochemical properties for condensates. 
    more » « less
  3. Protein-DNA interactions are critical for the successful functioning of all natural systems. The key role in these interactions is played by processes of protein search for specific sites on DNA. Although it has been studied for many years, only recently microscopic aspects of these processes became more clear. In this work, we present a review on current theoretical understanding of the molecular mechanisms of the protein target search. A comprehensive discrete-state stochastic method to explain the dynamics of the protein search phenomena is introduced and explained. Our theoretical approach utilizes a first-passage analysis and it takes into account the most relevant physical-chemical processes. It is able to describe many fascinating features of the protein search, including unusually high effective association rates, high selectivity and specificity, and the robustness in the presence of crowders and sequence heterogeneity. 
    more » « less
  4. Abstract Some misfolded protein conformations can bypass proteostasis machinery and remain soluble in vivo. This is an unexpected observation, as cellular quality control mechanisms should remove misfolded proteins. Three questions, then, are: how do long-lived, soluble, misfolded proteins bypass proteostasis? How widespread are such misfolded states? And how long do they persist? We address these questions using coarse-grain molecular dynamics simulations of the synthesis, termination, and post-translational dynamics of a representative set of cytosolic E. coli proteins. We predict that half of proteins exhibit misfolded subpopulations that bypass molecular chaperones, avoid aggregation, and will not be rapidly degraded, with some misfolded states persisting for months or longer. The surface properties of these misfolded states are native-like, suggesting they will remain soluble, while self-entanglements make them long-lived kinetic traps. In terms of function, we predict that one-third of proteins can misfold into soluble less-functional states. For the heavily entangled protein glycerol-3-phosphate dehydrogenase, limited-proteolysis mass spectrometry experiments interrogating misfolded conformations of the protein are consistent with the structural changes predicted by our simulations. These results therefore provide an explanation for how proteins can misfold into soluble conformations with reduced functionality that can bypass proteostasis, and indicate, unexpectedly, this may be a wide-spread phenomenon. 
    more » « less
  5. Coarse-grained molecular dynamics (CGMD) simulations address lengthscales and timescales that are critical to many chemical and material applications. Nevertheless, contemporary CGMD modeling is relatively bespoke and there are no black-box CGMD methodologies available that could play a comparable role in discovery applications that density functional theory plays for electronic structure. This gap might be filled by machine learning (ML)-based CGMD potentials that simplify model development, but these methods are still in their early stages and have yet to demonstrate a significant advantage over existing physics-based CGMD methods. Here, we explore the potential of Δ-learning models to leverage the advantages of these two approaches. This is implemented by using ML-based potentials to learn the difference between the target CGMD variable and the predictions of physics-based potentials. The Δ-models are benchmarked against the baseline models in reproducing on-target and off-target atomistic properties as a function of CG resolution, mapping operator, and system topology. The Δ-models outperform the reference ML-only CGMD models in nearly all scenarios. In several cases, the ML-only models manage to minimize training errors while still producing qualitatively incorrect dynamics, which is corrected by the Δ-models. Given their negligible added cost, Δ-models provide essentially free gains over their ML-only counterparts. Nevertheless, an unexpected finding is that neither the Δ-learning models nor the ML-only models significantly outperform the elementary pairwise models in reproducing atomistic properties. This fundamental failure is attributed to the relatively large irreducible force errors associated with coarse-graining that produces little benefit from using more complex potentials. 
    more » « less