Abstract In eukaryotes, many DNA/RNA-binding proteins possess intrinsically disordered regions (IDRs) with large negative charge, some of which involve a consecutive sequence of aspartate (D) or glutamate (E) residues. We refer to them as D/E repeats. The functional role of D/E repeats is not well understood, though some of them are known to cause autoinhibition through intramolecular electrostatic interaction with functional domains. In this work, we investigated the impacts of D/E repeats on the target DNA search kinetics for the high-mobility group box 1 (HMGB1) protein and the artificial protein constructs of the Antp homeodomain fused with D/E repeats of varied lengths. Our experimental data showed that D/E repeats of particular lengths can accelerate the target association in the overwhelming presence of non-functional high-affinity ligands (‘decoys’). Our coarse-grained molecular dynamics (CGMD) simulations showed that the autoinhibited proteins can bind to DNA and transition into the uninhibited complex with DNA through an electrostatically driven induced-fit process. In conjunction with the CGMD simulations, our kinetic model can explain how D/E repeats can accelerate the target association process in the presence of decoys. This study illuminates an unprecedented role of the negatively charged IDRs in the target search process. 
                        more » 
                        « less   
                    This content will become publicly available on July 9, 2026
                            
                            Competition between Nucleic Acids and Intrinsically Disordered Regions within Proteins
                        
                    
    
            Intrinsically disordered regions (IDRs) are important components of protein functionality, with their charge distribution serving as a key factor in determining their roles. Notably, many proteins possess IDRs that are highly negatively charged, characterized by sequences rich in aspartate (D) or glutamate (E) residues. Bioinformatic analyses indicate that negatively charged low-complexity IDRs are significantly more common than their positively charged counterparts rich in arginine (R) or lysine (K). For instance, sequences of 10 or more consecutive negatively charged residues (D or E) are present in 268 human proteins. In contrast, corresponding sequences of 10 or more consecutive positively charged residues (K or R) are present in only 12 human proteins. Interestingly, about 50% of proteins containing D/E tracts function as DNA-binding or RNA-binding proteins. Negatively charged IDRs can electrostatically mimic nucleic acids and dynamically compete with them for the DNA-binding domains (DBDs) or RNA-binding domains (RBDs) that are positively charged. This leads to a phenomenon known as autoinhibition, in which the negatively charged IDRs inhibit binding to nucleic acids by occupying the binding interfaces within the proteins through intramolecular interactions. Rather than merely reducing binding activity, negatively charged IDRs offer significant advantages for the functions of DNA/RNA-binding proteins. The dynamic competition between negatively charged IDRs and nucleic acids can accelerate the target search processes for these proteins. When a protein encounters DNA or RNA, the electrostatic repulsion force between the nucleic acids and the negatively charged IDRs can trigger conformational changes that allow the nucleic acids to access DBDs or RBDs. Additionally, when proteins are trapped at high-affinity non-target sites on DNA or RNA ("decoys"), the electrostatic repulsion from the negatively charged IDRs can rescue the proteins from these traps. Negatively charged IDRs act as gatekeepers, rejecting nonspecific ligands while allowing the target to access the molecular interfaces of DBDs or RBDs, which increases binding specificity. These IDRs can also promote proper protein folding, facilitate chromatin remodeling by displacing other proteins bound to DNA, and influence phase separation, affecting local pH. The functions of negatively charged IDRs can be regulated through protein-protein interactions, post-translational modifications, and proteolytic processing. These characteristics can be harnessed as tools for protein engineering. Some frame-shift mutations that convert negatively charged IDRs into positively charged ones are linked to human diseases. Therefore, it is crucial to understand the physicochemical properties and functional roles of negatively charged IDRs that compete with nucleic acids. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2026805
- PAR ID:
- 10614226
- Publisher / Repository:
- American Chemical Society
- Date Published:
- Journal Name:
- Accounts of Chemical Research
- ISSN:
- 0001-4842
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract Charged residues on the surface of proteins are critical for both protein stability and interactions. However, many proteins contain binding regions with a high net charge that may destabilize the protein but are useful for binding to oppositely charged targets. We hypothesized that these domains would be marginally stable, as electrostatic repulsion would compete with favorable hydrophobic collapse during folding. Furthermore, by increasing the salt concentration, we predict that these protein folds would be stabilized by mimicking some of the favorable electrostatic interactions that take place during target binding. We varied the salt and urea concentrations to probe the contributions of electrostatic and hydrophobic interactions for the folding of the yeast SH3 domain found in Abp1p. The SH3 domain was significantly stabilized with increased salt concentrations due to Debye–Huckel screening and a nonspecific territorial ion‐binding effect. Molecular dynamics and NMR show that sodium ions interact with all 15 acidic residues but do little to change backbone dynamics or overall structure. Folding kinetics experiments show that the addition of urea or salt primarily affects the folding rate, indicating that almost all the hydrophobic collapse and electrostatic repulsion occur in the transition state. After the transition state formation, modest yet favorable short‐range salt bridges are formed along with hydrogen bonds, as the native state fully folds. Thus, hydrophobic collapse offsets electrostatic repulsion to ensure this highly charged binding domain can still fold and be ready to bind to its charged peptide targets, a property that is likely evolutionarily conserved over 1 billion years.more » « less
- 
            Musier-Forsyth, Karin (Ed.)RNA-binding proteins play crucial roles in various cellular functions, and contain abundant disordered protein regions. The disordered regions in RNA-binding proteins are rich in repetitive sequences, such as poly-K/R, poly-N/Q, poly-A, and poly-G residues. Our bioinformatic analysis identified a largely neglected repetitive sequence family we define as electronegative clusters (ENCs) that contain acidic residues and/or phosphorylation sites. The abundance and length of ENCs exceed other known repetitive sequences. Despite their abundance, the functions of ENCs in RNA-binding proteins are still elusive. To investigate the impacts of ENCs on protein stability, RNA-binding affinity, and specificity, we selected one RNA-binding protein, the ribosomal biogenesis factor 15 (Nop15) as a model. We found that the Nop15 ENC increases protein stability and inhibits nonspecific RNA binding, but minimally interferes with specific RNA binding. To investigate the effect of ENCs on sequence specificity of RNA binding, we grafted an ENC to another RNA-binding protein, Ser/Arg-rich splicing factor 3 (SRSF3). Using RNA Bind-n-Seq, we found that the engineered ENC inhibits disparate RNA motifs differently, instead of weakening all RNA motifs to the same extent. The motif site directly involved in electrostatic interaction is more susceptible to the ENC inhibition. These results suggest that one of functions of ENCs is to regulate RNA binding via electrostatic interaction. This is consistent with our finding that ENCs are also overrepresented in DNA-binding proteins, while underrepresented in halophiles, in which nonspecific nucleic acid binding is inhibited by high concentrations of salts.more » « less
- 
            TheVibrio choleraeCascade–TniQ complex unveiled a new paradigm in biology, demonstrating that CRISPR-associated proteins can direct DNA transposition. Despite the tremendous potential of “knocking-in” genes at desired sites, the mechanisms underlying DNA binding and transposition remain elusive. In this system, a conformational change of the Cas8 protein is essential for DNA binding, yet how it occurs is unclear. Here, structural modeling and free energy simulations reconstruct the Cas8 helical bundle and reveal an open–closed conformational change that is key for the complex’s function. We show that when Cascade–TniQ binds RNA, the Cas8 bundle changes conformation mediated by the interaction with the Cas7.1 protein. This interaction promotes the bundle’s transition toward the open state, priming the complex for DNA binding. As the target DNA binds the guide RNA, the opening of the Cas8 bundle becomes more favorable, exposing positively charged residues and facilitating their interaction with DNA, which ultimately leads the DNA-binding process to completion. These outcomes provide a dynamic representation of a critical conformational change in one of the largest CRISPR systems and illustrate its role at critical steps of the Cascade–TniQ biophysical function, advancing our understanding of nucleic acid binding and transposition mechanisms.more » « less
- 
            Intrinsically disordered protein regions (IDRs) are well established as contributors to intermolecular interactions and the formation of biomolecular condensates. In particular, RNA-binding proteins (RBPs) often harbor IDRs in addition to folded RNA-binding domains that contribute to RBP function. To understand the dynamic interactions of an IDR–RNA complex, we characterized the RNA-binding features of a small (68 residues), positively charged IDR-containing protein, Small ERDK-Rich Factor (SERF). At high concentrations, SERF and RNA undergo charge-driven associative phase separation to form a protein- and RNA-rich dense phase. A key advantage of this model system is that this threshold for demixing is sufficiently high that we could use solution-state biophysical methods to interrogate the stoichiometric complexes of SERF with RNA in the one-phase regime. Herein, we describe our comprehensive characterization of SERF alone and in complex with a small fragment of the HIV-1 Trans-Activation Response (TAR) RNA with complementary biophysical methods and molecular simulations. We find that this binding event is not accompanied by the acquisition of structure by either molecule; however, we see evidence for a modest global compaction of the SERF ensemble when bound to RNA. This behavior likely reflects attenuated charge repulsion within SERF via binding to the polyanionic RNA and provides a rationale for the higher-order assembly of SERF in the context of RNA. We envision that the SERF–RNA system will lower the barrier to accessing the details that support IDR–RNA interactions and likewise deepen our understanding of the role of IDR–RNA contacts in complex formation and liquid–liquid phase separation.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
