skip to main content


Title: Intrinsically disordered electronegative clusters improve stability and binding specificity of RNA-binding proteins
RNA-binding proteins play crucial roles in various cellular functions, and contain abundant disordered protein regions. The disordered regions in RNA-binding proteins are rich in repetitive sequences, such as poly-K/R, poly-N/Q, poly-A, and poly-G residues. Our bioinformatic analysis identified a largely neglected repetitive sequence family we define as electronegative clusters (ENCs) that contain acidic residues and/or phosphorylation sites. The abundance and length of ENCs exceed other known repetitive sequences. Despite their abundance, the functions of ENCs in RNA-binding proteins are still elusive. To investigate the impacts of ENCs on protein stability, RNA-binding affinity, and specificity, we selected one RNA-binding protein, the ribosomal biogenesis factor 15 (Nop15) as a model. We found that the Nop15 ENC increases protein stability and inhibits nonspecific RNA binding, but minimally interferes with specific RNA binding. To investigate the effect of ENCs on sequence specificity of RNA binding, we grafted an ENC to another RNA-binding protein, Ser/Arg-rich splicing factor 3 (SRSF3). Using RNA Bind-n-Seq, we found that the engineered ENC inhibits disparate RNA motifs differently, instead of weakening all RNA motifs to the same extent. The motif site directly involved in electrostatic interaction is more susceptible to the ENC inhibition. These results suggest that one of functions of ENCs is to regulate RNA binding via electrostatic interaction. This is consistent with our finding that ENCs are also overrepresented in DNA-binding proteins, while underrepresented in halophiles, in which nonspecific nucleic acid binding is inhibited by high concentrations of salts.  more » « less
Award ID(s):
2024964
NSF-PAR ID:
10282471
Author(s) / Creator(s):
; ; ; ; ; ;
Editor(s):
Musier-Forsyth, Karin
Date Published:
Journal Name:
Journal of biological chemistry
ISSN:
1083-351X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Human RNA‐binding motif 3 protein (RBM3) is a cold‐shock protein which functions in various aspects of global protein synthesis, cell proliferation and apoptosis by interacting with the components of basal translational machinery. RBM3 plays important roles in tumour progression and cancer metastasis, and also has been shown to be involved in neuroprotection and endoplasmic reticulum stress response. Here, we have solved the solution NMR structure of the N‐terminal 84 residue RNA recognition motif (RRM) of RBM3. The remaining residues are rich in RGG and YGG motifs and are disordered. The RRM domain adopts a βαββαβ topology, which is found in many RNA‐binding proteins. NMR‐monitored titration experiments and molecular dynamic simulations show that the beta‐sheet and two loops form the RNA‐binding interface. Hydrogen bond, pi–pi and pi–cation are the key interactions between the RNA and the RRM domain. NMR, size exclusion chromatography and chemical cross‐linking experiments show that RBM3 forms oligomers in solution, which is favoured by decrease in temperature, thus, potentially linking it to its function as a cold‐shock protein. Temperature‐dependent NMR studies revealed that oligomerization of the RRM domain occurs via nonspecific interactions. Overall, this study provides the detailed structural analysis of RRM domain of RBM3, its interaction with RNA and the molecular basis of its temperature‐dependent oligomerization.

     
    more » « less
  2. Disordered binding regions (DBRs), which are embedded within intrinsically disordered proteins or regions (IDPs or IDRs), enable IDPs or IDRs to mediate multiple protein-protein interactions. DBR-protein complexes were collected from the Protein Data Bank for which two or more DBRs having different amino acid sequences bind to the same (100% sequence identical) globular protein partner, a type of interaction herein called many-to-one binding. Two distinct binding profiles were identified: independent and overlapping. For the overlapping binding profiles, the distinct DBRs interact by means of almost identical binding sites (herein called “similar”), or the binding sites contain both common and divergent interaction residues (herein called “intersecting”). Further analysis of the sequence and structural differences among these three groups indicate how IDP flexibility allows different segments to adjust to similar, intersecting, and independent binding pockets. 
    more » « less
  3. Abstract

    In eukaryotes, many DNA/RNA-binding proteins possess intrinsically disordered regions (IDRs) with large negative charge, some of which involve a consecutive sequence of aspartate (D) or glutamate (E) residues. We refer to them as D/E repeats. The functional role of D/E repeats is not well understood, though some of them are known to cause autoinhibition through intramolecular electrostatic interaction with functional domains. In this work, we investigated the impacts of D/E repeats on the target DNA search kinetics for the high-mobility group box 1 (HMGB1) protein and the artificial protein constructs of the Antp homeodomain fused with D/E repeats of varied lengths. Our experimental data showed that D/E repeats of particular lengths can accelerate the target association in the overwhelming presence of non-functional high-affinity ligands (‘decoys’). Our coarse-grained molecular dynamics (CGMD) simulations showed that the autoinhibited proteins can bind to DNA and transition into the uninhibited complex with DNA through an electrostatically driven induced-fit process. In conjunction with the CGMD simulations, our kinetic model can explain how D/E repeats can accelerate the target association process in the presence of decoys. This study illuminates an unprecedented role of the negatively charged IDRs in the target search process.

     
    more » « less
  4. Crosson, Sean (Ed.)
    Quorum sensing is a chemical communication process that bacteria use to coordinate group behaviors. In the global pathogen Vibrio cholerae , one quorum-sensing receptor and transcription factor, called VqmA (VqmA Vc ), activates expression of the vqmR gene encoding the small regulatory RNA VqmR, which represses genes involved in virulence and biofilm formation. Vibriophage VP882 encodes a VqmA homolog called VqmA Phage that activates transcription of the phage gene qtip , and Qtip launches the phage lytic program. Curiously, VqmA Phage can activate vqmR expression but VqmA Vc cannot activate expression of qtip . Here, we investigate the mechanism underlying this asymmetry. We find that promoter selectivity is driven by each VqmA DNA-binding domain and key DNA sequences in the vqmR and qtip promoters are required to maintain specificity. A protein sequence-guided mutagenesis approach revealed that the residue E194 of VqmA Phage and A192, the equivalent residue in VqmA Vc , in the helix-turn-helix motifs contribute to promoter-binding specificity. A genetic screen to identify VqmA Phage mutants that are incapable of binding the qtip promoter but maintain binding to the vqmR promoter delivered additional VqmA Phage residues located immediately C-terminal to the helix-turn-helix motif as required for binding the qtip promoter. Surprisingly, these residues are conserved between VqmA Phage and VqmA Vc . A second, targeted genetic screen revealed a region located in the VqmA Vc DNA-binding domain that is necessary to prevent VqmA Vc from binding the qtip promoter, thus restricting DNA binding to the vqmR promoter. We propose that the VqmA Vc helix-turn-helix motif and the C-terminal flanking residues function together to prohibit VqmA Vc from binding the qtip promoter. 
    more » « less
  5. Abstract

    RNA‐protein interactions play essential roles in regulating gene expression. While some RNA‐protein interactions are “specific”, that is, the RNA‐binding proteins preferentially bind to particular RNA sequence or structural motifs, others are “non‐RNA specific.” Deciphering the protein‐RNA recognition code is essential for comprehending the functional implications of these interactions and for developing new therapies for many diseases. Because of the high cost of experimental determination of protein‐RNA interfaces, there is a need for computational methods to identify RNA‐binding residues in proteins. While most of the existing computational methods for predicting RNA‐binding residues in RNA‐binding proteins are oblivious to the characteristics of the partner RNA, there is growing interest in methods for partner‐specific prediction of RNA binding sites in proteins. In this work, we assess the performance of two recently published partner‐specific protein‐RNA interface prediction tools, PS‐PRIP, and PRIdictor, along with our own new tools. Specifically, we introduce a novel metric, RNA‐specificity metric (RSM), for quantifying the RNA‐specificity of the RNA binding residues predicted by such tools. Our results show that the RNA‐binding residues predicted by previously published methods are oblivious to the characteristics of the putative RNA binding partner. Moreover, when evaluated using partner‐agnostic metrics, RNA partner‐specific methods are outperformed by the state‐of‐the‐art partner‐agnostic methods. We conjecture that either (a) the protein‐RNA complexes in PDB are not representative of the protein‐RNA interactions in nature, or (b) the current methods for partner‐specific prediction of RNA‐binding residues in proteins fail to account for the differences in RNA partner‐specific versus partner‐agnostic protein‐RNA interactions, or both.

     
    more » « less