Abstract RNA‐protein interactions play essential roles in regulating gene expression. While some RNA‐protein interactions are “specific”, that is, the RNA‐binding proteins preferentially bind to particular RNA sequence or structural motifs, others are “non‐RNA specific.” Deciphering the protein‐RNA recognition code is essential for comprehending the functional implications of these interactions and for developing new therapies for many diseases. Because of the high cost of experimental determination of protein‐RNA interfaces, there is a need for computational methods to identify RNA‐binding residues in proteins. While most of the existing computational methods for predicting RNA‐binding residues in RNA‐binding proteins are oblivious to the characteristics of the partner RNA, there is growing interest in methods for partner‐specific prediction of RNA binding sites in proteins. In this work, we assess the performance of two recently published partner‐specific protein‐RNA interface prediction tools, PS‐PRIP, and PRIdictor, along with our own new tools. Specifically, we introduce a novel metric, RNA‐specificity metric (RSM), for quantifying the RNA‐specificity of the RNA binding residues predicted by such tools. Our results show that the RNA‐binding residues predicted by previously published methods are oblivious to the characteristics of the putative RNA binding partner. Moreover, when evaluated using partner‐agnostic metrics, RNA partner‐specific methods are outperformed by the state‐of‐the‐art partner‐agnostic methods. We conjecture that either (a) the protein‐RNA complexes in PDB are not representative of the protein‐RNA interactions in nature, or (b) the current methods for partner‐specific prediction of RNA‐binding residues in proteins fail to account for the differences in RNA partner‐specific versus partner‐agnostic protein‐RNA interactions, or both.
more »
« less
Intrinsically disordered electronegative clusters improve stability and binding specificity of RNA-binding proteins
RNA-binding proteins play crucial roles in various cellular functions, and contain abundant disordered protein regions. The disordered regions in RNA-binding proteins are rich in repetitive sequences, such as poly-K/R, poly-N/Q, poly-A, and poly-G residues. Our bioinformatic analysis identified a largely neglected repetitive sequence family we define as electronegative clusters (ENCs) that contain acidic residues and/or phosphorylation sites. The abundance and length of ENCs exceed other known repetitive sequences. Despite their abundance, the functions of ENCs in RNA-binding proteins are still elusive. To investigate the impacts of ENCs on protein stability, RNA-binding affinity, and specificity, we selected one RNA-binding protein, the ribosomal biogenesis factor 15 (Nop15) as a model. We found that the Nop15 ENC increases protein stability and inhibits nonspecific RNA binding, but minimally interferes with specific RNA binding. To investigate the effect of ENCs on sequence specificity of RNA binding, we grafted an ENC to another RNA-binding protein, Ser/Arg-rich splicing factor 3 (SRSF3). Using RNA Bind-n-Seq, we found that the engineered ENC inhibits disparate RNA motifs differently, instead of weakening all RNA motifs to the same extent. The motif site directly involved in electrostatic interaction is more susceptible to the ENC inhibition. These results suggest that one of functions of ENCs is to regulate RNA binding via electrostatic interaction. This is consistent with our finding that ENCs are also overrepresented in DNA-binding proteins, while underrepresented in halophiles, in which nonspecific nucleic acid binding is inhibited by high concentrations of salts.
more »
« less
- Award ID(s):
- 2024964
- PAR ID:
- 10282471
- Editor(s):
- Musier-Forsyth, Karin
- Date Published:
- Journal Name:
- Journal of biological chemistry
- ISSN:
- 1083-351X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Human RNA‐binding motif 3 protein (RBM3) is a cold‐shock protein which functions in various aspects of global protein synthesis, cell proliferation and apoptosis by interacting with the components of basal translational machinery. RBM3 plays important roles in tumour progression and cancer metastasis, and also has been shown to be involved in neuroprotection and endoplasmic reticulum stress response. Here, we have solved the solution NMR structure of the N‐terminal 84 residue RNA recognition motif (RRM) of RBM3. The remaining residues are rich in RGG and YGG motifs and are disordered. The RRM domain adopts a βαββαβ topology, which is found in many RNA‐binding proteins. NMR‐monitored titration experiments and molecular dynamic simulations show that the beta‐sheet and two loops form the RNA‐binding interface. Hydrogen bond, pi–pi and pi–cation are the key interactions between the RNA and the RRM domain. NMR, size exclusion chromatography and chemical cross‐linking experiments show that RBM3 forms oligomers in solution, which is favoured by decrease in temperature, thus, potentially linking it to its function as a cold‐shock protein. Temperature‐dependent NMR studies revealed that oligomerization of the RRM domain occurs via nonspecific interactions. Overall, this study provides the detailed structural analysis of RRM domain of RBM3, its interaction with RNA and the molecular basis of its temperature‐dependent oligomerization.more » « less
-
Disordered binding regions (DBRs), which are embedded within intrinsically disordered proteins or regions (IDPs or IDRs), enable IDPs or IDRs to mediate multiple protein-protein interactions. DBR-protein complexes were collected from the Protein Data Bank for which two or more DBRs having different amino acid sequences bind to the same (100% sequence identical) globular protein partner, a type of interaction herein called many-to-one binding. Two distinct binding profiles were identified: independent and overlapping. For the overlapping binding profiles, the distinct DBRs interact by means of almost identical binding sites (herein called “similar”), or the binding sites contain both common and divergent interaction residues (herein called “intersecting”). Further analysis of the sequence and structural differences among these three groups indicate how IDP flexibility allows different segments to adjust to similar, intersecting, and independent binding pockets.more » « less
-
null (Ed.)Electrostatic protein/DNA interactions arise from the neutralization of the DNA phosphodiester backbone as well as coupled exchanges by charged protein residues as salt bridges or with mobile ions. Much focus has been and continues to be paid to interfacial ion pairs with DNA. The role of extra-interfacial ionic interactions, particularly as dynamic drivers of DNA sequence selectivity, remain poorly known. The ETS family of transcription factors represents an attractive model for addressing this knowledge gap given their diverse ionic composition in primary structures that fold to a tightly conserved DNA-binding motif. To probe the importance of extra-interfacial salt bridges in DNA recognition, we compared the salt-dependent binding by Elk1 with ETV6, two ETS homologs differing markedly in ionic composition. While both proteins exhibit salt-dependent binding with cognate DNA that corresponds to interfacial phosphate contacts, their nonspecific binding diverges from cognate binding as well as each other. Molecular dynamics simulations in explicit solvent, which generated ionic interactions in agreement with the experimental binding data, revealed distinct salt-bridge dynamics in the nonspecific complexes formed by the two proteins. Impaired DNA contact by ETV6 resulted in fewer backbone contacts in the nonspecific complex, while Elk1 exhibited a redistribution of extra-interfacial salt bridges via residues that are non-conserved between the two ETS relatives. Thus, primary structure variation in ionic residues can encode highly differentiated specificity mechanisms in a highly conserved DNA-binding motif.more » « less
-
Abstract Charged residues on the surface of proteins are critical for both protein stability and interactions. However, many proteins contain binding regions with a high net charge that may destabilize the protein but are useful for binding to oppositely charged targets. We hypothesized that these domains would be marginally stable, as electrostatic repulsion would compete with favorable hydrophobic collapse during folding. Furthermore, by increasing the salt concentration, we predict that these protein folds would be stabilized by mimicking some of the favorable electrostatic interactions that take place during target binding. We varied the salt and urea concentrations to probe the contributions of electrostatic and hydrophobic interactions for the folding of the yeast SH3 domain found in Abp1p. The SH3 domain was significantly stabilized with increased salt concentrations due to Debye–Huckel screening and a nonspecific territorial ion‐binding effect. Molecular dynamics and NMR show that sodium ions interact with all 15 acidic residues but do little to change backbone dynamics or overall structure. Folding kinetics experiments show that the addition of urea or salt primarily affects the folding rate, indicating that almost all the hydrophobic collapse and electrostatic repulsion occur in the transition state. After the transition state formation, modest yet favorable short‐range salt bridges are formed along with hydrogen bonds, as the native state fully folds. Thus, hydrophobic collapse offsets electrostatic repulsion to ensure this highly charged binding domain can still fold and be ready to bind to its charged peptide targets, a property that is likely evolutionarily conserved over 1 billion years.more » « less