Title: Improved modeling of RNA-binding protein motifs in an interpretable neural model of RNA splicing
Abstract Sequence-specific RNA-binding proteins (RBPs) play central roles in splicing decisions. Here, we describe a modular splicing architecture that leverages in vitro-derived RNA affinity models for 79 human RBPs and the annotated human genome to produce improved models of RBP binding and activity. Binding and activity are modeled by separate Motif and Aggregator components that can be mixed and matched, enforcing sparsity to improve interpretability. Training a new Adjusted Motif (AM) architecture on the splicing task not only yields better splicing predictions but also improves prediction of RBP-binding sites in vivo and of splicing activity, assessed using independent data.
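The split between a binding (Motif) stage and an activity (Aggregator) stage can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the 5-nt position weight matrix, the log-odds scoring, the hard threshold used to enforce sparsity, the logistic aggregator, and all names (`TOY_PWM`, `motif_scores`, `sparsify`, `aggregate`, `toy_rbp`) are hypothetical.

```python
import math

# Hypothetical position weight matrix for a toy 5-nt motif (GCAUG).
# Each column gives assumed base probabilities at that motif position.
PREFERRED = "GCAUG"
TOY_PWM = [
    {base: (0.85 if base == pref else 0.05) for base in "ACGU"}
    for pref in PREFERRED
]


def motif_scores(seq, pwm, background=0.25):
    """Motif stage: log-odds binding score at every start position."""
    width = len(pwm)
    return [
        sum(math.log2(pwm[k][seq[start + k]] / background) for k in range(width))
        for start in range(len(seq) - width + 1)
    ]


def sparsify(scores, threshold):
    """Keep only scores above the threshold; all other positions become 0."""
    return [s if s > threshold else 0.0 for s in scores]


def aggregate(sites_by_rbp, weights, bias=0.0):
    """Aggregator stage: weighted sum of retained site scores, squashed to (0, 1).

    A positive weight models an RBP that promotes splice-site use,
    a negative weight one that represses it (assumed convention).
    """
    total = bias + sum(
        weights[rbp] * sum(scores) for rbp, scores in sites_by_rbp.items()
    )
    return 1.0 / (1.0 + math.exp(-total))


if __name__ == "__main__":
    rna = "AAGCAUGAA"
    sites = sparsify(motif_scores(rna, TOY_PWM), threshold=2.0)
    print("retained sites:", sites)
    print("splice score:", aggregate({"toy_rbp": sites}, {"toy_rbp": 0.5}))
```

Because each stage is a plain function, either one could be swapped for a different implementation without touching the other, which is the mix-and-match property the abstract describes. The thresholding step means each prediction can be traced back to a small number of retained binding sites.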
Award ID(s):
1918839
PAR ID:
10486175
Publisher / Repository:
Springer Science + Business Media
Journal Name:
Genome Biology
Volume:
25
Issue:
1
ISSN:
1474-760X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. RNA-binding proteins (RBPs) participate in all stages of the RNA life cycle, from transcription and splicing to translation. Under the ENCODE project, a large number of RBPs were knocked down in human cancer cell lines, offering an excellent opportunity to infer targets of RBPs. Taking both RBP binding sites and RNA-seq profiles of RBP knockdown samples as input, we present a pipeline to identify causal RBP-RNA interactions. The pipeline employs a recent functional chi-square test (FunChisq) that deciphers directional association, and utilizes a novel functional index that measures the effect size of functional dependency. We examined ∼45 million RBP-RNA pairs in leukemia (K562) and liver cancer (HepG2) cell lines for functional patterns as causal interaction candidates. Here, we report a total of 936,707 RBP-RNA pairs in the two cell lines that show statistically significant linear or nonlinear functional patterns. About 31% of these pairs have supportive biological evidence from other sources, suggesting the effectiveness of the pipeline. The interactions constitute RBP-specific regulatory networks that may potentially represent core mechanisms in the two cancers. The pipeline is implemented through an R interface with pre-computed results and data libraries for users to query specific networks and visualize RBP-RNA interactions. Such networks serve as a useful resource for studying RNA dysregulation in cancer.
  2. RNA-binding proteins (RBPs) regulate all aspects of RNA biogenesis from transcription, splicing, and translation to degradation, and they have a critical role in cellular homeostasis and functional diversity. Recent studies have indicated that altered expression of RBPs is associated with many human diseases ranging from neurologic disorders to cancer. The transcriptional coregulator yes-associated protein 1 (YAP1), a critical nuclear effector of the mammalian Hippo pathway, regulates cell fate, cell contact, metabolism, and developmental processes. This study demonstrates a link between YAP1 and nucleophosmin1 (NPM1) protein. NPM1 is an RNA-binding protein that regulates many cellular activities, including ribosome biogenesis, RNA processing, chromatin remodeling, DNA repair, and genomic stability. We identified NPM1 from YAP1 protein complexes of androgen-responsive human cancer cells using proteomics approaches. Our proximity ligation assay demonstrated that YAP1 and NPM1 physically interacted with each other. The interaction between YAP1 and NPM1 occurred in cell nuclei and was regulated by androgen hormone signaling. In addition, our GST-pulldown assay demonstrated that NPM1 formed a protein complex with the proline-rich domain of YAP1. Furthermore, our enhanced RNA interactome capture (eRIC) assay showed that androgen also regulated the interaction of RBPs with polyA+ mRNA within the cell. Consistent with this observation, our eRIC assay combined with mass spectrometry enabled us to identify distinct RBP patterns in human cancer cells that are genetically related but phenotypically different. These observations indicate that global alterations of RBPs under changing environmental conditions may have essential roles in cellular physiology and disease biology.
  3. Abstract Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play important roles in essential biological processes. To facilitate functional annotation and accurate prediction of different types of NABPs, many machine learning-based computational approaches have been developed. However, the datasets used for training and testing as well as the prediction scopes in these studies have limited their applications. In this paper, we developed new strategies to overcome these limitations by generating more accurate and robust datasets and developing deep learning-based methods including both hierarchical and multi-class approaches to predict the types of NABPs for any given protein. The deep learning models employ two layers of convolutional neural network and one layer of long short-term memory. Our approaches outperform existing DBP and RBP predictors with a balanced prediction between DBPs and RBPs, and are more practically useful in identifying novel NABPs. The multi-class approach greatly improves the prediction accuracy of DBPs and RBPs, especially for the DBPs with ~12% improvement. Moreover, we explored the prediction accuracy of single-stranded DNA binding proteins and their effect on the overall prediction accuracy of NABP predictions. 
  4. To cope with abiotic environmental stress, plants rapidly change their gene expression transcriptionally and post-transcriptionally, the latter by translational suppression of selected proteins and the assembly of cytoplasmic stress granules (SGs) that sequester mRNA transcripts. RNA-binding proteins (RBPs) are the major players in these post-transcriptional processes, which control RNA processing in the nucleus, their export from the nucleus, and overall RNA metabolism in the cytoplasm. Because of their diverse modular domain structures, various RBP types dynamically co-assemble with their targeted RNAs and interacting proteins to form SGs, a process that finely regulates stress-responsive gene expression. This review summarizes recent findings on the involvement of RBPs in adapting plants to various abiotic stresses via modulation of specific gene expression events and SG formation. The relationship of these processes with the stress hormone abscisic acid (ABA) is discussed. 
  5. Millet, Oscar (Ed.)
    Cyp33 is an essential human cyclophilin prolyl isomerase that plays myriad roles in splicing and chromatin remodeling. In addition to a canonical cyclophilin (Cyp) domain, Cyp33 contains an RNA-recognition motif (RRM) domain, and RNA-binding triggers proline isomerase activity. One prominent role for Cyp33 is through a direct interaction with the mixed lineage leukemia protein 1 (MLL1, also known as KMT2A) complex, which is a histone methyltransferase that serves as a global regulator of human transcription. MLL activity is regulated by Cyp33, which isomerizes a key proline in the linker between the PHD3 and Bromo domains of MLL1, acting as a switch between gene activation and repression. The direct interaction between MLL1 and Cyp33 is critical, as deletion of the MLL1-PHD3 domain responsible for this interaction results in oncogenesis. The Cyp33 RRM is central to these activities, as it binds both the PHD3 domain and RNA. To better understand how RNA binding drives the action of Cyp33, we performed RNA-SELEX against full-length Cyp33 accompanied by deep sequencing. We have identified an enriched Cyp33 binding motif (AAUAAUAA) broadly represented in the cellular RNA pool, as well as tightly binding RNA aptamers with affinities comparable to, and competitive with, the Cyp33 MLL1-PHD3 interaction. RNA binding extends beyond the canonical RRM domain, but not to the Cyp domain, suggesting an indirect mechanism of interaction. NMR chemical shift mapping confirms an overlapping, but not identical, interface on Cyp33 for RNA and PHD3 binding. This finding suggests RNA can disrupt the gene-repressive Cyp33-MLL1 complex, providing another layer of regulation for chromatin remodeling by MLL1.