skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: ROSE: a deep learning based framework for predicting ribosome stalling
We present a deep learning based framework, called ROSE, to accurately predict ribosome stalling events in translation elongation from coding sequences based on high-throughput ribosome profiling data. Our validation results demonstrate the superior performance of ROSE over conventional prediction models. ROSE provides an effective index to estimate the likelihood of translational pausing at codon resolution and understand diverse putative regulatory factors of ribosome stalling. Also, the ribosome stalling landscape computed by ROSE can recover the functional interplay between ribosome stalling and cotranslational events in protein biogenesis, including protein targeting by the signal recognition particle (SRP) and protein secondary structure formation.  more » « less
Award ID(s):
1646333 1262107
PAR ID:
10025989
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Proc. 21st Annual International Conference on Research in Computational Molecular Biology (RECOMB)
Volume:
21
Page Range / eLocation ID:
402-403
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Cell-free gene expression (CFE) systems are powerful tools for transcribing and translating genes outside of a living cell. Synthesis of membrane proteins is of particular interest, but their yield in CFE is substantially lower than that for soluble proteins. In this paper, we study the CFE of membrane proteins and develop a quantitative kinetic model. We identify that ribosome stalling during the translation of membrane proteins is a strong predictor of membrane protein synthesis due to aggregation between the ribosome nascent chains. Synthesis can be improved by the addition of lipid membranes, which incorporate protein nascent chains and, therefore, kinetically compete with aggregation. We show that the balance between peptide-membrane association and peptide aggregation rates determines the yield of the synthesized membrane protein. We define a membrane protein expression score that can be used to rationalize the engineering of lipid composition and the N-terminal domain of a native and computationally designed membrane proteins produced through CFE. 
    more » « less
  2. Abstract A crucial step in functional genomics is identifying actively translated open reading frames (ORFs) and linking them to biological functions. The challenge lies in identifying short ORFs, as their identification is greatly influenced by data quality and depth. Here, we improved the coverage of super-resolution Ribo-seq in Arabidopsis (Arabidopsis thaliana), revealing uncharacterized translation events for nuclear, chloroplastic, and mitochondrial genes. Assisted by a transcriptome assembly, we identified 7,751 unconventional translation events, comprising 6,996 upstream ORFs (uORFs) and 209 downstream ORFs on annotated protein-coding genes, as well as 546 ORFs in presumed non-coding RNAs. Proteomics data confirmed the production of stable proteins from some of these unannotated translation events. We present evidence of active translation from primary transcripts of tasiRNAs (TAS1–4) and microRNAs (pri-MIR163, pri-MIR169), and periodic ribosome stalling supporting co-translational decay. Additionally, we developed a method for identifying extremely short uORFs, including 370 minimum uORFs (AUG-stop), and 2,921 tiny uORFs (2–10 amino acids), and 681 uORFs that overlap with each other. Remarkably, these short uORFs exhibit strong translational repression as do longer uORFs. We also systematically discovered 594 uORFs regulated by alternative splicing, suggesting widespread isoform-specific translational control. Finally, these prevalent uORFs are associated with numerous important pathways. In summary, our improved Arabidopsis translational landscape provides valuable resources to study gene expression regulation. 
    more » « less
  3. During the last few decades, the ribosome has been regarded primarily as a major cell player devoted to the catalysis of protein biosynthesis during translation [1-5]. It is therefore not surprising that several processes related to translation exploit the ribosome as a central hub. For instance, it is well-known that many events related to translational regulation are mediated by interactions between the ribosome and initiation, elongation or termination factors [6-9]. In addition, the ribosome is involved in mRNA-code recognition and proofreading [10-12] as well as in the control of translation rates via interactions with mRNA codons bearing high- and lowfrequency [13-15] and associated with variable tRNA abundance within the translation machinery [16-18]. Interestingly, the ribosome also assists de novo protein structure formation by minimizing cotranslational aggregation, thus increasing the yield of native-protein production [19,20]. The latter event, however, has not been shown to require -- or even involve -- direct interactions between the ribosome and the nascent protein chain. A notable exception is that of nascent chains bearing either N-terminal signal sequences or translational-arrest tags. These proteins are known to establish short- or long-term contacts with various regions of the ribosome during translation [21-25]. In summary, until recently very little knowledge has been available about direct contacts between the ribosome and nascent polypeptides and proteins that do not carry signal or arrest sequences. Studies based on fluorescence depolarization in the frequency domain [26] and NMR spectroscopy [27-30] provided interesting data that are consistent with, but do not unequivocally establish, the presence of these interactions. 
    more » « less
  4. Abstract Ribosome engineering is a powerful approach for expanding the catalytic potential of the protein synthesis apparatus. Due to the potential detriment the properties of the engineered ribosome may have on the cell, the designer ribosome needs to be functionally isolated from the translation machinery synthesizing cellular proteins. One solution to this problem was offered by Ribo-T, an engineered ribosome with tethered subunits which, while producing a desired protein, could be excluded from general translation. Here, we provide a conceptually different design of a cell with two orthogonal protein synthesis systems, where Ribo-T produces the proteome, while the dissociable ribosome is committed to the translation of a specific mRNA. The utility of this system is illustrated by generating a comprehensive collection of mutants with alterations at every rRNA nucleotide of the peptidyl transferase center and isolating gain-of-function variants that enable the ribosome to overcome the translation termination blockage imposed by an arrest peptide. 
    more » « less
  5. Abstract Protein biogenesis is essential in all cells and initiates when a nascent polypeptide emerges from the ribosome exit tunnel, where multiple ribosome-associated protein biogenesis factors (RPBs) direct nascent proteins to distinct fates. How distinct RPBs spatiotemporally coordinate with one another to affect accurate protein biogenesis is an emerging question. Here, we address this question by studying the role of a cotranslational chaperone, nascent polypeptide-associated complex (NAC), in regulating substrate selection by signal recognition particle (SRP), a universally conserved protein targeting machine. We show that mammalian SRP and SRP receptors (SR) are insufficient to generate the biologically required specificity for protein targeting to the endoplasmic reticulum. NAC co-binds with and remodels the conformational landscape of SRP on the ribosome to regulate its interaction kinetics with SR, thereby reducing the nonspecific targeting of signalless ribosomes and pre-emptive targeting of ribosomes with short nascent chains. Mathematical modeling demonstrates that the NAC-induced regulations of SRP activity are essential for the fidelity of cotranslational protein targeting. Our work establishes a molecular model for how NAC acts as a triage factor to prevent protein mislocalization, and demonstrates how the macromolecular crowding of RPBs at the ribosome exit site enhances the fidelity of substrate selection into individual protein biogenesis pathways. 
    more » « less