Title: RiboA: a web application to identify ribosome A-site locations in ribosome profiling data
Abstract Background Translation is a fundamental process in gene expression. Ribosome profiling (Ribo-Seq) is a method that enables the study of transcriptome-wide translation. A fundamental technical challenge in analyzing Ribo-Seq data is identifying the A-site location on ribosome-protected mRNA fragments. Identifying the A-site is essential because it is the location on the ribosome where a codon is translated into an amino acid. Incorrectly assigning a read to the A-site can lower the signal-to-noise ratio and obscure the correlations needed to understand the molecular factors influencing translation. Therefore, an easy-to-use analysis tool is needed to accurately identify A-site locations. Results We present RiboA, a web application that identifies the most accurate A-site location on a ribosome-protected mRNA fragment and generates A-site read density profiles. It uses an Integer Programming method that reflects the biological fact that the A-site of actively translating ribosomes is generally located between the second codon and the stop codon of a transcript, and it utilizes a wide range of mRNA fragment sizes in and around the coding sequence (CDS). The web application is containerized with Docker and can be easily ported across platforms. Conclusions The Integer Programming method that RiboA utilizes is the most accurate in identifying the A-site on Ribo-Seq mRNA fragments compared to other methods. RiboA makes it easier for the community to use this method via a user-friendly and portable web application. In addition, RiboA supports reproducible analyses by tracking all input datasets and parameters, and it provides enhanced visualization to facilitate scientific exploration. RiboA is available as a web service at https://a-site.vmhost.psu.edu/ . The code is publicly available at https://github.com/obrien-lab/aip_web_docker under the MIT license.
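The constraint described in the abstract (the A-site should fall between the second codon and the stop codon) can be illustrated with a small Python sketch. This is not RiboA's implementation and not the Integer Programming formulation itself: it is a brute-force stand-in that, for fragments of one size, scores each candidate offset by how many reads it places inside the allowed window. The function name `best_a_site_offset`, the 0-based transcript coordinates, the inclusive window bounds, and the toy read positions are all assumptions made for illustration.

```python
def best_a_site_offset(read_starts, cds_start, stop_codon_start, candidate_offsets):
    """Return (best_offset, scores) for fragments of a single size.

    read_starts       : 5' end positions of the fragments (0-based, hypothetical data)
    cds_start         : position of the first nucleotide of the start codon
    stop_codon_start  : position of the first nucleotide of the stop codon
    candidate_offsets : offsets (in nt) from the 5' end to the A-site to evaluate

    Assumed allowed window: from the first nucleotide of the second codon
    through the last nucleotide of the stop codon, inclusive.
    """
    window_lo = cds_start + 3          # first nt of the second codon
    window_hi = stop_codon_start + 2   # last nt of the stop codon

    scores = {}
    for offset in candidate_offsets:
        # Count reads whose implied A-site position lands inside the window.
        scores[offset] = sum(
            1 for start in read_starts
            if window_lo <= start + offset <= window_hi
        )

    # Pick the offset with the highest count (first one wins on ties).
    best = max(candidate_offsets, key=lambda o: scores[o])
    return best, scores


# Toy example: reads clustered near the start and stop codons.
reads = [3, 3, 6, 90, 90, 93]
best, scores = best_a_site_offset(reads, cds_start=15, stop_codon_start=105,
                                  candidate_offsets=[12, 15, 18])
print(best, scores)
```

In this toy input, offsets that are too small push start-proximal reads upstream of the second codon, and offsets that are too large push stop-proximal reads past the stop codon, so the count peaks at an intermediate offset. A real analysis would repeat this per fragment size and reading frame, which is where an optimization formulation becomes useful.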
Award ID(s):
1759860
NSF-PAR ID:
10252831
Journal Name:
BMC Bioinformatics
Volume:
22
Issue:
1
ISSN:
1471-2105
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    Ribosome profiling, also known as Ribo-seq, is a powerful technique to study genome-wide mRNA translation. It reveals the precise positions and quantification of ribosomes on mRNAs through deep sequencing of ribosome footprints. We previously optimized the resolution of this technique in plants. However, several key reagents in our original method have been discontinued, and thus, there is an urgent need to establish an alternative protocol.

    Results

    Here we describe a step-by-step protocol that combines our optimized ribosome footprinting in plants with available custom library construction methods established in yeast and bacteria. We tested this protocol in 7-day-old Arabidopsis seedlings and evaluated the quality of the sequencing data regarding ribosome footprint length, mapped genomic features, and the periodic properties corresponding to actively translating ribosomes through open resource bioinformatic tools. We successfully generated high-quality Ribo-seq data comparable with our original method.

    Conclusions

    We established a custom library construction method for super-resolution Ribo-seq in Arabidopsis. The experimental protocol and bioinformatic pipeline should be readily applicable to other plant tissues and species.

     
  2. Abstract Ribosome profiling, also known as Ribo-seq, has become a popular approach to investigate regulatory mechanisms of translation in a wide variety of biological contexts. Ribo-seq not only provides a measurement of translation efficiency based on the relative abundance of ribosomes bound to transcripts, but also has the capacity to reveal dynamic and local regulation at different stages of translation based on positional information of footprints across individual transcripts. While many computational tools exist for the analysis of Ribo-seq data, no method is currently available for rigorous testing of the pattern differences in ribosome footprints. In this work, we develop a novel approach together with an R package, RiboDiPA, for Differential Pattern Analysis of Ribo-seq data. RiboDiPA allows for quick identification of genes with statistically significant differences in ribosome occupancy patterns for model organisms ranging from yeast to mammals. We show that differential pattern analysis reveals information that is distinct from and complementary to existing methods that focus on translational efficiency analysis. Using both simulated Ribo-seq footprint data and three benchmark data sets, we illustrate that RiboDiPA can uncover meaningful pattern differences across multiple biological conditions on a global scale, and pinpoint characteristic ribosome occupancy patterns at single-codon resolution.
  3. Abstract

    A crucial step in functional genomics is identifying actively translated open reading frames (ORFs) and linking them to biological functions. The challenge lies in identifying short ORFs, as their identification is greatly influenced by data quality and depth. Here, we improved the coverage of super-resolution Ribo-seq in Arabidopsis (Arabidopsis thaliana), revealing uncharacterized translation events for nuclear, chloroplastic, and mitochondrial genes. Assisted by a transcriptome assembly, we identified 7,751 unconventional translation events, comprising 6,996 upstream ORFs (uORFs) and 209 downstream ORFs on annotated protein-coding genes, as well as 546 ORFs in presumed non-coding RNAs. Proteomics data confirmed the production of stable proteins from some of these unannotated translation events. We present evidence of active translation from primary transcripts of tasiRNAs (TAS1–4) and microRNAs (pri-MIR163, pri-MIR169), and periodic ribosome stalling supporting co-translational decay. Additionally, we developed a method for identifying extremely short uORFs, including 370 minimum uORFs (AUG-stop), 2,921 tiny uORFs (2–10 amino acids), and 681 uORFs that overlap with each other. Remarkably, these short uORFs exhibit strong translational repression, as do longer uORFs. We also systematically discovered 594 uORFs regulated by alternative splicing, suggesting widespread isoform-specific translational control. Finally, these prevalent uORFs are associated with numerous important pathways. In summary, our improved Arabidopsis translational landscape provides valuable resources to study gene expression regulation.

     
  4. Abstract Background Ribo-seq has revolutionized the study of genome-wide mRNA translation. High-quality Ribo-seq data display strong 3-nucleotide (nt) periodicity, which corresponds to translating ribosomes deciphering three nts at a time. While 3-nt periodicity has been widely used to study novel translation events such as upstream ORFs in 5′ untranslated regions and small ORFs in presumed non-coding RNAs, tools that allow the visualization of these events remain underdeveloped. Results RiboPlotR is a visualization package written in R that presents both RNA-seq coverage and Ribo-seq reads in genomic coordinates for all annotated transcript isoforms of a gene. Specifically, for individual isoform models, RiboPlotR plots Ribo-seq data in the context of gene structures, including 5′ and 3′ untranslated regions and introns, and it presents the reads for all three reading frames in three different colors. The inclusion of gene structures and color-coding the reading frames facilitate observing new translation events and identifying potential regulatory mechanisms. Conclusions RiboPlotR is freely available ( https://github.com/hsinyenwu/RiboPlotR and https://sourceforge.net/projects/riboplotr/ ) and allows the visualization of translated features identified in Ribo-seq data. 
  5. Abstract

    Decay of mRNAs can be triggered by ribosome slowdown at stretches of rare codons or positively charged amino acids. However, the full diversity of sequences that trigger co-translational mRNA decay is poorly understood. To comprehensively identify sequence motifs that trigger mRNA decay, we use a massively parallel reporter assay to measure the effect of all possible combinations of codon pairs on mRNA levels in S. cerevisiae. In addition to known mRNA-destabilizing sequences, we identify several dipeptide repeats whose translation reduces mRNA levels. These include combinations of positively charged and bulky residues, as well as proline-glycine and proline-aspartate dipeptide repeats. Genetic deletion of the ribosome collision sensor Hel2 rescues the mRNA effects of these motifs, suggesting that they trigger ribosome slowdown and activate the ribosome-associated quality control (RQC) pathway. Deep mutational scanning of an mRNA-destabilizing dipeptide repeat reveals a complex interplay between the charge, bulkiness, and location of amino acid residues in conferring mRNA instability. Finally, we show that the mRNA effects of codon pairs are predictive of the effects of endogenous sequences. Our work highlights the complexity of sequence motifs driving co-translational mRNA decay in eukaryotes and presents a high-throughput approach to dissect their requirements at the codon level.

     