Visualizing Ribo-seq and other sequencing data within genes of interest is a powerful approach to studying gene expression, but its application is limited by a lack of robust tools. Here, we introduce ggRibo, a user-friendly R package for visualizing individual gene expression, integrating Ribo-seq, RNA-seq, and other genome-wide datasets with flexible scaling options. ggRibo visualizes 3-nucleotide periodicity, a hallmark of translating ribosomes, within a gene-structure context, including introns and untranslated regions, enabling the study of novel ORFs, translation of different isoforms, and mechanisms of translational regulation. ggRibo can plot multiple Ribo-seq/RNA-seq datasets from different conditions for comparison. It also contains functions for plotting single-transcript view, reading-frame decomposition, and RNA-seq coverage alone. Importantly, ggRibo supports the visualization of other omics datasets that could also be presented with single-nucleotide resolution, such as RNA degradome, transcription start sites, translation initiation sites, and epitranscriptomic modifications. We demonstrate its utility with examples of upstream ORFs, downstream ORFs, nested ORFs, and differential isoform translation in humans,Arabidopsis, tomato, and rice. We also provide examples of multiomic comparisons that reveal insights that connect the transcriptome, translatome, and degradome. In summary, ggRibo is an advanced single-gene viewer that offers a valuable resource for studying gene expression regulation through its intuitive and flexible platform.
more »
« less
RiboPlotR: a visualization tool for periodic Ribo-seq reads
Abstract Background Ribo-seq has revolutionized the study of genome-wide mRNA translation. High-quality Ribo-seq data display strong 3-nucleotide (nt) periodicity, which corresponds to translating ribosomes deciphering three nts at a time. While 3-nt periodicity has been widely used to study novel translation events such as upstream ORFs in 5′ untranslated regions and small ORFs in presumed non-coding RNAs, tools that allow the visualization of these events remain underdeveloped. Results RiboPlotR is a visualization package written in R that presents both RNA-seq coverage and Ribo-seq reads in genomic coordinates for all annotated transcript isoforms of a gene. Specifically, for individual isoform models, RiboPlotR plots Ribo-seq data in the context of gene structures, including 5′ and 3′ untranslated regions and introns, and it presents the reads for all three reading frames in three different colors. The inclusion of gene structures and color-coding the reading frames facilitate observing new translation events and identifying potential regulatory mechanisms. Conclusions RiboPlotR is freely available ( https://github.com/hsinyenwu/RiboPlotR and https://sourceforge.net/projects/riboplotr/ ) and allows the visualization of translated features identified in Ribo-seq data.
more »
« less
- Award ID(s):
- 2051885
- PAR ID:
- 10317163
- Date Published:
- Journal Name:
- Plant Methods
- Volume:
- 17
- Issue:
- 1
- ISSN:
- 1746-4811
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract A crucial step in functional genomics is identifying actively translated open reading frames (ORFs) and linking them to biological functions. The challenge lies in identifying short ORFs, as their identification is greatly influenced by data quality and depth. Here, we improved the coverage of super-resolution Ribo-seq in Arabidopsis (Arabidopsis thaliana), revealing uncharacterized translation events for nuclear, chloroplastic, and mitochondrial genes. Assisted by a transcriptome assembly, we identified 7,751 unconventional translation events, comprising 6,996 upstream ORFs (uORFs) and 209 downstream ORFs on annotated protein-coding genes, as well as 546 ORFs in presumed non-coding RNAs. Proteomics data confirmed the production of stable proteins from some of these unannotated translation events. We present evidence of active translation from primary transcripts of tasiRNAs (TAS1–4) and microRNAs (pri-MIR163, pri-MIR169), and periodic ribosome stalling supporting co-translational decay. Additionally, we developed a method for identifying extremely short uORFs, including 370 minimum uORFs (AUG-stop), and 2,921 tiny uORFs (2–10 amino acids), and 681 uORFs that overlap with each other. Remarkably, these short uORFs exhibit strong translational repression as do longer uORFs. We also systematically discovered 594 uORFs regulated by alternative splicing, suggesting widespread isoform-specific translational control. Finally, these prevalent uORFs are associated with numerous important pathways. In summary, our improved Arabidopsis translational landscape provides valuable resources to study gene expression regulation.more » « less
-
Abstract BackgroundThe eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3′-untranslated region (3′-UTR) of mRNA produces transcripts with shorter or longer 3′-UTR. Often, 3′-UTR serves as a binding platform for microRNAs and RNA-binding proteins, which affect the fate of the mRNA transcript. Thus, 3′-UTR APA is known to modulate translation and provides a mean to regulate gene expression at the post-transcriptional level. Current bioinformatics pipelines have limited capability in profiling 3′-UTR APA events due to incomplete annotations and a low-resolution analyzing power: widely available bioinformatics pipelines do not reference actionable polyadenylation (cleavage) sites but simulate 3′-UTR APA only using RNA-seq read coverage, causing false positive identifications. To overcome these limitations, we developed APA-Scan, a robust program that identifies 3′-UTR APA events and visualizes the RNA-seq short-read coverage with gene annotations. MethodsAPA-Scan utilizes either predicted or experimentally validated actionable polyadenylation signals as a reference for polyadenylation sites and calculates the quantity of long and short 3′-UTR transcripts in the RNA-seq data. APA-Scan works in three major steps: (i) calculate the read coverage of the 3′-UTR regions of genes; (ii) identify the potential APA sites and evaluate the significance of the events among two biological conditions; (iii) graphical representation of user specific event with 3′-UTR annotation and read coverage on the 3′-UTR regions. APA-Scan is implemented in Python3. Source code and a comprehensive user’s manual are freely available athttps://github.com/compbiolabucf/APA-Scan. ResultAPA-Scan was applied to both simulated and real RNA-seq datasets and compared with two widely used baselines DaPars and APAtrap. In simulation APA-Scan significantly improved the accuracy of 3′-UTR APA identification compared to the other baselines. The performance of APA-Scan was also validated by 3′-end-seq data and qPCR on mouse embryonic fibroblast cells. The experiments confirm that APA-Scan can detect unannotated 3′-UTR APA events and improve genome annotation. ConclusionAPA-Scan is a comprehensive computational pipeline to detect transcriptome-wide 3′-UTR APA events. The pipeline integrates both RNA-seq and 3′-end-seq data information and can efficiently identify the significant events with a high-resolution short reads coverage plots.more » « less
-
Ranaviruses (Iridoviridae), including Frog Virus 3 (FV3), are large dsDNA viruses that cause devastating infections globally in amphibians, fish, and reptiles, and contribute to catastrophic amphibian declines. FV3’s large genome (~105 kb) contains at least 98 putative open reading frames (ORFs) as annotated in its reference genome. Previous studies have classified these coding genes into temporal classes as immediate early, delayed early, and late viral transcripts based on their sequential expression during FV3 infection. To establish a high-throughput characterization of ranaviral gene expression at the genome scale, we performed a whole transcriptomic analysis (RNA-Seq) using total RNA samples containing both viral and cellular transcripts from FV3-infected Xenopus laevis adult tissues using two FV3 strains, a wild type (FV3-WT) and an ORF64R-deleted recombinant (FV3-∆64R). In samples from the infected intestine, liver, spleen, lung, and especially kidney, an FV3-targeted transcriptomic analysis mapped reads spanning the full-genome coverage at ~10× depth on both positive and negative strands. By contrast, reads were only mapped to partial genomic regions in samples from the infected thymus, skin, and muscle. Extensive analyses validated the expression of almost all of the 98 annotated ORFs and profiled their differential expression in a tissue-, virus-, and temporal class-dependent manner. Further studies identified several putative ORFs that encode hypothetical proteins containing viral mimicking conserved domains found in host interferon (IFN) regulatory factors (IRFs) and IFN receptors. This study provides the first comprehensive genome-wide viral transcriptome profiling during infection and across multiple amphibian host tissues that will serve as an instrumental reference. Our findings imply that Ranaviruses like FV3 have acquired previously unknown molecular mimics, interfering with host IFN signaling during evolution.more » « less
-
null (Ed.)Abstract Background Translation is a fundamental process in gene expression. Ribosome profiling is a method that enables the study of transcriptome-wide translation. A fundamental, technical challenge in analyzing Ribo-Seq data is identifying the A-site location on ribosome-protected mRNA fragments. Identification of the A-site is essential as it is at this location on the ribosome where a codon is translated into an amino acid. Incorrect assignment of a read to the A-site can lead to lower signal-to-noise ratio and loss of correlations necessary to understand the molecular factors influencing translation. Therefore, an easy-to-use and accurate analysis tool is needed to accurately identify the A-site locations. Results We present RiboA, a web application that identifies the most accurate A-site location on a ribosome-protected mRNA fragment and generates the A-site read density profiles. It uses an Integer Programming method that reflects the biological fact that the A-site of actively translating ribosomes is generally located between the second codon and stop codon of a transcript, and utilizes a wide range of mRNA fragment sizes in and around the coding sequence (CDS). The web application is containerized with Docker, and it can be easily ported across platforms. Conclusions The Integer Programming method that RiboA utilizes is the most accurate in identifying the A-site on Ribo-Seq mRNA fragments compared to other methods. RiboA makes it easier for the community to use this method via a user-friendly and portable web application. In addition, RiboA supports reproducible analyses by tracking all the input datasets and parameters, and it provides enhanced visualization to facilitate scientific exploration. RiboA is available as a web service at https://a-site.vmhost.psu.edu/ . The code is publicly available at https://github.com/obrien-lab/aip_web_docker under the MIT license.more » « less
An official website of the United States government

