skip to main content


Title: Comparative Analysis of rRNA Removal Methods for RNA-Seq Differential Expression in Halophilic Archaea
Despite intense recent research interest in archaea, the scientific community has experienced a bottleneck in the study of genome-scale gene expression experiments by RNA-seq due to the lack of commercial and specifically designed rRNA depletion kits. The high rRNA:mRNA ratio (80–90%: ~10%) in prokaryotes hampers global transcriptomic analysis. Insufficient ribodepletion results in low sequence coverage of mRNA, and therefore, requires a substantially higher number of replicate samples and/or sequencing reads to achieve statistically reliable conclusions regarding the significance of differential gene expression between case and control samples. Here, we show that after the discontinuation of the previous version of RiboZero (Illumina, San Diego, CA, USA) that was useful in partially or completely depleting rRNA from archaea, archaeal transcriptomics studies have experienced a slowdown. To overcome this limitation, here, we analyze the efficiency for four different hybridization-based kits from three different commercial suppliers, each with two sets of sequence-specific probes to remove rRNA from four different species of halophilic archaea. We conclude that the key for transcriptomic success with the currently available tools is the probe-specificity for the rRNA sequence hybridization. With this paper, we provide insights into the archaeal community for selecting certain reagents and strategies over others depending on the archaeal species of interest. These methods yield improved RNA-seq sensitivity and enhanced detection of low abundance transcripts.  more » « less
Award ID(s):
1651117 1936024
PAR ID:
10329346
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Biomolecules
Volume:
12
Issue:
5
ISSN:
2218-273X
Page Range / eLocation ID:
682
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    RNA sequencing is a powerful approach to quantify the genome-wide distribution of mRNA molecules in a population to gain deeper understanding of cellular functions and phenotypes. However, unlike eukaryotic cells, mRNA sequencing of bacterial samples is more challenging due to the absence of a poly-A tail that typically enables efficient capture and enrichment of mRNA from the abundant rRNA molecules in a cell. Moreover, bacterial cells frequently contain 100-fold lower quantities of RNA compared to mammalian cells, which further complicates mRNA sequencing from non-cultivable and non-model bacterial species. To overcome these limitations, we report EMBR-seq (Enrichment of mRNA by Blocked rRNA), a method that efficiently depletes 5S, 16S and 23S rRNA using blocking primers to prevent their amplification.

    Results

    EMBR-seq results in 90% of the sequenced RNA molecules from anE. coliculture deriving from mRNA. We demonstrate that this increased efficiency provides a deeper view of the transcriptome without introducing technical amplification-induced biases. Moreover, compared to recent methods that employ a large array of oligonucleotides to deplete rRNA, EMBR-seq uses a single or a few oligonucleotides per rRNA, thereby making this new technology significantly more cost-effective, especially when applied to varied bacterial species. Finally, compared to existing commercial kits for bacterial rRNA depletion, we show that EMBR-seq can be used to successfully quantify the transcriptome from more than 500-fold lower starting total RNA.

    Conclusions

    EMBR-seq provides an efficient and cost-effective approach to quantify global gene expression profiles from low input bacterial samples.

     
    more » « less
  2. Single-cell quantification of RNAs is important for understanding cellular heterogeneity and gene regulation, yet current approaches suffer from low sensitivity for individual transcripts, limiting their utility for many applications. Here we present Hybridization of Probes to RNA for sequencing (HyPR-seq), a method to sensitively quantify the expression of hundreds of chosen genes in single cells. HyPR-seq involves hybridizing DNA probes to RNA, distributing cells into nanoliter droplets, amplifying the probes with PCR, and sequencing the amplicons to quantify the expression of chosen genes. HyPR-seq achieves high sensitivity for individual transcripts, detects nonpolyadenylated and low-abundance transcripts, and can profile more than 100,000 single cells. We demonstrate how HyPR-seq can profile the effects of CRISPR perturbations in pooled screens, detect time-resolved changes in gene expression via measurements of gene introns, and detect rare transcripts and quantify cell-type frequencies in tissue using low-abundance marker genes. By directing sequencing power to genes of interest and sensitively quantifying individual transcripts, HyPR-seq reduces costs by up to 100-fold compared to whole-transcriptome single-cell RNA-sequencing, making HyPR-seq a powerful method for targeted RNA profiling in single cells.

     
    more » « less
  3. null (Ed.)
    Corals from the northern Red Sea and Gulf of Aqaba exhibit extreme thermal tolerance. To examine the underlying gene expression dynamics, we exposed Stylophora pistillata from the Gulf of Aqaba to short-term (hours) and long-term (weeks) heat stress with peak seawater temperatures ranging from their maximum monthly mean of 27 °C (baseline) to 29.5 °C, 32 °C, and 34.5 °C. Corals were sampled at the end of the heat stress as well as after a recovery period at baseline temperature. Changes in coral host and symbiotic algal gene expression were determined via RNA-sequencing (RNA-Seq). Shifts in coral microbiome composition were detected by complementary DNA (cDNA)-based 16S ribosomal RNA (rRNA) gene sequencing. In all experiments up to 32 °C, RNA-Seq revealed fast and pervasive changes in gene expression, primarily in the coral host, followed by a return to baseline gene expression for the majority of coral (>94%) and algal (>71%) genes during recovery. At 34.5 °C, large differences in gene expression were observed with minimal recovery, high coral mortality, and a microbiome dominated by opportunistic bacteria (including Vibrio species), indicating that a lethal temperature threshold had been crossed. Our results show that the S. pistillata holobiont can mount a rapid and pervasive gene expression response contingent on the amplitude and duration of the thermal stress. We propose that the transcriptomic resilience and transcriptomic acclimation observed are key to the extraordinary thermal tolerance of this holobiont and, by inference, of other northern Red Sea coral holobionts, up to seawater temperatures of at least 32 °C, that is, 5 °C above their current maximum monthly mean. 
    more » « less
  4. Abstract Background Quantification of gene expression from RNA-seq data is a prerequisite for transcriptome analysis such as differential gene expression analysis and gene co-expression network construction. Individual RNA-seq experiments are larger and combining multiple experiments from sequence repositories can result in datasets with thousands of samples. Processing hundreds to thousands of RNA-seq data can result in challenges related to data management, access to sufficient computational resources, navigation of high-performance computing (HPC) systems, installation of required software dependencies, and reproducibility. Processing of larger and deeper RNA-seq experiments will become more common as sequencing technology matures. Results GEMmaker, is a nf-core compliant, Nextflow workflow, that quantifies gene expression from small to massive RNA-seq datasets. GEMmaker ensures results are highly reproducible through the use of versioned containerized software that can be executed on a single workstation, institutional compute cluster, Kubernetes platform or the cloud. GEMmaker supports popular alignment and quantification tools providing results in raw and normalized formats. GEMmaker is unique in that it can scale to process thousands of local or remote stored samples without exceeding available data storage. Conclusions Workflows that quantify gene expression are not new, and many already address issues of portability, reusability, and scale in terms of access to CPUs. GEMmaker provides these benefits and adds the ability to scale despite low data storage infrastructure. This allows users to process hundreds to thousands of RNA-seq samples even when data storage resources are limited. GEMmaker is freely available and fully documented with step-by-step setup and execution instructions. 
    more » « less
  5. Background

    Since the invention of next‐generation RNA sequencing (RNA‐seq) technologies, they have become a powerful tool to study the presence and quantity of RNA molecules in biological samples and have revolutionized transcriptomic studies. The analysis of RNA‐seq data at four different levels (samples, genes, transcripts, and exons) involve multiple statistical and computational questions, some of which remain challenging up to date.

    Results

    We review RNA‐seq analysis tools at the sample, gene, transcript, and exon levels from a statistical perspective. We also highlight the biological and statistical questions of most practical considerations.

    Conclusions

    The development of statistical and computational methods for analyzing RNA‐seq data has made significant advances in the past decade. However, methods developed to answer the same biological question often rely on diverse statistical models and exhibit different performance under different scenarios. This review discusses and compares multiple commonly used statistical models regarding their assumptions, in the hope of helping users select appropriate methods as needed, as well as assisting developers for future method development.

     
    more » « less