skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Comparative Analysis of rRNA Removal Methods for RNA-Seq Differential Expression in Halophilic Archaea
Despite intense recent research interest in archaea, the scientific community has experienced a bottleneck in the study of genome-scale gene expression experiments by RNA-seq due to the lack of commercial and specifically designed rRNA depletion kits. The high rRNA:mRNA ratio (80–90%: ~10%) in prokaryotes hampers global transcriptomic analysis. Insufficient ribodepletion results in low sequence coverage of mRNA, and therefore, requires a substantially higher number of replicate samples and/or sequencing reads to achieve statistically reliable conclusions regarding the significance of differential gene expression between case and control samples. Here, we show that after the discontinuation of the previous version of RiboZero (Illumina, San Diego, CA, USA) that was useful in partially or completely depleting rRNA from archaea, archaeal transcriptomics studies have experienced a slowdown. To overcome this limitation, here, we analyze the efficiency for four different hybridization-based kits from three different commercial suppliers, each with two sets of sequence-specific probes to remove rRNA from four different species of halophilic archaea. We conclude that the key for transcriptomic success with the currently available tools is the probe-specificity for the rRNA sequence hybridization. With this paper, we provide insights into the archaeal community for selecting certain reagents and strategies over others depending on the archaeal species of interest. These methods yield improved RNA-seq sensitivity and enhanced detection of low abundance transcripts.  more » « less
Award ID(s):
1651117 1936024
PAR ID:
10329346
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Biomolecules
Volume:
12
Issue:
5
ISSN:
2218-273X
Page Range / eLocation ID:
682
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract BackgroundRNA sequencing is a powerful approach to quantify the genome-wide distribution of mRNA molecules in a population to gain deeper understanding of cellular functions and phenotypes. However, unlike eukaryotic cells, mRNA sequencing of bacterial samples is more challenging due to the absence of a poly-A tail that typically enables efficient capture and enrichment of mRNA from the abundant rRNA molecules in a cell. Moreover, bacterial cells frequently contain 100-fold lower quantities of RNA compared to mammalian cells, which further complicates mRNA sequencing from non-cultivable and non-model bacterial species. To overcome these limitations, we report EMBR-seq (Enrichment of mRNA by Blocked rRNA), a method that efficiently depletes 5S, 16S and 23S rRNA using blocking primers to prevent their amplification. ResultsEMBR-seq results in 90% of the sequenced RNA molecules from anE. coliculture deriving from mRNA. We demonstrate that this increased efficiency provides a deeper view of the transcriptome without introducing technical amplification-induced biases. Moreover, compared to recent methods that employ a large array of oligonucleotides to deplete rRNA, EMBR-seq uses a single or a few oligonucleotides per rRNA, thereby making this new technology significantly more cost-effective, especially when applied to varied bacterial species. Finally, compared to existing commercial kits for bacterial rRNA depletion, we show that EMBR-seq can be used to successfully quantify the transcriptome from more than 500-fold lower starting total RNA. ConclusionsEMBR-seq provides an efficient and cost-effective approach to quantify global gene expression profiles from low input bacterial samples. 
    more » « less
  2. Abstract The gene expression landscape across different tissues and developmental stages reflects their biological functions and evolutionary patterns. Integrative and comprehensive analyses of all transcriptomic data in an organism are instrumental to obtaining a comprehensive picture of gene expression landscape. Such studies are still very limited in sorghum, which limits the discovery of the genetic basis underlying complex agricultural traits in sorghum. We characterized the genome‐wide expression landscape for sorghum using 873 RNA‐sequencing (RNA‐seq) datasets representing 19 tissues. Our integrative analysis of these RNA‐seq data provides the most comprehensive transcriptomic atlas for sorghum, which will be valuable for the sorghum research community for functional characterizations of sorghum genes. Based on the transcriptome atlas, we identified 595 housekeeping genes (HKGs) and 2080 tissue‐specific expression genes (TEGs) for the 19 tissues. We identified different gene features between HKGs and TEGs, and we found that HKGs have experienced stronger selective constraints than TEGs. Furthermore, we built a transcriptome‐wide co‐expression network (TW‐CEN) comprising 35 modules with each module enriched in specific Gene Ontology terms. High‐connectivity genes in TW‐CEN tend to express at high levels while undergoing intensive selective pressure. We also built global and seed‐preferential co‐expression networks of starch synthesis pathways, which indicated that photosynthesis and microtubule‐based movement play important roles in starch synthesis. The global transcriptome atlas of sorghum generated by this study provides an important functional genomics resource for trait discovery and insight into starch synthesis regulation in sorghum. 
    more » « less
  3. null (Ed.)
    Corals from the northern Red Sea and Gulf of Aqaba exhibit extreme thermal tolerance. To examine the underlying gene expression dynamics, we exposed Stylophora pistillata from the Gulf of Aqaba to short-term (hours) and long-term (weeks) heat stress with peak seawater temperatures ranging from their maximum monthly mean of 27 °C (baseline) to 29.5 °C, 32 °C, and 34.5 °C. Corals were sampled at the end of the heat stress as well as after a recovery period at baseline temperature. Changes in coral host and symbiotic algal gene expression were determined via RNA-sequencing (RNA-Seq). Shifts in coral microbiome composition were detected by complementary DNA (cDNA)-based 16S ribosomal RNA (rRNA) gene sequencing. In all experiments up to 32 °C, RNA-Seq revealed fast and pervasive changes in gene expression, primarily in the coral host, followed by a return to baseline gene expression for the majority of coral (>94%) and algal (>71%) genes during recovery. At 34.5 °C, large differences in gene expression were observed with minimal recovery, high coral mortality, and a microbiome dominated by opportunistic bacteria (including Vibrio species), indicating that a lethal temperature threshold had been crossed. Our results show that the S. pistillata holobiont can mount a rapid and pervasive gene expression response contingent on the amplitude and duration of the thermal stress. We propose that the transcriptomic resilience and transcriptomic acclimation observed are key to the extraordinary thermal tolerance of this holobiont and, by inference, of other northern Red Sea coral holobionts, up to seawater temperatures of at least 32 °C, that is, 5 °C above their current maximum monthly mean. 
    more » « less
  4. Abstract Background Quantification of gene expression from RNA-seq data is a prerequisite for transcriptome analysis such as differential gene expression analysis and gene co-expression network construction. Individual RNA-seq experiments are larger and combining multiple experiments from sequence repositories can result in datasets with thousands of samples. Processing hundreds to thousands of RNA-seq data can result in challenges related to data management, access to sufficient computational resources, navigation of high-performance computing (HPC) systems, installation of required software dependencies, and reproducibility. Processing of larger and deeper RNA-seq experiments will become more common as sequencing technology matures. Results GEMmaker, is a nf-core compliant, Nextflow workflow, that quantifies gene expression from small to massive RNA-seq datasets. GEMmaker ensures results are highly reproducible through the use of versioned containerized software that can be executed on a single workstation, institutional compute cluster, Kubernetes platform or the cloud. GEMmaker supports popular alignment and quantification tools providing results in raw and normalized formats. GEMmaker is unique in that it can scale to process thousands of local or remote stored samples without exceeding available data storage. Conclusions Workflows that quantify gene expression are not new, and many already address issues of portability, reusability, and scale in terms of access to CPUs. GEMmaker provides these benefits and adds the ability to scale despite low data storage infrastructure. This allows users to process hundreds to thousands of RNA-seq samples even when data storage resources are limited. GEMmaker is freely available and fully documented with step-by-step setup and execution instructions. 
    more » « less
  5. Chen, Tzong-Yueh (Ed.)
    Salinity gradients act as strong environmental barriers that limit the distribution of aquatic organisms. Changes in gene expression associated with transitions between freshwater and saltwater environments can provide insights into organismal responses to variation in salinity. We used RNA-sequencing (RNA-seq) to investigate genome-wide variation in gene expression between a hypersaline population and a freshwater population of the livebearing fish speciesLimia perugiae(Poeciliidae). Our analyses of gill gene expression revealed potential molecular mechanisms underlying salinity tolerance in this species, including the enrichment of genes involved in ion transport, maintenance of chemical homeostasis, and cell signaling in the hypersaline population. We also found differences in gene expression patterns associated with cell-cycle and protein-folding processes between the hypersaline and freshwaterL.perugiae. Bidirectional freshwater-saltwater transitions have occurred repeatedly during the diversification of fishes, allowing for broad-scale examination of repeatable patterns in evolution. Therefore, we compared transcriptomic variation inL.perugiaewith other teleosts that have made freshwater-saltwater transitions to test for convergence in gene expression. Among the four distantly related population pairs from high- and low-salinity environments that we included in our analysis, we found only ten shared differentially expressed genes, indicating little evidence for convergence. However, we found that differentially expressed genes shared among three or more lineages were functionally enriched for ion transport and immune functioning. Overall, our results—in conjunction with other recent studies—suggest that different genes are involved in salinity transitions across disparate lineages of teleost fishes. 
    more » « less