Abstract Many arthropods undergo a seasonal dormancy termed “diapause” to optimize timing of reproduction in highly seasonal environments. In the North Atlantic, the copepodCalanus finmarchicuscompletes one to three generations annually with some individuals maturing into adults, while others interrupt their development to enter diapause. It is unknown which, why and when individuals enter the diapause program. Transcriptomic data from copepods on known programs were analyzed using dimensionality reduction of gene expression and functional analyses to identify program-specific genes and biological processes. These analyses elucidated physiological differences and established protocols that distinguish between programs. Differences in gene expression were associated with maturation of individuals on the reproductive program, while those on the diapause program showed little change over time. Only two of six filters effectively separated copepods by developmental program. The first one included all genes annotated to RNA metabolism and this was confirmed using differential gene expression analysis. The second filter identified 54 differentially expressed genes that were consistently up-regulated in individuals on the diapause program in comparison with those on the reproductive program. Annotated to oogenesis, RNA metabolism and fatty acid biosynthesis, these genes are both indicators for diapause preparation and good candidates for functional studies.
more »
« less
This content will become publicly available on May 30, 2026
Chromosome-length genome assembly of Uta stansburiana and gene expression data reveal fast pace-of-life comes with environmental stability
Uta stansburiana are an emerging model system for studying sexual selection, polymorphism, and the evolution of pace-of-life syndromes (POLS) whose distribution covers variable environments and a wide latitudinal gradient. POLS are suites of traits causing variation of life history along a slow maturing-fast maturing continuum. We present a high-quality chromosome-level reference genome for U. stansburiana and pair it with RNA-seq gene expression data to demonstrate, for the first time, the molecular basis for pace-of-life differences between locations with higher and lower climate seasonality and sexual size dimorphism (SSD). Our assembly is 2.1 Gbp, has scaffold N50 of 320 Mbp, includes 104 scaffolds, and has an L50 of 3. The assembly comprises six macrochromosomes and 11 microchromosomes. We annotated 20,350 genes for the assembly and found a repeat element composition of 49.23%, similar to work in other phrynosomatid lizards. RNA-seq gene expression data demonstrate expression differences in genes associated with pace-of-life differences including those related to stress, sexual reproduction, and cell proliferation/carcinogenesis between different environments. Our results provide the first differential gene expression evidence of environmentally-mediated pace-of-life processes related to different degrees of SSD in U. stansburiana and demonstrate the utility of RNA-seq gene expression data in detecting POLS.
more »
« less
- Award ID(s):
- 2305608
- PAR ID:
- 10615371
- Publisher / Repository:
- bioRxiv
- Date Published:
- Format(s):
- Medium: X
- Institution:
- bioRxiv
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Variation in gene expression is a feature of all living systems and has recently been characterized extensively among wild strains of the model organism Caenorhabditis elegans. To enable researchers to query gene expression and gene expression variation at any gene of interest, we have created a user-friendly web application that shares RNA-seq transcription data for 208 wild C. elegans strains generated by the Caenorhabditis Natural Diversity Resource (CaeNDR). Here, we describe the features of the web application and the details of the data and data processing underlying it. We hope that this website, wildworm.biosci.gatech.edu/cendrexp/, will help C. elegans researchers better understand their favorite genes and strains.more » « less
-
Chen, Tzong-Yueh (Ed.)Salinity gradients act as strong environmental barriers that limit the distribution of aquatic organisms. Changes in gene expression associated with transitions between freshwater and saltwater environments can provide insights into organismal responses to variation in salinity. We used RNA-sequencing (RNA-seq) to investigate genome-wide variation in gene expression between a hypersaline population and a freshwater population of the livebearing fish speciesLimia perugiae(Poeciliidae). Our analyses of gill gene expression revealed potential molecular mechanisms underlying salinity tolerance in this species, including the enrichment of genes involved in ion transport, maintenance of chemical homeostasis, and cell signaling in the hypersaline population. We also found differences in gene expression patterns associated with cell-cycle and protein-folding processes between the hypersaline and freshwaterL.perugiae. Bidirectional freshwater-saltwater transitions have occurred repeatedly during the diversification of fishes, allowing for broad-scale examination of repeatable patterns in evolution. Therefore, we compared transcriptomic variation inL.perugiaewith other teleosts that have made freshwater-saltwater transitions to test for convergence in gene expression. Among the four distantly related population pairs from high- and low-salinity environments that we included in our analysis, we found only ten shared differentially expressed genes, indicating little evidence for convergence. However, we found that differentially expressed genes shared among three or more lineages were functionally enriched for ion transport and immune functioning. Overall, our results—in conjunction with other recent studies—suggest that different genes are involved in salinity transitions across disparate lineages of teleost fishes.more » « less
-
One important characteristic of single-cell RNA sequencing (scRNA-seq) data is its high sparsity, where the gene-cell count data matrix contains high proportion of zeros. The sparsity has motivated widespread discussions on dropouts and missing data, as well as imputation algorithms of scRNA-seq analysis. Here, we aim to investigate whether there exist genes that are more prone to be under-detected in scRNA-seq, and if yes, what commonalities those genes may share. From public data sources, we gathered paired bulk RNA-seq and scRNA-seq data from 53 human samples, which were generated in diverse biological contexts. We derived pseudo-bulk gene expression by averaging the scRNA-seq data across cells. Comparisons of the paired bulk and pseudo-bulk gene expression profiles revealed that there indeed exists a collection of genes that are frequently under-detected in scRNA-seq compared to bulk RNA-seq. This result was robust to randomization when unpaired bulk and pseudo-bulk gene expression profiles were compared. We performed motif search to the last 350 bp of the identified genes, and observed an enrichment of poly(T) motif. The poly(T) motif toward the tails of those genes may be able to form hairpin structures with the poly(A) tails of their mRNA transcripts, making it difficult for their mRNA transcripts to be captured during scRNA-seq library preparation, which is a mechanistic conjecture of why certain genes may be more prone to be under-detected in scRNA-seq.more » « less
-
The Gulf pipefish Syngnathus scovelli has emerged as an important species for studying sexual selection, development, and physiology. Comparative evolutionary genomics research involving fishes from Syngnathidae depends on having a high-quality genome assembly and annotation. However, the first S. scovelli genome assembled using short-read sequences and a smaller RNA-sequence dataset has limited contiguity and a relatively poor annotation. Here, using PacBio long-read high-fidelity sequences and a proximity ligation library, we generate an improved assembly to obtain 22 chromosome-level scaffolds. Compared to the first assembly, the gaps in the improved assembly are smaller, the N75 is larger, and our genome is ~95% BUSCO complete. Using a large body of RNA-Seq reads from different tissue types and NCBI's Eukaryotic Annotation Pipeline, we discovered 28,162 genes, of which 8,061 are non-coding genes. Our new genome assembly and annotation are tagged as a RefSeq genome by NCBI and provide enhanced resources for research work involving S. scovelli.more » « less
An official website of the United States government
