Title: Paralog transcriptional differentiation in the D. melanogaster-specific gene family Sdic across populations and spermatogenesis stages
Abstract How recently originated gene copies become stable genomic components remains uncertain, as high sequence similarity of young duplicates precludes their functional characterization. The tandem multigene family Sdic is specific to Drosophila melanogaster and has been annotated across multiple reference-quality genome assemblies. Here we show the existence of a positive correlation between Sdic copy number and total expression, plus vast intrastrain differences in mRNA abundance among paralogs, using RNA-sequencing from testis of four strains with variable paralog composition. Single cell and nucleus RNA-sequencing data expose paralog expression differentiation in meiotic cell types within testis from third instar larvae and adults. Additional RNA-sequencing across synthetic strains only differing in their Y chromosomes reveals a tissue-dependent trans-regulatory effect on Sdic: upregulation in testis and downregulation in male accessory gland. By leveraging paralog-specific expression information from tissue- and cell-specific data, our results elucidate the intraspecific functional diversification of a recently expanded tandem gene family.
Award ID(s):
2129845
PAR ID:
10469990
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Communications Biology
Volume:
6
Issue:
1
ISSN:
2399-3642
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Early lineage diversification is central to understanding what mutational events drive species divergence. Particularly, gene misregulation in interspecific hybrids can inform about what genes and pathways underlie hybrid dysfunction. In Drosophila hybrids, how regulatory evolution impacts different reproductive tissues remains understudied. Here, we generate a new genome assembly and annotation in Drosophila willistoni and analyse the patterns of transcriptome divergence between two allopatrically evolved D. willistoni subspecies, their male sterile and female fertile hybrid progeny across testis, male accessory gland, and ovary. Patterns of transcriptome divergence and modes of regulatory evolution were tissue‐specific. Despite no indication for cell‐type differences in hybrid testis, this tissue exhibited the largest magnitude of expression differentiation between subspecies and between parentals and hybrids. No evidence for anomalous dosage compensation in hybrid male tissues was detected nor was a differential role for the neo‐ and the ancestral arms of the D. willistoni X chromosome. Compared to the autosomes, the X chromosome appeared enriched for transgressively expressed genes in testis despite being the least differentiated in expression between subspecies. Evidence for fine genome clustering of transgressively expressed genes suggests a role of chromatin structure on hybrid gene misregulation. Lastly, transgressively expressed genes in the testis of the sterile male progeny were enriched for GO terms not typically associated with sperm function, instead hinting at anomalous development of the reproductive tissue. Our thorough tissue‐level portrait of transcriptome differentiation between recently diverged D. willistoni subspecies and their hybrids provides a more nuanced view of early regulatory changes during speciation.
  2. Abstract To understand phenotypic variations and key factors which affect disease susceptibility of complex traits, it is important to decipher cell‐type tissue compositions. To study cellular compositions of bulk tissue samples, one can evaluate cellular abundances and cell‐type‐specific gene expression patterns from the tissue transcriptome profiles. We develop both fixed and mixed models to reconstruct cellular expression fractions for bulk‐profiled samples by using single‐cell (sc) RNA‐sequencing (RNA‐seq) reference data. In benchmark evaluations of estimating cellular expression fractions, the mixed‐effect models provide results similar to those of an elegant machine learning algorithm named cell‐type identification by estimating relative subsets of RNA transcripts (CIBERSORTx), which is a well‐known and reliable procedure to reconstruct cell‐type abundances and cell‐type‐specific gene expression profiles. In real data analysis, the mixed‐effect models outperform or perform similarly to CIBERSORTx. The mixed models perform better than the fixed models in both benchmark evaluations and data analysis. In simulation studies, we show that if heterogeneity exists in scRNA‐seq data, it is better to use mixed models with heterogeneous mean and variance–covariance. As a byproduct, the mixed models provide fractions of covariance between subject‐specific gene expression and cell types to measure their correlations. The proposed mixed models provide a complementary tool to dissect bulk tissues using scRNA‐seq data.
  3. Abstract Background: Low back pain is a leading cause of disability worldwide and is frequently attributed to intervertebral disc (IVD) degeneration. Though the contributions of the adjacent cartilage endplates (CEP) to IVD degeneration are well documented, the phenotype and functions of the resident CEP cells are critically understudied. To better characterize CEP cell phenotype and possible mechanisms of CEP degeneration, bulk and single-cell RNA sequencing of non-degenerated and degenerated CEP cells were performed. Methods: Human lumbar CEP cells from degenerated (Thompson grade ≥ 4) and non-degenerated (Thompson grade ≤ 2) discs were expanded for bulk (N = 4 non-degenerated, N = 4 degenerated) and single-cell (N = 1 non-degenerated, N = 1 degenerated) RNA sequencing. Genes identified from bulk RNA sequencing were categorized by function, and their expression in non-degenerated and degenerated CEP cells was compared. A PubMed literature review was also performed to determine which genes were previously identified and studied in the CEP, IVD, and other cartilaginous tissues. For single-cell RNA sequencing, different cell clusters were resolved using unsupervised clustering and functional annotation. Differential gene expression analysis and Gene Ontology, respectively, were used to compare gene expression and functional enrichment between cell clusters, as well as between non-degenerated and degenerated CEP samples. Results: Bulk RNA sequencing revealed 38 genes were significantly upregulated and 15 genes were significantly downregulated in degenerated CEP cells relative to non-degenerated cells (|fold change| ≥ 1.5). Of these, only 2 genes were previously studied in CEP cells, and 31 were previously studied in the IVD and other cartilaginous tissues. Single-cell RNA sequencing revealed 11 unique cell clusters, including multiple chondrocyte and progenitor subpopulations with distinct gene expression and functional profiles. Analysis of genes in the bulk RNA sequencing dataset showed that progenitor cell clusters from both samples were enriched in “non-degenerated” genes but not “degenerated” genes. For both bulk- and single-cell analyses, gene expression and pathway enrichment analyses highlighted several pathways that may regulate CEP degeneration, including transcriptional regulation, translational regulation, intracellular transport, and mitochondrial dysfunction. Conclusions: This thorough analysis using RNA sequencing methods highlighted numerous differences between non-degenerated and degenerated CEP cells, the phenotypic heterogeneity of CEP cells, and several pathways of interest that may be relevant in CEP degeneration.
  4. In brief: Modes of reproduction across limbed vertebrates are diverse, but the molecular mechanisms required for the development and maintenance of reproductive tract tissue architecture are poorly understood. This paper describes gene expression changes across the regions of the reproductive tract of the adult female brown anole, Anolis sagrei. Abstract The morphological diversity and functional role of the organs of the female reproductive system across tetrapods (limbed vertebrates) are relatively poorly understood. Although some features are morphologically similar, species-specific modification makes comparisons between species and inference about evolutionary origins challenging. In combination with the study of morphological changes, studying differences in gene expression in the adult reproductive system in diverse species can clarify the function of each organ. Here, we use the brown anole, Anolis sagrei, to study gene expression differences within the reproductive tract of the adult female. We generated gene expression profiles of four biological replicates of the three regions of the female reproductive tract, the infundibulum, glandular uterus, and nonglandular uterus, by RNA-sequencing. We aligned reads to the recently published A. sagrei genome and identified significantly differentially expressed genes between the regions using DESeq2. Each organ expressed approximately 14,600 genes, and comparison of gene expression profiles between organs revealed between 367 and 883 differentially expressed genes. We identify shared and region-specific transcriptional signatures for the three regions and compare gene expression in the brown anole reproductive tract to known gene expression patterns in other tetrapods. We find that genes in the Hox cluster have an anterior–posterior, collinear expression pattern as has been described in mammals. We also define a secretome for the glandular uterus. These data provide fundamental information for functional studies of the reproductive tract organs in the brown anole and an important phylogenetic anchor for comparative studies of the evolution of the female reproductive tract.
  5. Abstract Single-cell RNA sequencing is a powerful technique that continues to expand across various biological applications. However, incomplete 3′-UTR annotations can impede single-cell analysis, resulting in genes that are partially or completely uncounted. Performing single-cell RNA sequencing with incomplete 3′-UTR annotations can hinder the identification of cell identities and gene expression patterns and lead to erroneous biological inferences. We demonstrate that performing single-cell isoform sequencing in tandem with single-cell RNA sequencing can rapidly improve 3′-UTR annotations. Using threespine stickleback fish (Gasterosteus aculeatus), we show that gene models resulting from a minimal embryonic single-cell isoform sequencing dataset retained 26.1% more single-cell RNA sequencing reads than gene models from Ensembl alone. Furthermore, pooling our single-cell sequencing isoforms with a previously published adult bulk Iso-Seq dataset from stickleback, and merging the annotation with the Ensembl gene models, resulted in a marginal improvement (+0.8%) over the single-cell isoform sequencing only dataset. In addition, isoforms identified by single-cell isoform sequencing included thousands of new splicing variants. The improved gene models obtained using single-cell isoform sequencing led to successful identification of cell types and increased the reads identified for many genes in our single-cell RNA sequencing stickleback dataset. Our work illuminates single-cell isoform sequencing as a cost-effective and efficient mechanism to rapidly annotate genomes for single-cell RNA sequencing.