Title: Predicting Gene Expression Divergence between Single-Copy Orthologs in Two Species
Abstract: Predicting gene expression divergence is integral to understanding the emergence of new biological functions and associated traits. Whereas several sophisticated methods have been developed for this task, their applications are either limited to duplicate genes or require expression data from more than two species. Thus, here we present PredIcting eXpression dIvergence (PiXi), the first machine learning framework for predicting gene expression divergence between single-copy orthologs in two species. PiXi models gene expression evolution as an Ornstein-Uhlenbeck process, and overlays this model with multi-layer neural network (NN), random forest, and support vector machine architectures for making predictions. It outputs the predicted class “conserved” or “diverged” for each pair of orthologs, as well as their predicted expression optima in the two species. We show that PiXi has high power and accuracy in predicting gene expression divergence between single-copy orthologs, as well as high accuracy and precision in estimating their expression optima in the two species, across a wide range of evolutionary scenarios, with the globally best performance achieved by a multi-layer NN. Moreover, application of our best-performing PiXi predictor to empirical gene expression data from single-copy orthologs residing at different loci in two species of Drosophila reveals that approximately 23% underwent expression divergence after positional relocation. Further analysis shows that several of these “diverged” genes are involved in the electron transport chain of the mitochondrial membrane, suggesting that new chromatin environments may impact energy production in Drosophila. Thus, by providing a toolkit for predicting gene expression divergence between single-copy orthologs in two species, PiXi can shed light on the origins of novel phenotypes across diverse biological processes and study systems.
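As a rough illustration of the Ornstein-Uhlenbeck model named in the abstract, the sketch below simulates expression in two species that start from a shared ancestral value and drift toward their own optima. It is not PiXi's implementation. The function names (`simulate_ou`, `simulate_ortholog_pair`), the parameter names, and all default values are assumptions chosen for illustration. Equal optima correspond to the "conserved" scenario and unequal optima to the "diverged" one.

```python
import math
import random


def simulate_ou(x0, theta, alpha, sigma, t_total, n_steps, rng):
    """Simulate one Ornstein-Uhlenbeck path with the exact one-step transition.

    x0: starting expression value; theta: expression optimum;
    alpha: strength of pull toward theta (> 0); sigma: drift intensity.
    All names are hypothetical, not taken from PiXi.
    """
    dt = t_total / n_steps
    decay = math.exp(-alpha * dt)
    # Standard deviation of the exact Gaussian transition over one step of length dt.
    step_sd = sigma * math.sqrt((1.0 - decay ** 2) / (2.0 * alpha))
    x = x0
    path = [x]
    for _ in range(n_steps):
        # The conditional mean relaxes exponentially from x toward theta.
        mean = theta + (x - theta) * decay
        x = rng.gauss(mean, step_sd)
        path.append(x)
    return path


def simulate_ortholog_pair(theta1, theta2, ancestor=0.0, alpha=1.0,
                           sigma=0.5, t_total=5.0, n_steps=100, seed=0):
    """Return final expression values in two species evolving independently
    from a shared ancestral value toward optima theta1 and theta2."""
    rng = random.Random(seed)
    species1 = simulate_ou(ancestor, theta1, alpha, sigma, t_total, n_steps, rng)[-1]
    species2 = simulate_ou(ancestor, theta2, alpha, sigma, t_total, n_steps, rng)[-1]
    return species1, species2


if __name__ == "__main__":
    print("conserved:", simulate_ortholog_pair(theta1=1.0, theta2=1.0))
    print("diverged: ", simulate_ortholog_pair(theta1=1.0, theta2=3.0))
```

Simulated pairs of this kind, labeled by whether the two optima match, are the sort of data on which a classifier could be trained. The features and training procedure PiXi actually uses are not described in the abstract.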
Award ID(s): 2130666, 2001063, 1949268
PAR ID: 10423597
Author(s) / Creator(s): ; ;
Editor(s): Yi, Soojin
Date Published:
Journal Name: Genome Biology and Evolution
Volume: 15
Issue: 5
ISSN: 1759-6653
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Goldman, Gustavo H. (Ed.)
    ABSTRACT Gene expression divergence through evolutionary processes is thought to be important for achieving programmed development in multicellular organisms. To test this premise in filamentous fungi, we investigated transcriptional profiles of 3,942 single-copy orthologous genes (SCOGs) in five related sordariomycete species that have morphologically diverged in the formation of their flask-shaped perithecia. We compared expression of the SCOGs to inferred gene expression levels of the most recent common ancestor of the five species, ranking genes from their largest increases to smallest increases in expression during perithecial development in each of the five species. We found that a large proportion of the genes that exhibited evolved increases in gene expression were important for normal perithecial development in Fusarium graminearum. Many of these genes were previously uncharacterized, encoding hypothetical proteins without any known functional protein domains. Interestingly, the developmental stages during which aberrant knockout phenotypes appeared largely coincided with the elevated expression of the deleted genes. In addition, we identified novel genes that affected normal perithecial development in Magnaporthe oryzae and Neurospora crassa, which were functionally and transcriptionally diverged from the orthologous counterparts in F. graminearum. Furthermore, comparative analysis of developmental transcriptomes and phylostratigraphic analysis suggested that genes encoding hypothetical proteins are generally young and transcriptionally divergent between related species. This study provides tangible evidence of shifts in gene expression that led to acquisition of novel function of orthologous genes in each lineage and demonstrates that several genes with hypothetical function are crucial for shaping multicellular fruiting bodies.
    IMPORTANCE The fungal class Sordariomycetes includes numerous important plant and animal pathogens. It also provides model systems for studying fungal fruiting body development, as its members develop fruiting bodies with a few well-characterized tissue types on common growth media and have rich genomic resources that enable comparative and functional analyses. To understand transcriptional divergence of key developmental genes between five related sordariomycete fungi, we performed targeted knockouts of genes inferred to have evolved significant upward shifts in expression. We found that many previously uncharacterized genes play indispensable roles at different stages of fruiting body development, which have undergone transcriptional activation in specific lineages. These novel genes are predicted to be phylogenetically young and tend to be involved in lineage- or species-specific function. Transcriptional activation of genes with unknown function seems to be more frequent than ever thought, which may be crucial for rapid adaption to changing environments for successful sexual reproduction.
  2. In opportunistic human pathogenic fungi, changes in gene expression play a crucial role in the progression of growth stages from early spore germination through host infection. Comparative transcriptomics between diverse fungal pathogens and non-pathogens provided insights into regulatory mechanisms behind the initiation of infectious processes. We examined the gene expression patterns of 3,845 single-copy orthologous genes (SCOGs) across five phylogenetically distinct species, including the opportunistic human pathogens Fusarium oxysporum, Aspergillus fumigatus, and A. nidulans, and nonpathogenic species Neurospora crassa and Trichoderma asperelloides, at four sequential stages of spore germination. Ancestral status of gene expression was inferred for nodes along the phylogeny. By comparing expression patterns of the SCOGs with their most recent common ancestor (MRCA), we identified genes that exhibit divergent levels of expression during spore germination when comparing fungal pathogens to non-pathogens. We focused on genes related to the MAPK pathway, nitrogen metabolism, asexual development, G-protein signaling, and conidial-wall integrity. Notably, orthologs of the transcription activator abaA, a known central regulator of conidiation, exhibited significant divergence in gene expression in F. oxysporum. This dramatic expression change in abaA was accompanied by structural modifications of phialides in F. oxysporum, and revealed how these changes impact development of offspring, formation of aerial hyphae, spore production, and pathogenicity. Our research provides insights into ecological adaptations observed during the divergence of these species, specifically highlighting how divergence in gene expression during spore germination contributes to their ability to thrive in distinct environments. 
  3. Abstract The phenotype of an organism is shaped by gene expression within developing tissues. This shaping relates the evolution of gene expression to phenotypic evolution, through divergence in gene expression and consequent phenotype. Rates of phenotypic evolution receive extensive attention. However, the degree to which divergence in the phenotype of gene expression is subject to heterogeneous rates of evolution across developmental stages has not previously been assessed. Here, we analyzed the evolution of the expression of single-copy orthologs within 9 species of Sordariomycetes Fungi, across 9 developmental stages within asexual spore germination and sexual reproduction. Rates of gene expression evolution exhibited high variation both within and among developmental stages. Furthermore, rates of gene expression evolution were correlated with nonsynonymous to synonymous substitution rates (dN/dS), suggesting that gene sequence evolution and expression evolution are indirectly or directly driven by common evolutionary forces. Functional pathway analyses demonstrate that rates of gene expression evolution are higher in labile pathways such as carbon metabolism, and lower in conserved pathways such as those involved in cell cycle and molecular signaling. Lastly, the expression of genes in the meiosis pathway evolved at a slower rate only across the stages where meiosis took place, suggesting that stage-specific low rates of expression evolution implicate high relevance of the genes to developmental operations occurring between those stages. 
  4. New genes arise through a variety of mechanisms, including the duplication of existing genes and the de novo birth of genes from noncoding DNA sequences. While there are numerous examples of duplicated genes with important functional roles, the functions of de novo genes remain largely unexplored. Many newly evolved genes are expressed in the male reproductive tract, suggesting that these evolutionary innovations may provide advantages to males experiencing sexual selection. Using testis-specific RNA interference, we screened 11 putative de novo genes in Drosophila melanogaster for effects on male fertility and identified two, goddard and saturn, that are essential for spermatogenesis and sperm function. Goddard knockdown (KD) males fail to produce mature sperm, while saturn KD males produce few sperm, and these function inefficiently once transferred to females. Consistent with a de novo origin, both genes are identifiable only in Drosophila and are predicted to encode proteins with no sequence similarity to any annotated protein. However, since high levels of divergence prevented the unambiguous identification of the noncoding sequences from which each gene arose, we consider goddard and saturn to be putative de novo genes. Within Drosophila, both genes have been lost in certain lineages, but show conserved, male-specific patterns of expression in the species in which they are found. Goddard is consistently found in single-copy and evolves under purifying selection. In contrast, saturn has diversified through gene duplication and positive selection. These data suggest that de novo genes can acquire essential roles in male reproduction.
  5. Wittkopp, Patricia (Ed.)
    Abstract In Drosophila melanogaster and D. simulans head tissue, 60% of orthologous genes show evidence of sex-biased expression in at least one species. Of these, ∼39% (2,192) are conserved in direction. We hypothesize enrichment of open chromatin in the sex where we see expression bias and closed chromatin in the opposite sex. Male-biased orthologs are significantly enriched for H3K4me3 marks in males of both species (∼89% of male-biased orthologs vs. ∼76% of unbiased orthologs). Similarly, female-biased orthologs are significantly enriched for H3K4me3 marks in females of both species (∼90% of female-biased orthologs vs. ∼73% of unbiased orthologs). The sex-bias ratio in female-biased orthologs was similar in magnitude between the two species, regardless of the closed chromatin (H3K27me2me3) marks in males. However, in male-biased orthologs, the presence of H3K27me2me3 in both species significantly reduced the correlation between D. melanogaster sex-bias ratio and the D. simulans sex-bias ratio. Male-biased orthologs are enriched for evidence of positive selection in the D. melanogaster group. There are more male-biased genes than female-biased genes in both species. For orthologs with gains/losses of sex-bias between the two species, there is an excess of male-bias compared to female-bias, but there is no consistent pattern in the relationship between H3K4me3 or H3K27me2me3 chromatin marks and expression. These data suggest chromatin state is a component of the maintenance of sex-biased expression and divergence of sex-bias between species is reflected in the complexity of the chromatin status. 