skip to main content


Title: Transcriptional Divergence Underpinning Sexual Development in the Fungal Class Sordariomycetes
ABSTRACT Gene expression divergence through evolutionary processes is thought to be important for achieving programmed development in multicellular organisms. To test this premise in filamentous fungi, we investigated transcriptional profiles of 3,942 single-copy orthologous genes (SCOGs) in five related sordariomycete species that have morphologically diverged in the formation of their flask-shaped perithecia. We compared expression of the SCOGs to inferred gene expression levels of the most recent common ancestor of the five species, ranking genes from their largest increases to smallest increases in expression during perithecial development in each of the five species. We found that a large proportion of the genes that exhibited evolved increases in gene expression were important for normal perithecial development in Fusarium graminearum . Many of these genes were previously uncharacterized, encoding hypothetical proteins without any known functional protein domains. Interestingly, the developmental stages during which aberrant knockout phenotypes appeared largely coincided with the elevated expression of the deleted genes. In addition, we identified novel genes that affected normal perithecial development in Magnaporthe oryzae and Neurospora crassa , which were functionally and transcriptionally diverged from the orthologous counterparts in F. graminearum . Furthermore, comparative analysis of developmental transcriptomes and phylostratigraphic analysis suggested that genes encoding hypothetical proteins are generally young and transcriptionally divergent between related species. This study provides tangible evidence of shifts in gene expression that led to acquisition of novel function of orthologous genes in each lineage and demonstrates that several genes with hypothetical function are crucial for shaping multicellular fruiting bodies. IMPORTANCE The fungal class Sordariomycetes includes numerous important plant and animal pathogens. It also provides model systems for studying fungal fruiting body development, as its members develop fruiting bodies with a few well-characterized tissue types on common growth media and have rich genomic resources that enable comparative and functional analyses. To understand transcriptional divergence of key developmental genes between five related sordariomycete fungi, we performed targeted knockouts of genes inferred to have evolved significant upward shifts in expression. We found that many previously uncharacterized genes play indispensable roles at different stages of fruiting body development, which have undergone transcriptional activation in specific lineages. These novel genes are predicted to be phylogenetically young and tend to be involved in lineage- or species-specific function. Transcriptional activation of genes with unknown function seems to be more frequent than ever thought, which may be crucial for rapid adaption to changing environments for successful sexual reproduction.  more » « less
Award ID(s):
1916137
NSF-PAR ID:
10350995
Author(s) / Creator(s):
; ; ; ; ; ;
Editor(s):
Goldman, Gustavo H.
Date Published:
Journal Name:
mBio
Volume:
13
Issue:
3
ISSN:
2150-7511
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT Long noncoding RNA (lncRNA) plays important roles in sexual development in eukaryotes. In filamentous fungi, however, little is known about the expression and roles of lncRNAs during fruiting body formation. By profiling developmental transcriptomes during the life cycle of the plant-pathogenic fungus Fusarium graminearum , we identified 547 lncRNAs whose expression was highly dynamic, with about 40% peaking at the meiotic stage. Many lncRNAs were found to be antisense to mRNAs, forming 300 sense-antisense pairs. Although small RNAs were produced from these overlapping loci, antisense lncRNAs appeared not to be involved in gene silencing pathways. Genome-wide analysis of small RNA clusters identified many silenced loci at the meiotic stage. However, we found transcriptionally active small RNA clusters, many of which were associated with lncRNAs. Also, we observed that many antisense lncRNAs and their respective sense transcripts were induced in parallel as the fruiting bodies matured. The nonsense-mediated decay (NMD) pathway is known to determine the fates of lncRNAs as well as mRNAs. Thus, we analyzed mutants defective in NMD and identified a subset of lncRNAs that were induced during sexual development but suppressed by NMD during vegetative growth. These results highlight the developmental stage-specific nature and functional potential of lncRNA expression in shaping the fungal fruiting bodies and provide fundamental resources for studying sexual stage-induced lncRNAs. IMPORTANCE Fusarium graminearum is the causal agent of the head blight on our major staple crops, wheat and corn. The fruiting body formation on the host plants is indispensable for the disease cycle and epidemics. Long noncoding RNA (lncRNA) molecules are emerging as key regulatory components for sexual development in animals and plants. To date, however, there is a paucity of information on the roles of lncRNAs in fungal fruiting body formation. Here we characterized hundreds of lncRNAs that exhibited developmental stage-specific expression patterns during fruiting body formation. Also, we discovered that many lncRNAs were induced in parallel with their overlapping transcripts on the opposite DNA strand during sexual development. Finally, we found a subset of lncRNAs that were regulated by an RNA surveillance system during vegetative growth. This research provides fundamental genomic resources that will spur further investigations on lncRNAs that may play important roles in shaping fungal fruiting bodies. 
    more » « less
  2. Multicellularity has been one of the most important innovations in the history of life. The role of gene regulatory changes in driving transitions to multicellularity is being increasingly recognized; however, factors influencing gene expression patterns are poorly known in many clades. Here, we compared the developmental transcriptomes of complex multicellular fruiting bodies of eight Agaricomycetes and Cryptococcus neoformans , a closely related human pathogen with a simple morphology. In-depth analysis in Pleurotus ostreatus revealed that allele-specific expression, natural antisense transcripts, and developmental gene expression, but not RNA editing or a ‘developmental hourglass,’ act in concert to shape its transcriptome during fruiting body development. We found that transcriptional patterns of genes strongly depend on their evolutionary ages. Young genes showed more developmental and allele-specific expression variation, possibly because of weaker evolutionary constraint, suggestive of nonadaptive expression variance in fruiting bodies. These results prompted us to define a set of conserved genes specifically regulated only during complex morphogenesis by excluding young genes and accounting for deeply conserved ones shared with species showing simple sexual development. Analysis of the resulting gene set revealed evolutionary and functional associations with complex multicellularity, which allowed us to speculate they are involved in complex multicellular morphogenesis of mushroom fruiting bodies. 
    more » « less
  3. ABSTRACT The origins and maintenance of the rich fungal diversity have been longstanding issues in evolutionary biology. To investigate how differences in expression regulation contribute to divergences in development and ecology among closely related species, transcriptomes were compared between Chaetomium globosum , a homothallic pathogenic fungus thriving in highly humid ecologies, and Neurospora crassa , a heterothallic postfire saprotroph. Gene expression was quantified in perithecia at nine distinct morphological stages during nearly synchronous sexual development. Unlike N. crassa , expression of all mating loci in C. globosum was highly correlated. Key regulators of the initiation of sexual development in response to light stimuli—including orthologs of N. crassa sub-1 , sub-1 -dependent gene NCU00309, and asl-1 —showed regulatory dynamics matching between C. globosum and N. crassa . Among 24 secondary metabolism gene clusters in C. globosum , 11—including the cochliodones biosynthesis cluster—exhibited highly coordinated expression across perithecial development. C. globosum exhibited coordinately upregulated expression of histidine kinases in hyperosmotic response pathways—consistent with gene expression responses to high humidity we identified in fellow pathogen Fusarium graminearum . Bayesian networks indicated that gene interactions during sexual development have diverged in concert with the capacities both to reproduce asexually and to live a self-compatible versus self-incompatible life cycle, shifting the hierarchical roles of genes associated with conidiation and heterokaryon incompatibility in N. crassa and C. globosum . This divergence supports an evolutionary history of loss of conidiation due to unfavorable combinations of heterokaryon incompatibility in homothallic species. IMPORTANCE Fungal diversity has amazed evolutionary biologists for decades. One societally important aspect of this diversity manifests in traits that enable pathogenicity. The opportunistic pathogen Chaetomium globosum is well adapted to a high-humidity environment and produces numerous secondary metabolites that defend it from predation. Many of these chemicals can threaten human health. Understanding the phases of the C. globosum life cycle in which these products are made enables better control and even utilization of this fungus. Among its intriguing traits is that it both is self-fertile and lacks any means of propagule-based asexual reproduction. By profiling genome-wide gene expression across the process of sexual reproduction in C. globosum and comparing it to genome-wide gene expression in the model filamentous fungus N. crassa and other closely related fungi, we revealed associations among mating-type genes, sexual developmental genes, sexual incompatibility regulators, environmentally responsive genes, and secondary metabolic pathways. 
    more » « less
  4. Abstract Convergent evolution is pervasive in nature, but it is poorly understood how various constraints and natural selection limit the diversity of evolvable phenotypes. Here, we analyze the transcriptome across fruiting body development to understand the independent evolution of complex multicellularity in the two largest clades of fungi—the Agarico- and Pezizomycotina. Despite >650 My of divergence between these clades, we find that very similar sets of genes have convergently been co-opted for complex multicellularity, followed by expansions of their gene families by duplications. Over 82% of shared multicellularity-related gene families were expanding in both clades, indicating a high prevalence of convergence also at the gene family level. This convergence is coupled with a rich inferred repertoire of multicellularity-related genes in the most recent common ancestor of the Agarico- and Pezizomycotina, consistent with the hypothesis that the coding capacity of ancestral fungal genomes might have promoted the repeated evolution of complex multicellularity. We interpret this repertoire as an indication of evolutionary predisposition of fungal ancestors for evolving complex multicellular fruiting bodies. Our work suggests that evolutionary convergence may happen not only when organisms are closely related or are under similar selection pressures, but also when ancestral genomic repertoires render certain evolutionary trajectories more likely than others, even across large phylogenetic distances. 
    more » « less
  5. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments. RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution. RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations. We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events. CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans. Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat– Alu ; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx. 
    more » « less