skip to main content


Search for: All records

Award ID contains: 2110404

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. The subphylum Saccharomycotina is a lineage in the fungal phylum Ascomycota that exhibits levels of genomic diversity similar to those of plants and animals. The Saccharomycotina consist of more than 1 200 known species currently divided into 16 families, one order, and one class. Species in this subphylum are ecologically and metabolically diverse and include important opportunistic human pathogens, as well as species important in biotechnological applications. Many traits of biotechnological interest are found in closely related species and often restricted to single phylogenetic clades. However, the biotechnological potential of most yeast species remains unexplored. Although the subphylum Saccharomycotina has much higher rates of genome sequence evolution than its sister subphylum, Pezizomycotina , it contains only one class compared to the 16 classes in Pezizomycotina . The third subphylum of Ascomycota , the Taphrinomycotina , consists of six classes and has approximately 10 times fewer species than the Saccharomycotina . These data indicate that the current classification of all these yeasts into a single class and a single order is an underappreciation of their diversity. Our previous genome-scale phylogenetic analyses showed that the Saccharomycotina contains 12 major and robustly supported phylogenetic clades; seven of these are current families ( Lipomycetaceae , Trigonopsidaceae , Alloascoideaceae , Pichiaceae , Phaffomycetaceae , Saccharomycodaceae , and Saccharomycetaceae ), one comprises two current families ( Dipodascaceae and Trichomonascaceae ), one represents the genus Sporopachydermia , and three represent lineages that differ in their translation of the CUG codon (CUG-Ala, CUG-Ser1, and CUG-Ser2). Using these analyses in combination with relative evolutionary divergence and genome content analyses, we propose an updated classification for the Saccharomycotina , including seven classes and 12 orders that can be diagnosed by genome content. This updated classification is consistent with the high levels of genomic diversity within this subphylum and is necessary to make the higher rank classification of the Saccharomycotina more comparable to that of other fungi, as well as to communicate efficiently on lineages that are not yet formally named. 
    more » « less
    Free, publicly-accessible full text available May 25, 2024
  2. Hejnol, Andreas (Ed.)
    Molecular evolution studies, such as phylogenomic studies and genome-wide surveys of selection, often rely on gene families of single-copy orthologs (SC-OGs). Large gene families with multiple homologs in 1 or more species—a phenomenon observed among several important families of genes such as transporters and transcription factors—are often ignored because identifying and retrieving SC-OGs nested within them is challenging. To address this issue and increase the number of markers used in molecular evolution studies, we developed OrthoSNAP, a software that uses a phylogenetic framework to simultaneously split gene families into SC-OGs and prune species-specific inparalogs. We term SC-OGs identified by OrthoSNAP as SNAP-OGs because they are identified using a s plitti n g a nd p runing procedure analogous to snapping branches on a tree. From 415,129 orthologous groups of genes inferred across 7 eukaryotic phylogenomic datasets, we identified 9,821 SC-OGs; using OrthoSNAP on the remaining 405,308 orthologous groups of genes, we identified an additional 10,704 SNAP-OGs. Comparison of SNAP-OGs and SC-OGs revealed that their phylogenetic information content was similar, even in complex datasets that contain a whole-genome duplication, complex patterns of duplication and loss, transcriptome data where each gene typically has multiple transcripts, and contentious branches in the tree of life. OrthoSNAP is useful for increasing the number of markers used in molecular evolution data matrices, a critical step for robustly inferring and exploring the tree of life. 
    more » « less
  3. Stajich, J (Ed.)
    Abstract Bioinformatic analysis—such as genome assembly quality assessment, alignment summary statistics, relative synonymous codon usage, file format conversion, and processing and analysis—is integrated into diverse disciplines in the biological sciences. Several command-line pieces of software have been developed to conduct some of these individual analyses, but unified toolkits that conduct all these analyses are lacking. To address this gap, we introduce BioKIT, a versatile command line toolkit that has, upon publication, 42 functions, several of which were community-sourced, that conduct routine and novel processing and analysis of genome assemblies, multiple sequence alignments, coding sequences, sequencing data, and more. To demonstrate the utility of BioKIT, we conducted a comprehensive examination of relative synonymous codon usage across 171 fungal genomes that use alternative genetic codes, showed that the novel metric of gene-wise relative synonymous codon usage can accurately estimate gene-wise codon optimization, evaluated the quality and characteristics of 901 eukaryotic genome assemblies, and calculated alignment summary statistics for 10 phylogenomic data matrices. BioKIT will be helpful in facilitating and streamlining sequence analysis workflows. BioKIT is freely available under the MIT license from GitHub (https://github.com/JLSteenwyk/BioKIT), PyPi (https://pypi.org/project/jlsteenwyk-biokit/), and the Anaconda Cloud (https://anaconda.org/jlsteenwyk/jlsteenwyk-biokit). Documentation, user tutorials, and instructions for requesting new features are available online (https://jlsteenwyk.com/BioKIT). 
    more » « less
  4. The budding yeast coevolution network captures cellular structure and function in the absence of functional data. 
    more » « less
  5. Mitchell, Aaron P. (Ed.)
    Aspergillus fumigatus causes a range of human and animal diseases collectively known as aspergillosis. A . fumigatus possesses and expresses a range of genetic determinants of virulence, which facilitate colonisation and disease progression, including the secretion of mycotoxins. Gliotoxin (GT) is the best studied A . fumigatus mycotoxin with a wide range of known toxic effects that impair human immune cell function. GT is also highly toxic to A . fumigatus and this fungus has evolved self-protection mechanisms that include (i) the GT efflux pump GliA, (ii) the GT neutralising enzyme GliT, and (iii) the negative regulation of GT biosynthesis by the bis -thiomethyltransferase GtmA. The transcription factor (TF) RglT is the main regulator of GliT and this GT protection mechanism also occurs in the non-GT producing fungus A . nidulans . However, the A . nidulans genome does not encode GtmA and GliA. This work aimed at analysing the transcriptional response to exogenous GT in A . fumigatus and A . nidulans , two distantly related Aspergillus species, and to identify additional components required for GT protection. RNA-sequencing shows a highly different transcriptional response to exogenous GT with the RglT-dependent regulon also significantly differing between A . fumigatus and A . nidulans . However, we were able to observe homologs whose expression pattern was similar in both species (43 RglT-independent and 11 RglT-dependent). Based on this approach, we identified a novel RglT-dependent methyltranferase, MtrA, involved in GT protection. Taking into consideration the occurrence of RglT-independent modulated genes, we screened an A . fumigatus deletion library of 484 transcription factors (TFs) for sensitivity to GT and identified 15 TFs important for GT self-protection. Of these, the TF KojR, which is essential for kojic acid biosynthesis in Aspergillus oryzae , was also essential for virulence and GT biosynthesis in A . fumigatus , and for GT protection in A . fumigatus , A . nidulans , and A . oryzae . KojR regulates rglT , gliT , gliJ expression and sulfur metabolism in Aspergillus species. Together, this study identified conserved components required for GT protection in Aspergillus species. 
    more » « less
  6. Newton, Irene L. (Ed.)
    ABSTRACT Clear and effective figures are central to successfully communicating scientific data. Here, we present ggpubfigs, an R package with colorblind-friendly color palettes and extensions of the ggplot2 graphic system, which helps make publication-quality scientific figures from quantitative data; ggpubfigs is an open-source and user-friendly tool that is available from https://github.com/JLSteenwyk/ggpubfigs . 
    more » « less
  7. Wolfe, Kenneth (Ed.)
    Abstract The DNA mismatch repair (MMR) pathway corrects mismatched bases produced during DNA replication and is highly conserved across the tree of life, reflecting its fundamental importance for genome integrity. Loss of function in one or a few MMR genes can lead to increased mutation rates and microsatellite instability, as seen in some human cancers. Although loss of MMR genes has been documented in the context of human disease and in hypermutant strains of pathogens, examples of entire species and species lineages that have experienced substantial MMR gene loss are lacking. We examined the genomes of 1,107 species in the fungal phylum Ascomycota for the presence of 52 genes known to be involved in the MMR pathway of fungi. We found that the median ascomycete genome contained 49/52 MMR genes. In contrast, four closely related species of obligate plant parasites from the powdery mildew genera Erysiphe and Blumeria, have lost between five and 21 MMR genes, including MLH3, EXO1, and DPB11. The lost genes span MMR functions, include genes that are conserved in all other ascomycetes, and loss of function of any of these genes alone has been previously linked to increased mutation rate. Consistent with the hypothesis that loss of these genes impairs MMR pathway function, we found that powdery mildew genomes with higher levels of MMR gene loss exhibit increased numbers of mononucleotide runs, longer microsatellites, accelerated sequence evolution, elevated mutational bias in the A|T direction, and decreased GC content. These results identify a striking example of macroevolutionary loss of multiple MMR pathway genes in a eukaryotic lineage, even though the mutational outcomes of these losses appear to resemble those associated with detrimental MMR dysfunction in other organisms. 
    more » « less