skip to main content


Title: Analysis of Fungal Genomes Reveals Commonalities of Intron Gain or Loss and Functions in Intron-Poor Species
Abstract Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression.  more » « less
Award ID(s):
1616878
NSF-PAR ID:
10331434
Author(s) / Creator(s):
; ; ;
Editor(s):
Ouangraoua, Aida
Date Published:
Journal Name:
Molecular Biology and Evolution
Volume:
38
Issue:
10
ISSN:
1537-1719
Page Range / eLocation ID:
4166 to 4186
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Spliceosomal introns are gene segments removed from RNA transcripts by ribonucleoprotein machineries called spliceosomes. In some eukaryotes a second ‘minor’ spliceosome is responsible for processing a tiny minority of introns. Despite its seemingly modest role, minor splicing has persisted for roughly 1.5 billion years of eukaryotic evolution. Identifying minor introns in over 3000 eukaryotic genomes, we report diverse evolutionary histories including surprisingly high numbers in some fungi and green algae, repeated loss, as well as general biases in their positional and genic distributions. We estimate that ancestral minor intron densities were comparable to those of vertebrates, suggesting a trend of long-term stasis. Finally, three findings suggest a major role for neutral processes in minor intron evolution. First, highly similar patterns of minor and major intron evolution contrast with both functionalist and deleterious model predictions. Second, observed functional biases among minor intron-containing genes are largely explained by these genes’ greater ages. Third, no association of intron splicing with cell proliferation in a minor intron-rich fungus suggests that regulatory roles are lineage-specific and thus cannot offer a general explanation for minor splicing’s persistence. These data constitute the most comprehensive view of minor introns and their evolutionary history to date, and provide a foundation for future studies of these remarkable genetic elements.

     
    more » « less
  2. Introduction

    Eukaryotic life depends on the functional elements encoded by both the nuclear genome and organellar genomes, such as those contained within the mitochondria. The content, size, and structure of the mitochondrial genome varies across organisms with potentially large implications for phenotypic variance and resulting evolutionary trajectories. Among yeasts in the subphylum Saccharomycotina, extensive differences have been observed in various species relative to the model yeastSaccharomyces cerevisiae, but mitochondrial genome sampling across many groups has been scarce, even as hundreds of nuclear genomes have become available.

    Methods

    By extracting mitochondrial assemblies from existing short-read genome sequence datasets, we have greatly expanded both the number of available genomes and the coverage across sparsely sampled clades.

    Results

    Comparison of 353 yeast mitochondrial genomes revealed that, while size and GC content were fairly consistent across species, those in the generaMetschnikowiaandSaccharomycestrended larger, while several species in the order Saccharomycetales, which includesS. cerevisiae, exhibited lower GC content. Extreme examples for both size and GC content were scattered throughout the subphylum. All mitochondrial genomes shared a core set of protein-coding genes for Complexes III, IV, and V, but they varied in the presence or absence of mitochondrially-encoded canonical Complex I genes. We traced the loss of Complex I genes to a major event in the ancestor of the orders Saccharomycetales and Saccharomycodales, but we also observed several independent losses in the orders Phaffomycetales, Pichiales, and Dipodascales. In contrast to prior hypotheses based on smaller-scale datasets, comparison of evolutionary rates in protein-coding genes showed no bias towards elevated rates among aerobically fermenting (Crabtree/Warburg-positive) yeasts. Mitochondrial introns were widely distributed, but they were highly enriched in some groups. The majority of mitochondrial introns were poorly conserved within groups, but several were shared within groups, between groups, and even across taxonomic orders, which is consistent with horizontal gene transfer, likely involving homing endonucleases acting as selfish elements.

    Discussion

    As the number of available fungal nuclear genomes continues to expand, the methods described here to retrieve mitochondrial genome sequences from these datasets will prove invaluable to ensuring that studies of fungal mitochondrial genomes keep pace with their nuclear counterparts.

     
    more » « less
  3. Abstract

    U12-type or minor introns are found in most multicellular eukaryotes and constitute ∼0.5% of all introns in species with a minor spliceosome. Although the biological significance for the evolutionary conservation of U12-type introns is debated, mutations disrupting U12 splicing cause developmental defects in both plants and animals. In human hematopoietic stem cells, U12 splicing defects disrupt proper differentiation of myeloid lineages and are associated with myelodysplastic syndrome, predisposing individuals to acute myeloid leukemia. Mutants in the maize ortholog of RNA binding motif protein 48 (RBM48) have aberrant U12-type intron splicing. Human RBM48 was recently purified biochemically as part of the minor spliceosome and shown to recognize the 5′ end of the U6atac snRNA. In this report, we use CRISPR/Cas9-mediated ablation of RBM48 in human K-562 cells to show the genetic function of RBM48. RNA-seq analysis comparing wild-type and mutant K-562 genotypes found that 48% of minor intron-containing genes have significant U12-type intron retention in RBM48 mutants. Comparing these results to maize rbm48 mutants defined a subset of minor intron-containing genes disrupted in both species. Mutations in the majority of these orthologous minor intron-containing genes have been reported to cause developmental defects in both plants and animals. Our results provide genetic evidence that the primary defect of human RBM48 mutants is aberrant U12-type intron splicing, while a comparison of human and maize RNA-seq data identifies candidate genes likely to mediate mutant phenotypes of U12-type splicing defects.

     
    more » « less
  4. Abstract

    The oceanic igneous crust is a vast reservoir for microbial life, dominated by diverse and active bacteria, archaea, and fungi. Archaeal and bacterial viruses were previously detected in oceanic crustal fluids at the Juan de Fuca Ridge (JdFR). Here we report the discovery of two eukaryotic Nucleocytoviricota genomes from the same crustal fluids by sorting and sequencing single virions. Both genomes have a tRNATyr gene with an intron (20 bps) at the canonical position between nucleotide 37 and 38, a common feature in eukaryotic and archaeal tRNA genes with short introns (<100 bps), and fungal genes acquired through horizontal gene transfer (HGT) events. The dominance of Ascomycota fungi as the main eukaryotes in crustal fluids and the evidence for HGT point to these fungi as the putative hosts, making these the first putative fungi-Nucleocytoviricota specific association. Our study suggests active host-viral dynamics for the only eukaryotic group found in the subsurface oceanic crust and raises important questions about the impact of viral infection on the productivity and biogeochemical cycling in this ecosystem.

     
    more » « less
  5. Abstract

    Dinoflagellates are a diverse group of phytoplankton, ranging from harmful bloom-forming microalgae to photosymbionts of coral reefs. Genome-scale data from dinoflagellates reveal atypical genomic features, extensive genomic divergence, and lineage-specific innovation of gene functions. Long non-coding RNAs (lncRNAs), known to regulate gene expression in eukaryotes, are largely unexplored in dinoflagellates. Here, using high-quality genome and transcriptome data, we identified 48039 polyadenylated lncRNAs in three dinoflagellate species: the coral symbionts Cladocopium proliferum and Durusdinium trenchii, and the bloom-forming species, Prorocentrum cordatum. These lncRNAs have fewer introns and lower G+C content than protein-coding sequences; 37 768 (78.6%) are unique with respect to sequence similarity. We classified all lncRNAs based on conserved motifs (k-mers) into distinct clusters, following properties of protein-binding and/or subcellular localisation. Interestingly, 3708 (7.7%) lncRNAs are differentially expressed under heat stress, algal lifestyle, and/or growth phase, and share co-expression patterns with protein-coding genes. Based on inferred triplex interactions between lncRNA and putative promoter regions, we identified 19 460 putative gene targets for 3721 lncRNAs; 907 genes exhibit differential expression under heat stress. These results reveal, for the first time, the diversity of lncRNAs in dinoflagellates and how lncRNAs may regulate gene expression as a heat-stress response in these ecologically important microbes.

     
    more » « less