Title: Anaerobic fungi contain abundant, diverse, and transcriptionally active Long Terminal Repeat retrotransposons
Long Terminal Repeat (LTR) retrotransposons are a class of repetitive elements that are widespread in the genomes of plants and many fungi. LTR retrotransposons have been associated with rapidly evolving gene clusters in plants and virulence factor transfer in fungal-plant parasite-host interactions. We report here the abundance and transcriptional activity of LTR retrotransposons across several species of the early-branching Neocallimastigomycota, otherwise known as the anaerobic gut fungi (AGF). The ubiquity of LTR retrotransposons in these genomes suggests key evolutionary roles in these rumen-dwelling biomass degraders, whose genomes also contain many enzymes that are horizontally transferred from other rumen-dwelling prokaryotes. Up to 10% of anaerobic fungal genomes consist of LTR retrotransposons, and the mapping of sequences from LTR retrotransposons to transcriptomes shows that the majority of clusters are transcribed, with some exhibiting expression greater than 10^4 reads per kilobase per million mapped reads (RPKM). Many LTR retrotransposons are strongly differentially expressed upon heat stress during fungal cultivation, with several exhibiting a nearly three log10-fold increase in expression, whereas growth substrate variation modulated transcription to a lesser extent. We show that some LTR retrotransposons contain carbohydrate-active enzymes (CAZymes), and the expansion of CAZymes within genomes and among anaerobic fungal species may be linked to retrotransposon activity. We further discuss how these widespread sequences may be a source of promoters and other parts towards the bioengineering of anaerobic fungi.
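The abstract reports expression in reads per kilobase per million mapped reads and changes on a log10 scale. As a minimal sketch of how those two quantities are conventionally computed, the Python below uses the standard RPKM definition. The function names, the pseudocount, and all numbers are hypothetical illustrations. They are not taken from the paper, and this is not the authors' pipeline.

```python
import math


def rpkm(read_count: int, length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of feature per million mapped reads.

    RPKM = reads * 1e9 / (feature length in bp * total mapped reads).
    """
    if length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length_bp and total_mapped_reads must be positive")
    return read_count * 1e9 / (length_bp * total_mapped_reads)


def log10_fold_change(treated: float, control: float,
                      pseudocount: float = 1.0) -> float:
    """log10 of the treated/control expression ratio.

    The pseudocount (an assumption of this sketch) avoids division by zero
    when a feature is not expressed in the control condition.
    """
    return math.log10((treated + pseudocount) / (control + pseudocount))


# Hypothetical element: 50,000 reads on a 5,000 bp retrotransposon
# in a library of 20 million mapped reads.
example_rpkm = rpkm(read_count=50_000, length_bp=5_000,
                    total_mapped_reads=20_000_000)

# Hypothetical heat-stress comparison: 9,999 vs 9 RPKM is a three log10-fold
# (1000x) increase once the pseudocount of 1 is added to both values.
example_lfc = log10_fold_change(treated=9_999.0, control=9.0)
```

On this scale, a value of 3 corresponds to a thousandfold change in expression, which is the magnitude the abstract describes for several elements under heat stress.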
Award ID(s):
2128271
PAR ID:
10532495
Publisher / Repository:
Elsevier
Journal Name:
Fungal Genetics and Biology
Volume:
172
Issue:
C
ISSN:
1087-1845
Page Range / eLocation ID:
103897
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Tribble, C (Ed.)
    Abstract The majority of sequenced genomes in the monocots are from species belonging to Poaceae, which include many commercially important crops. Here, we expand the number of sequenced genomes from the monocots to include the genomes of 4 related cyperids: Carex cristatella and Carex scoparia from Cyperaceae and Juncus effusus and Juncus inflexus from Juncaceae. The high-quality, chromosome-scale genome sequences from these 4 cyperids were assembled by combining whole-genome shotgun sequencing of Nanopore long reads, Illumina short reads, and Hi-C sequencing data. Some members of the Cyperaceae and Juncaceae are known to possess holocentric chromosomes. We examined the repeat landscapes in our sequenced genomes to search for potential repeats associated with centromeres. Several large satellite repeat families, comprising 3.2–9.5% of our sequenced genomes, showed dispersed distribution of large satellite repeat clusters across all Carex chromosomes, with few instances of these repeats clustering in the same chromosomal regions. In contrast, most large Juncus satellite repeats were clustered in a single location on each chromosome, with sporadic instances of large satellite repeats throughout the Juncus genomes. Recognizable transposable elements account for about 20% of each of the 4 genome assemblies, with the Carex genomes containing more DNA transposons than retrotransposons while the converse is true for the Juncus genomes. These genome sequences and annotations will facilitate better comparative analysis within monocots. 
  2. Morels (Morchella spp.) are popular edible fungi with significant economic and scientific value. However, white mold disease, caused by Paecilomyces penicillatus, can reduce morel yield by up to 80% in the main cultivation area in China. Paecilomyces is a polyphyletic genus and the exact phylogenetic placement of P. penicillatus is currently still unclear. Here, we obtained the first high-quality genome sequence of P. penicillatus generated through the single-molecule real-time (SMRT) sequencing platform. The assembled draft genome of P. penicillatus was 40.2 Mb, had an N50 value of 2.6 Mb and encoded 9454 genes. Phylogenetic analysis of single-copy orthologous genes revealed that P. penicillatus is in Hypocreales and closely related to Hypocreaceae, which includes several genera exhibiting a mycoparasitic lifestyle. CAZymes analysis demonstrated that P. penicillatus encodes a large number of fungal cell wall degradation enzymes. We identified many gene clusters involved in the production of secondary metabolites known to exhibit antifungal, antibacterial, or insecticidal activities. We further demonstrated through dual culture assays that P. penicillatus secretes certain soluble compounds that are inhibitory to the mycelial growth of Morchella sextelata. This study provides insights into the correct phylogenetic placement of P. penicillatus and the molecular mechanisms that underlie P. penicillatus pathogenesis.
  3. Abstract Carbohydrate Active EnZymes (CAZymes) are significantly important for microbial communities to thrive in carbohydrate rich environments such as animal guts, agricultural soils, forest floors, and ocean sediments. Since 2017, microbiome sequencing and assembly have produced numerous metagenome assembled genomes (MAGs). We have updated our dbCAN-seq database (https://bcb.unl.edu/dbCAN_seq) to include the following new data and features: (i) ∼498 000 CAZymes and ∼169 000 CAZyme gene clusters (CGCs) from 9421 MAGs of four ecological (human gut, human oral, cow rumen, and marine) environments; (ii) Glycan substrates for 41 447 (24.54%) CGCs inferred by two novel approaches (dbCAN-PUL homology search and eCAMI subfamily majority voting) (the two approaches agreed on 4183 CGCs for substrate assignments); (iii) A redesigned CGC page to include the graphical display of CGC gene compositions, the alignment of query CGC and subject PUL (polysaccharide utilization loci) of dbCAN-PUL, and the eCAMI subfamily table to support the predicted substrates; (iv) A statistics page to organize all the data for easy CGC access according to substrates and taxonomic phyla; and (v) A batch download page. In summary, this updated dbCAN-seq database highlights glycan substrates predicted for CGCs from microbiomes. Future work will implement the substrate prediction function in our dbCAN2 web server. 
  4. ABSTRACT Dead fungal cells, known as necromass, are increasingly recognised as significant contributors to long‐term soil carbon pools, yet the genes involved in necromass decomposition are poorly understood. In particular, how microorganisms degrade necromass with differing initial cell wall chemical compositions using carbohydrate‐active enzymes (CAZymes) has not been well studied. Based on the frequent occurrence and high abundance of the fungal genus Trichoderma on decaying fungal necromass in situ, we grew Trichoderma reesei RUT‐C30 on low and high melanin necromass of Hyaloscypha bicolor (Ascomycota) in liquid cultures and assessed T. reesei gene expression relative to each other and relative to glucose. Transcriptome data revealed that T. reesei up‐regulated many genes (over 100; necromass versus glucose substrate) coding for CAZymes, including enzymes that would target individual layers of an Ascomycota fungal cell wall. We also observed differential expression of protease‐ and laccase‐encoding genes on high versus low melanin necromass, highlighting a subset of genes (fewer than 15) possibly linked to the deconstruction of melanin, a cell wall constituent that limits necromass decay rates in nature. Collectively, these results advance our understanding of the genomic traits underpinning the rates and fates of carbon turnover in an understudied pool of Earth's belowground carbon, fungal necromass.
  5. Abstract Transposable elements represent the largest components of many eukaryotic genomes and different genomes harbor different combinations of elements. Here, we discovered a novel DNA transposon in the genome of the clubmoss Selaginella lepidophylla. Further searching for related sequences to the conserved DDE region uncovered the presence of this superfamily of elements in fish, coral, sea anemone, and other animal species. However, this element appears restricted to Bryophytes and Lycophytes in plants. This transposon, named GingerRoot, is associated with a 6 bp (base pair) target site duplication, and 100–150 bp terminal inverted repeats. Analysis of transposase sequences identified the DDE motif, a catalytic domain, which shows similarity to the integrase of Gypsy-like long terminal repeat retrotransposons, the most abundant component in plant genomes. A total of 77 intact and several hundred truncated copies of GingerRoot elements were identified in S. lepidophylla. Like Gypsy retrotransposons, GingerRoots show a lack of insertion preference near genes, which contrasts to the compact genome size of about 100 Mb. Nevertheless, a considerable portion of GingerRoot elements was found to carry gene fragments, suggesting the capacity of duplicating gene sequences is unlikely attributed to the proximity to genes. Elements carrying gene fragments appear to be less methylated, more diverged, and more distal to genes than those without gene fragments, indicating they are preferentially retained in gene-poor regions. This study has identified a broadly dispersed, novel DNA transposon, and the first plant DNA transposon with an integrase-related transposase, suggesting the possibility of de novo formation of Gypsy-like elements in plants. 