skip to main content


Title: A De Novo Transcriptome Assembly of Ceratopteris richardii Provides Insights into the Evolutionary Dynamics of Complex Gene Families in Land Plants
Abstract As the closest extant sister group to seed plants, ferns are an important reference point to study the origin and evolution of plant genes and traits. One bottleneck to the use of ferns in phylogenetic and genetic studies is the fact that genome-level sequence information of this group is limited, due to the extreme genome sizes of most ferns. Ceratopteris richardii (hereafter Ceratopteris) has been widely used as a model system for ferns. In this study, we generated a transcriptome of Ceratopteris, through the de novo assembly of the RNA-seq data from 17 sequencing libraries that are derived from two sexual types of gametophytes and five different sporophyte tissues. The Ceratopteris transcriptome, together with 38 genomes and transcriptomes from other species across the Viridiplantae, were used to uncover the evolutionary dynamics of orthogroups (predicted gene families using OrthoFinder) within the euphyllophytes and identify proteins associated with the major shifts in plant morphology and physiology that occurred in the last common ancestors of euphyllophytes, ferns, and seed plants. Furthermore, this resource was used to identify and classify the GRAS domain transcriptional regulators of many developmental processes in plants. Through the phylogenetic analysis within each of the 15 GRAS orthogroups, we uncovered which GRAS family members are conserved or have diversified in ferns and seed plants. Taken together, the transcriptome database and analyses reported here provide an important platform for exploring the evolution of gene families in land plants and for studying gene function in seed-free vascular plants.  more » « less
Award ID(s):
1931114
NSF-PAR ID:
10277192
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Gaut, Brandon
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
13
Issue:
3
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The large size and complexity of most fern genomes have hampered efforts to elucidate fundamental aspects of fern biology and land plant evolution through genome-enabled research. Here we present a chromosomal genome assembly and associated methylome, transcriptome and metabolome analyses for the model fern species Ceratopteris richardii . The assembly reveals a history of remarkably dynamic genome evolution including rapid changes in genome content and structure following the most recent whole-genome duplication approximately 60 million years ago. These changes include massive gene loss, rampant tandem duplications and multiple horizontal gene transfers from bacteria, contributing to the diversification of defence-related gene families. The insertion of transposable elements into introns has led to the large size of the Ceratopteris genome and to exceptionally long genes relative to other plants. Gene family analyses indicate that genes directing seed development were co-opted from those controlling the development of fern sporangia, providing insights into seed plant evolution. Our findings and annotated genome assembly extend the utility of Ceratopteris as a model for investigating and teaching plant biology. 
    more » « less
  2. Abstract

    Ferns are notorious for possessing large genomes and numerous chromosomes. Despite decades of speculation, the processes underlying the expansive genomes of ferns are unclear, largely due to the absence of a sequenced homosporous fern genome. The lack of this crucial resource has not only hindered investigations of evolutionary processes responsible for the unusual genome characteristics of homosporous ferns, but also impeded synthesis of genome evolution across land plants. Here, we used the model fern speciesCeratopteris richardiito address the processes (e.g., polyploidy, spread of repeat elements) by which the large genomes and high chromosome numbers typical of homosporous ferns may have evolved and have been maintained. We directly compared repeat compositions in species spanning the green plant tree of life and a diversity of genome sizes, as well as both short- and long-read-based assemblies ofCeratopteris. We found evidence consistent with a single ancient polyploidy event in the evolutionary history ofCeratopterisbased on both genomic and cytogenetic data, and on repeat proportions similar to those found in large flowering plant genomes. This study provides a major stepping-stone in the understanding of land plant evolutionary genomics by providing the first homosporous fern reference genome, as well as insights into the processes underlying the formation of these massive genomes.

     
    more » « less
  3. Ferns are the second largest clade of vascular plants with over 10,000 species, yet the generation of genomic resources for the group has lagged behind other major clades of plants. Transcriptomic data have proven to be a powerful tool to assess phylogenetic relationships, using thousands of markers that are largely conserved across the genome, and without the need to sequence entire genomes. We assembled the largest nuclear phylogenetic dataset for ferns to date, including 2884 single-copy nuclear loci from 247 transcriptomes (242 ferns, five outgroups), and investigated phylogenetic relationships across the fern tree, the placement of whole genome duplications (WGDs), and gene retention patterns following WGDs. We generated a well-supported phylogeny of ferns and identified several regions of the fern phylogeny that demonstrate high levels of gene tree–species tree conflict, which largely correspond to areas of the phylogeny that have been difficult to resolve. Using a combination of approaches, we identified 27 WGDs across the phylogeny, including 18 large-scale events (involving more than one sampled taxon) and nine small-scale events (involving only one sampled taxon). Most inferred WGDs occur within single lineages (e.g., orders, families) rather than on the backbone of the phylogeny, although two inferred events are shared by leptosporangiate ferns (excluding Osmundales) and Polypodiales (excluding Lindsaeineae and Saccolomatineae), clades which correspond to the majority of fern diversity. We further examined how retained duplicates following WGDs compared across independent events and found that functions of retained genes were largely convergent, with processes involved in binding, responses to stimuli, and certain organelles over-represented in paralogs while processes involved in transport, organelles derived from endosymbiotic events, and signaling were under-represented. To date, our study is the most comprehensive investigation of the nuclear fern phylogeny, though several avenues for future research remain unexplored. 
    more » « less
  4. Abstract Background and Aims Cycads are regarded as an ancient lineage of living seed plants, and hold important clues to understand the early evolutionary trends of seed plants. The molecular phylogeny and spatio-temporal diversification of one of the species-rich genera of cycads, Macrozamia, have not been well reconstructed. Methods We analysed a transcriptome dataset of 4740 single-copy nuclear genes (SCGs) of 39 Macrozamia species and two outgroup taxa. Based on concatenated (maximum parsimony, maximum likelihood) and multispecies coalescent analyses, we first establish a well-resolved phylogenetic tree of Macrozamia. To identify cyto-nuclear incongruence, the plastid protein coding genes (PCGs) from transcriptome data are extracted using the software HybPiper. Furthermore, we explore the biogeographical history of the genus and shed light on the pattern of floristic exchange between three distinct areas of Australia. Six key diagnostic characters are traced on the phylogenetic framework using two comparative methods, and infra-generic classification is investigated. Key Results The tree topologies of concatenated and multi-species coalescent analyses of SCGs are mostly congruent with a few conflicting nodes, while those from plastid PCGs show poorly supported relationships. The genus contains three major clades that correspond to their distinct distributional areas in Australia. The crown group of Macrozamia is estimated to around 11.80 Ma, with a major expansion in the last 5–6 Myr. Six morphological characters show homoplasy, and the traditional phenetic sectional division of the genus is inconsistent with this current phylogeny. Conclusions This first detailed phylogenetic investigation of Macrozamia demonstrates promising prospects of SCGs in resolving phylogenetic relationships within cycads. Our study suggests that Macrozamia, once widely distributed in Australia, underwent major extinctions because of fluctuating climatic conditions such as cooling and mesic biome disappearance in the past. The current close placement of morphologically distinct species in the phylogenetic tree may be related to neotenic events that occurred in the genus. 
    more » « less
  5. Abstract

    Plants employ a diverse set of defense mechanisms to mediate interactions with insects and fungi. These relationships can leave lasting impacts on host plant genome structure such as rapid expansion of gene families through tandem duplication. These genomic signatures provide important clues about the complexities of plant/biotic stress interactions and evolution. We used a pseudo‐backcross hybrid family to identify quantitative trait loci (QTL) controlling associations betweenPopulustrees and several commonPopulusdiseases and insects. Using whole‐genome sequences from each parent, we identified candidate genes that may mediate these interactions. Candidates were partially validated using mass spectrometry to identify corresponding QTL for defensive compounds. We detected significant QTL for two interacting fungal pathogens and three insects. The QTL intervals contained candidate genes potentially involved in physical and chemical mechanisms of host–plant resistance and susceptibility. In particular, we identified adjoining QTLs for a phenolic glycoside andPhyllocolpasawfly abundance. There was also significant enrichment of recent tandem duplications in the genomic intervals of the native parent, but not the exotic parent. Tandem gene duplication may be an important mechanism for rapid response to biotic stressors, enabling trees with long juvenile periods to reach maturity despite many coevolving biotic stressors.

     
    more » « less