Green plants (Viridiplantae) include around 450,000–500,000 species of great diversity and have important roles in terrestrial and aquatic ecosystems. Here, as part of the One Thousand Plant Transcriptomes Initiative, we sequenced the vegetative transcriptomes of 1,124 species that span the diversity of plants in a broad sense (Archaeplastida), including green plants (Viridiplantae), glaucophytes (Glaucophyta) and red algae (Rhodophyta). Our analysis provides a robust phylogenomic framework for examining the evolution of green plants. Most inferred species relationships are well supported across multiple species tree and supermatrix analyses, but discordance among plastid and nuclear gene trees at a few important nodes highlights the complexity of plant genome evolution, including polyploidy, periods of rapid speciation, and extinction. Incomplete sorting of ancestral variation, polyploidization and massive expansions of gene families punctuate the evolutionary history of green plants. Notably, we find that large expansions of gene families preceded the origins of green plants, land plants and vascular plants, whereas whole-genome duplications are inferred to have occurred repeatedly throughout the evolution of flowering plants and ferns. The increasing availability of high-quality plant genome sequences and advances in functional genomics are enabling research on genome evolution across the green tree of life.
more »
« less
Green plant genomes: What we know in an era of rapidly expanding opportunities
Green plants play a fundamental role in ecosystems, human health, and agriculture. As de novo genomes are being generated for all known eukaryotic species as advocated by the Earth BioGenome Project, increasing genomic information on green land plants is essential. However, setting standards for the generation and storage of the complex set of genomes that characterize the green lineage of life is a major challenge for plant scientists. Such standards will need to accommodate the immense variation in green plant genome size, transposable element content, and structural complexity while enabling research into the molecular and evolutionary processes that have resulted in this enormous genomic variation. Here we provide an overview and assessment of the current state of knowledge of green plant genomes. To date fewer than 300 complete chromosome-scale genome assemblies representing fewer than 900 species have been generated across the estimated 450,000 to 500,000 species in the green plant clade. These genomes range in size from 12 Mb to 27.6 Gb and are biased toward agricultural crops with large branches of the green tree of life untouched by genomic-scale sequencing. Locating suitable tissue samples of most species of plants, especially those taxa from extreme environments, remains one of the biggest hurdles to increasing our genomic inventory. Furthermore, the annotation of plant genomes is at present undergoing intensive improvement. It is our hope that this fresh overview will help in the development of genomic quality standards for a cohesive and meaningful synthesis of green plant genomes as we scale up for the future.
more »
« less
- Award ID(s):
- 1943371
- PAR ID:
- 10323074
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 119
- Issue:
- 4
- ISSN:
- 0027-8424
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The F-box proteins function as substrate receptors to determine the specificity of Skp1-Cul1-F-box ubiquitin ligases. Genomic studies revealed large and diverse sizes of the F-box gene superfamily across plant species. Our previous studies suggested that the plant F-box gene superfamily is under genomic drift evolution promoted by epigenomic programming. However, how the size of the superfamily drifts across plant genomes is currently unknown. Through a large-scale genomic and phylogenetic comparison of the F-box gene superfamily covering 110 green plants and one red algal species, I discovered four distinct groups of plant F-box genes with diverse evolutionary processes. While the members in Clusters 1 and 2 are species/lineage-specific, those in Clusters 3 and 4 are present in over 46 plant genomes. Statistical modeling suggests that F-box genes from the former two groups are skewed toward fewer species and more paralogs compared to those of the latter two groups whose presence frequency and sizes in plant genomes follow a random statistical model. The enrichment of known Arabidopsis F-box genes in Clusters 3 and 4, along with comprehensive biochemical evidence showing that Arabidopsis members in Cluster 4 interact with the Arabidopsis Skp1-like 1 (ASK1), demonstrates over-representation of active F-box genes in these two groups. Collectively, I propose purifying and dosage balancing selection models to explain the lineage/species-specific duplications and expansions of F-box genes in plant genomes. The purifying selection model suggests that most, if not all, lineage/species-specific F-box genes are detrimental and are thus kept at low frequencies in plant genomes.more » « less
-
Centromeres are essential for chromosome function, yet their role in shaping genome evolution in polyploid plants remains poorly understood. Allopolyploidy, where post-hybridization genome doubling merges parental genomes that may differ markedly in chromosomal architecture, has the potential to increase centromeric complexity and influence genomic plasticity. We explore this possibility in carnivorous Caryophyllales, a morphologically and chromosomally diverse plant lineage encompassing sundews, Venus flytraps, and Nepenthes pitcher plants. Focusing on sundews (Drosera), we generated chromosome-scale assemblies of holocentric D. regia and monocentric D. capensis, which share an allohexaploid origin but have diverged dramatically in genome structure. D. regia retains ancestral chromosomal fusions, dispersed centromeric repeats, and conserved synteny, whereas D. capensis exhibits extensive chromosomal reorganization and regionally localized centromeres after a lineage-specific genome duplication. Phylogenomic evidence traces D. regia to an ancient hybridization between sundew- and Venus flytrap-like ancestors, setting it apart within its infrageneric context. Genus-wide satellite DNA repeat profiling reveals rapid turnover and species-level variation in centromere organization. Together, these results establish sundews as a natural system for investigating how centromere dynamics interact with recurrent polyploidization and episodes of ecological innovation to shape genomic resilience.more » « less
-
Abstract Broad paradigms of vertebrate genomic repeat element evolution have been largely shaped by analyses of mammalian and avian genomes. Here, based on analyses of genomes sequenced from over 60 squamate reptiles (lizards and snakes), we show that patterns of genomic repeat landscape evolution in squamates challenge such paradigms. Despite low variance in genome size, squamate genomes exhibit surprisingly high variation among species in abundance (ca. 25–73% of the genome) and composition of identifiable repeat elements. We also demonstrate that snake genomes have experienced microsatellite seeding by transposable elements at a scale unparalleled among eukaryotes, leading to some snake genomes containing the highest microsatellite content of any known eukaryote. Our analyses of transposable element evolution across squamates also suggest that lineage-specific variation in mechanisms of transposable element activity and silencing, rather than variation in species-specific demography, may play a dominant role in driving variation in repeat element landscapes across squamate phylogeny.more » « less
-
Abstract Across plants and animals, genome size is often correlated with life‐history traits: large genomes are correlated with larger seeds, slower development, larger body size and slower cell division. Among decapod crustaceans, caridean shrimps are among the most variable both in terms of genome size variation and life‐history characteristics such as larval development mode and egg size, but the extent to which these traits are associated in a phylogenetic context is largely unknown. In this study, we examine correlations among egg size, larval development and genome size in two different genera of snapping shrimp,AlpheusandSynalpheus, using phylogenetically informed analyses. In bothAlpheusandSynalpheus, egg size is strongly linked to larval development mode: species with abbreviated development had significantly larger eggs than species with extended larval development. We produced the first comprehensive dataset of genome size inAlpheus(n = 37 species) and demonstrated that genome size was strongly and positively correlated with egg size in bothAlpheusandSynalpheus. Correlated trait evolution analyses showed that inAlpheus, changes in genome size were clearly dependent on egg size. InSynalpheus, evolutionary path analyses suggest that changes in development mode (from extended to abbreviated) drove increases in egg volume; larger eggs, in turn, resulted in larger genomes. These data suggest that variation in reproductive traits may underpin the high degree of variation in genome size seen in a wide variety of caridean shrimp groups more generally.more » « less
An official website of the United States government

