Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Birchler, James (Ed.)Abstract Ancient whole-genome duplications (WGDs) are believed to facilitate novelty and adaptation by providing the raw fuel for new genes. However, it is unclear how recent WGDs may contribute to evolvability within recent polyploids. Hybridization accompanying some WGDs may combine divergent gene content among diploid species. Some theory and evidence suggest that polyploids have a greater accumulation and tolerance of gene presence-absence and genomic structural variation, but it is unclear to what extent either is true. To test how recent polyploidy may influence pangenomic variation, we sequenced, assembled, and annotated twelve complete, chromosome-scale genomes of Camelina sativa, an allohexaploid biofuel crop with three distinct subgenomes. Using pangenomic comparative analyses, we characterized gene presence-absence and genomic structural variation both within and between the subgenomes. We found over 75% of ortholog gene clusters are core in Camelina sativa and <10% of sequence space was affected by genomic structural rearrangements. In contrast, 19% of gene clusters were unique to one subgenome, and the majority of these were Camelina-specific (no ortholog in Arabidopsis). We identified an inversion that may contribute to vernalization requirements in winter-type Camelina, and an enrichment of Camelina-specific genes with enzymatic processes related to seed oil quality and Camelina’s unique glucosinolate profile. Genes related to these traits exhibited little presence-absence variation. Our results reveal minimal pangenomic variation in this species, and instead show how hybridization accompanied by WGD may benefit polyploids by merging diverged gene content of different species.more » « less
-
Abstract Fertilization is a fundamental process that triggers seed and fruit development, but the molecular mechanisms underlying fertilization-induced seed development are poorly understood. Previous research has established AGamous-Like62 (AGL62) activation and auxin biosynthesis in the endosperm as key events following fertilization in Arabidopsis (Arabidopsis thaliana) and wild strawberry (Fragaria vesca). To test the hypothesis that epigenetic mechanisms are critical in mediating the effect of fertilization on the activation of AGL62 and auxin biosynthesis in the endosperm, we first identified and analyzed imprinted genes from the endosperm of wild strawberries. We isolated endosperm tissues from F1 seeds of 2 wild strawberry F. vesca subspecies, generated endosperm-enriched transcriptomes, and identified candidate Maternally Expressed and Paternally Expressed Genes (MEGs and PEGs). Through bioinformatic analyses, we identified 4 imprinted genes that may be involved in regulating the expression of FveAGL62 and auxin biosynthesis genes. We conducted functional analysis of a maternally expressed gene FveMYB98 through CRISPR-knockout and over-expression in transgenic strawberries as well as analysis in heterologous systems. FveMYB98 directly repressed FveAGL62 at stage 3 endosperm, which likely serves to limit auxin synthesis and endosperm proliferation. These results provide an inroad into the regulation of early-stage seed development by imprinted genes in strawberries, suggest the potential function of imprinted genes in parental conflict, and identify FveMYB98 as a regulator of a key transition point in endosperm development.more » « less
-
Abstract Camelina (Camelina sativa), an allohexaploid species, is an emerging aviation biofuel crop that has been the focus of resurgent interest in recent decades. To guide future breeding and crop improvement efforts, the community requires a deeper comprehension of subgenome dominance, often noted in allopolyploid species, “alongside an understanding of the genetic diversity” and population structure of material present within breeding programs. We conducted population genetic analyses of a C. sativa diversity panel, leveraging a new genome, to estimate nucleotide diversity and population structure, and analyzed for patterns of subgenome expression dominance among different organs. Our analyses confirm that C. sativa has relatively low genetic diversity and show that the SG3 subgenome has substantially lower genetic diversity compared to the other two subgenomes. Despite the low genetic diversity, our analyses identified 13 distinct subpopulations including two distinct wild populations and others putatively representing founders in existing breeding populations. When analyzing for subgenome composition of long non-coding RNAs, which are known to play important roles in (a)biotic stress tolerance, we found that the SG3 subgenome contained significantly more lincRNAs compared to other subgenomes. Similarly, transcriptome analyses revealed that expression dominance of SG3 is not as strong as previously reported and may not be universal across all organ types. From a global analysis, SG3 “was only significant higher expressed” in flower, flower bud, and fruit organs, which is an important discovery given that the crop yield is associated with these organs. Collectively, these results will be valuable for guiding future breeding efforts in camelina.more » « less
-
Abstract Centromeres in most multicellular eukaryotes are composed of long arrays of repetitive DNA sequences. Interestingly, several transposable elements, including the well-known long terminal repeat centromeric retrotransposon of maize (CRM), were found to be enriched in functional centromeres marked by the centromeric histone H3 (CENH3). Here, we report a centromeric long interspersed nuclear element (LINE), Celine, in Populus species. Celine has colonized preferentially in the CENH3-associated chromatin of every poplar chromosome, with 84% of the Celine elements localized in the CENH3-binding domains. In contrast, only 51% of the CRM elements were bound to CENH3 domains in Populus trichocarpa. These results suggest different centromere targeting mechanisms employed by Celine and CRM elements. Nevertheless, the high target specificity seems to be detrimental to further amplification of the Celine elements, leading to a shorter life span and patchy distribution among plant species compared with the CRM elements. Using a phylogenetically guided approach, we were able to identify Celine-like LINE elements in tea plant (Camellia sinensis) and green ash tree (Fraxinus pennsylvanica). The centromeric localization of these Celine-like LINEs was confirmed in both species. We demonstrate that the centromere targeting property of Celine-like LINEs is of primitive origin and has been conserved among distantly related plant species.more » « less
-
Abstract Subgenome dominance has been reported in diverse allopolyploid species, where genes from one subgenome are preferentially retained and are more highly expressed than those from other subgenome(s). However, the molecular mechanisms responsible for subgenome dominance remain poorly understood. Here, we develop genome-wide map of accessible chromatin regions (ACRs) in cultivated strawberry (2n = 8x = 56, with A, B, C, D subgenomes). Each ACR is identified as an MNase hypersensitive site (MHS). We discover that the dominant subgenome A contains a greater number of total MHSs and MHS per gene than the submissive B/C/D subgenomes. Subgenome A suffers fewer losses of MHS-related DNA sequences and fewer MHS fragmentations caused by insertions of transposable elements. We also discover that genes and MHSs related to stress response have been preferentially retained in subgenome A. We conclude that preservation of genes and their cognate ACRs, especially those related to stress responses, play a major role in the establishment of subgenome dominance in octoploid strawberry.more » « less
-
Abstract Teleost fishes, which are the largest and most diverse group of living vertebrates, have a rich history of ancient and recent polyploidy. Previous studies of allotetraploid common carp and goldfish (cyprinids) reported a dominant subgenome, which is more expressed and exhibits biased gene retention. However, the underlying mechanisms contributing to observed ‘subgenome dominance’ remains poorly understood. Here we report high-quality genomes of twenty-one cyprinids to investigate the origin and subsequent subgenome evolution patterns following three independent allopolyploidy events. We identify the closest extant relatives of the diploid progenitor species, investigate genetic and epigenetic differences among subgenomes, and conclude that observed subgenome dominance patterns are likely due to a combination of maternal dominance and transposable element densities in each polyploid. These findings provide an important foundation to understanding subgenome dominance patterns observed in teleost fishes, and ultimately the role of polyploidy in contributing to evolutionary innovations.more » « less
-
Summary Processes affecting rates of sequence polymorphism are fundamental to the evolution of gene duplicates. The relationship between gene activity and sequence polymorphism can influence the likelihood that functionally redundant gene copies are co‐maintained in stable evolutionary equilibria vs other outcomes such as neofunctionalization.Here, we investigate genic variation in epigenome‐associated polymorphism rates inArabidopsis thalianaand consider whether these affect the evolution of gene duplicates. We compared the frequency of sequence polymorphism and patterns of genetic differentiation between genes classified by exon methylation patterns: unmethylated (unM), gene‐body methylated (gbM), and transposon‐like methylated (teM) states, which reflect divergence in gene expression.We found that the frequency of polymorphism was higher in teM (transcriptionally repressed, tissue‐specific) genes and lower in gbM (active, constitutively expressed) genes. Comparisons of gene duplicates were largely consistent with genome‐wide patterns – gene copies that exhibit teM accumulate more variation, evolve faster, and are in chromatin states associated with reduced DNA repair.This relationship between expression, the epigenome, and polymorphism may lead to the breakdown of equilibrium states that would otherwise maintain genetic redundancies. Epigenome‐mediated polymorphism rate variation may facilitate the evolution of novel gene functions in duplicate paralogs maintained over evolutionary time.more » « less
-
Abstract Gene duplication is a source of evolutionary novelty. DNA methylation may play a role in the evolution of duplicate genes (paralogs) through its association with gene expression. While this relationship has been examined to varying extents in a few individual species, the generalizability of these results at either a broad phylogenetic scale with species of differing duplication histories or across a population remains unknown. We applied a comparative epigenomic approach to 43 angiosperm species across the phylogeny and a population of 928 Arabidopsis (Arabidopsis thaliana) accessions, examining the association of DNA methylation with paralog evolution. Genic DNA methylation was differentially associated with duplication type, the age of duplication, sequence evolution, and gene expression. Whole-genome duplicates were typically enriched for CG-only gene body methylated or unmethylated genes, while single-gene duplications were typically enriched for non-CG methylated or unmethylated genes. Non-CG methylation, in particular, was a characteristic of more recent single-gene duplicates. Core angiosperm gene families were differentiated into those which preferentially retain paralogs and “duplication-resistant” families, which convergently reverted to singletons following duplication. Duplication-resistant families that still have paralogous copies were, uncharacteristically for core angiosperm genes, enriched for non-CG methylation. Non-CG methylated paralogs had higher rates of sequence evolution, higher frequency of presence–absence variation, and more limited expression. This suggests that silencing by non-CG methylation may be important to maintaining dosage following duplication and be a precursor to fractionation. Our results indicate that genic methylation marks differing evolutionary trajectories and fates between paralogous genes and have a role in maintaining dosage following duplication.more » « less
-
Summary Allopolyploids result from hybridization between different evolutionary lineages coupled with genome doubling. Homoeologous chromosomes (chromosomes with common shared ancestry) may undergo recombination immediately after allopolyploid formation and continue over successive generations. The outcome of this meiotic pairing behavior is dynamic and complex. Homoeologous exchanges (HEs) may lead to the formation of unbalanced gametes, reduced fertility, and selective disadvantage. By contrast, HEs could act as sources of novel evolutionary substrates, shifting the relative dosage of parental gene copies, generating novel phenotypic diversity, and helping the establishment of neo‐allopolyploids. However, HE patterns vary among lineages, across generations, and even within individual genomes and chromosomes. The causes and consequences of this variation are not fully understood, though interest in this evolutionary phenomenon has increased in the last decade. Recent technological advances show promise in uncovering the mechanistic basis of HEs. Here, we describe recent observations of the common patterns among allopolyploid angiosperm lineages, underlying genomic and epigenomic features, and consequences of HEs. We identify critical research gaps and discuss future directions with far‐reaching implications in understanding allopolyploid evolution and applying them to the development of important phenotypic traits of polyploid crops.more » « less
-
Abstract Ethiopian mustard (Brassica carinata) is an ancient crop with remarkable stress resilience and a desirable seed fatty acid profile for biofuel uses. Brassica carinata is one of six Brassica species that share three major genomes from three diploid species (AA, BB, and CC) that spontaneously hybridized in a pairwise manner to form three allotetraploid species (AABB, AACC, and BBCC). Of the genomes of these species, that of B. carinata is the least understood. Here, we report a chromosome scale 1.31-Gbp genome assembly with 156.9-fold sequencing coverage for B. carinata, completing the reference genomes comprising the classic Triangle of U, a classical theory of the evolutionary relationships among these six species. Our assembly provides insights into the hybridization event that led to the current B. carinata genome and the genomic features that gave rise to the superior agronomic traits of B. carinata. Notably, we identified an expansion of transcription factor networks and agronomically important gene families. Completion of the Triangle of U comparative genomics platform has allowed us to examine the dynamics of polyploid evolution and the role of subgenome dominance in the domestication and continuing agronomic improvement of B. carinata and other Brassica species.more » « less
An official website of the United States government
