The evolutionary histories of many polyploid plant species are difficult to resolve due to a complex interplay of hybridization, incomplete lineage sorting, and missing diploid progenitors. In the case of octoploid strawberry with four subgenomes designated ABCD, the identities of the diploid progenitors for subgenomes C and D have been subject to much debate. By integrating new sequencing data from North American diploids with reticulate phylogeny and admixture analyses, we uncovered introgression from an extinct or unsampled species in the clade ofFragaria viridis,Fragaria nipponica, andFragaria nilgerrensisinto the donor of subgenome A of octoploidFragariaprior to its divergence fromF. vescasubsp. bracteata. We also detected an introgression event fromF. iinumaeinto an ancestor ofF. nipponicaandF. nilgerrensis.Using an LTR-age-distribution-based approach, we estimate that the octoploid and its intermediate hexaploid and tetraploid ancestors emerged approximately 0.8, 2, and 3 million years ago, respectively. These results provide an explanation for previous reports ofF. viridisandF. nipponicaas donors of the C and D subgenomes and suggest a greater role than previously thought for homoploid hybridization in the diploid progenitors of octoploid strawberry. The integrated set of approaches used here can help advance polyploid genome analysis in other species where hybridization and incomplete lineage sorting obscure evolutionary relationships.
more »
« less
Homoploid Hybridization Resolves the Origin of Octoploid Strawberries
Abstract The identity of the diploid progenitors of octoploid cultivated strawberry (Fragaria × ananassa) has been subject to much debate. Past work identified four subgenomes and consistent evidence forF. californica(previously namedF. vescasubsp.bracteata) andF. iinumaeas donors for subgenomes A and B, respectively, with conflicting results for the origins of subgenomes C and D. Here, reticulate phylogeny and admixture analysis support hybridization betweenF. viridisandF. vescain the ancestry of subgenome A, and betweenF. nipponicaandF. iinumaein the ancestry of subgenome B. Using an LTR-age-distribution-based approach, we estimate that the octoploid and its intermediate hexaploid and tetraploid ancestors emerged approximately 0.8, 2, and 3 million years ago, respectively. These results provide an explanation for previous reports ofF. viridisandF. nipponicaas donors of the C and D subgenomes and unify conflicting hypotheses about the evolutionary origin of octoploidFragaria.
more »
« less
- Award ID(s):
- 1912180
- PAR ID:
- 10579859
- Publisher / Repository:
- bioRxiv
- Date Published:
- Format(s):
- Medium: X
- Institution:
- bioRxiv
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Subgenome dominance has been reported in diverse allopolyploid species, where genes from one subgenome are preferentially retained and are more highly expressed than those from other subgenome(s). However, the molecular mechanisms responsible for subgenome dominance remain poorly understood. Here, we develop genome-wide map of accessible chromatin regions (ACRs) in cultivated strawberry (2n = 8x = 56, with A, B, C, D subgenomes). Each ACR is identified as an MNase hypersensitive site (MHS). We discover that the dominant subgenome A contains a greater number of total MHSs and MHS per gene than the submissive B/C/D subgenomes. Subgenome A suffers fewer losses of MHS-related DNA sequences and fewer MHS fragmentations caused by insertions of transposable elements. We also discover that genes and MHSs related to stress response have been preferentially retained in subgenome A. We conclude that preservation of genes and their cognate ACRs, especially those related to stress responses, play a major role in the establishment of subgenome dominance in octoploid strawberry.more » « less
-
Abstract Over the decades, evolutionists and ecologists have shown intense interest in the role of polyploidization in plant evolution. Without clear knowledge of the diploid ancestor(s) of polyploids, we would not be able to answer fundamental ecological questions such as the evolution of niche differences between them or its underlying genetic basis. Here, we explored the evolutionary history of two Fragaria tetraploids, Fragaria corymbosa and Fragaria moupinensis. We de novo assembled five genomes including these two tetraploids and three diploid relatives. Based on multiple lines of evidence, we found no evidence of subgenomes in either of the two tetraploids, suggesting autopolyploid origins. We determined that Fragaria chinensis was the diploid ancestor of F. corymbosa while either an extinct species affinitive to F. chinensis or an unsampled population of F. chinensis could be the progenitor of F. moupinensis. Meanwhile, we found introgression signals between F. chinensis and Fragaria pentaphylla, leading to the genomic similarity between these two diploids. Compared to F. chinensis, gene families related to high ultraviolet (UV)-B and DNA repair were expanded, while those that responded towards abiotic and biotic stresses (such as salt stress, wounding, and various pathogens) were contracted in both tetraploids. Furthermore, the two tetraploids tended to down-regulate defense response genes but up-regulate UV-B response, DNA repairing, and cell division gene expression compared to F. chinensis. These findings may reflect adaptions toward high-altitude habitats. In summary, our work provides insights into the genome evolution of wild Fragaria tetraploids and opens up an avenue for future works to answer deeper evolutionary and ecological questions regarding the strawberry genus.more » « less
-
Abstract Camelina (Camelina sativa), an allohexaploid species, is an emerging aviation biofuel crop that has been the focus of resurgent interest in recent decades. To guide future breeding and crop improvement efforts, the community requires a deeper comprehension of subgenome dominance, often noted in allopolyploid species, “alongside an understanding of the genetic diversity” and population structure of material present within breeding programs. We conducted population genetic analyses of a C. sativa diversity panel, leveraging a new genome, to estimate nucleotide diversity and population structure, and analyzed for patterns of subgenome expression dominance among different organs. Our analyses confirm that C. sativa has relatively low genetic diversity and show that the SG3 subgenome has substantially lower genetic diversity compared to the other two subgenomes. Despite the low genetic diversity, our analyses identified 13 distinct subpopulations including two distinct wild populations and others putatively representing founders in existing breeding populations. When analyzing for subgenome composition of long non-coding RNAs, which are known to play important roles in (a)biotic stress tolerance, we found that the SG3 subgenome contained significantly more lincRNAs compared to other subgenomes. Similarly, transcriptome analyses revealed that expression dominance of SG3 is not as strong as previously reported and may not be universal across all organ types. From a global analysis, SG3 “was only significant higher expressed” in flower, flower bud, and fruit organs, which is an important discovery given that the crop yield is associated with these organs. Collectively, these results will be valuable for guiding future breeding efforts in camelina.more » « less
-
Abstract Coffea arabica, an allotetraploid hybrid ofCoffea eugenioidesandCoffea canephora, is the source of approximately 60% of coffee products worldwide, and its cultivated accessions have undergone several population bottlenecks. We present chromosome-level assemblies of a di-haploidC. arabicaaccession and modern representatives of its diploid progenitors,C. eugenioidesandC. canephora. The three species exhibit largely conserved genome structures between diploid parents and descendant subgenomes, with no obvious global subgenome dominance. We find evidence for a founding polyploidy event 350,000–610,000 years ago, followed by several pre-domestication bottlenecks, resulting in narrow genetic variation. A split between wild accessions and cultivar progenitors occurred ~30.5 thousand years ago, followed by a period of migration between the two populations. Analysis of modern varieties, including lines historically introgressed withC. canephora, highlights their breeding histories and loci that may contribute to pathogen resistance, laying the groundwork for future genomics-based breeding ofC. arabica.more » « less
An official website of the United States government

