skip to main content


Title: Einkorn genomics sheds light on history of the oldest domesticated wheat
Abstract

Einkorn (Triticum monococcum) was the first domesticated wheat species, and was central to the birth of agriculture and the Neolithic Revolution in the Fertile Crescent around 10,000 years ago1,2. Here we generate and analyse 5.2-Gb genome assemblies for wild and domesticated einkorn, including completely assembled centromeres. Einkorn centromeres are highly dynamic, showing evidence of ancient and recent centromere shifts caused by structural rearrangements. Whole-genome sequencing analysis of a diversity panel uncovered the population structure and evolutionary history of einkorn, revealing complex patterns of hybridizations and introgressions after the dispersal of domesticated einkorn from the Fertile Crescent. We also show that around 1% of the modern bread wheat (Triticum aestivum) A subgenome originates from einkorn. These resources and findings highlight the history of einkorn evolution and provide a basis to accelerate the genomics-assisted improvement of einkorn and bread wheat.

 
more » « less
Award ID(s):
1822162
PAR ID:
10518133
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; ; ; ; ; ; ; « less
Publisher / Repository:
Nature
Date Published:
Journal Name:
Nature
Volume:
620
Issue:
7975
ISSN:
0028-0836
Page Range / eLocation ID:
830 to 838
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract A-genome diploid wheats represent the earliest domesticated and cultivated wheat species in the Fertile Crescent and include the donor of the wheat A sub-genome. The A-genome species encompass the cultivated einkorn (Triticum monococcum L. subsp. monococcum), wild einkorn (T. monococcum L. subsp. aegilopoides (Link) Thell.), and Triticum urartu. We evaluated the collection of 930 accessions in the Wheat Genetics Resource Center (WGRC) using genotyping by sequencing and identified 13,860 curated single-nucleotide polymorphisms. Genomic analysis detected misclassified and genetically identical (>99%) accessions, with most of the identical accessions originating from the same or nearby locations. About 56% (n = 520) of the WGRC A-genome species collections were genetically identical, supporting the need for genomic characterization for effective curation and maintenance of these collections. Population structure analysis confirmed the morphology-based classifications of the accessions and reflected the species geographic distributions. We also showed that T. urartu is the closest A-genome diploid to the A-subgenome in common wheat (Triticum aestivum L.) through phylogenetic analysis. Population analysis within the wild einkorn group showed three genetically distinct clusters, which corresponded with wild einkorn races α, β, and γ described previously. The T. monococcum genome-wide FST scan identified candidate genomic regions harboring a domestication selection signature at the Non-brittle rachis 1 (Btr1) locus on the short arm of chromosome 3Am at ∼70 Mb. We established an A-genome core set (79 accessions) based on allelic diversity, geographical distribution, and available phenotypic data. The individual species core set maintained at least 79% of allelic variants in the A-genome collection and constituted a valuable genetic resource to improve wheat and domesticated einkorn in breeding programs. 
    more » « less
  2. Abstract

    Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15 344 693 583 bases and has a weighted average (N50) contig size of 232 659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4 179 762 575 bp of T. aestivum that correspond to its D genome components.

     
    more » « less
  3. SUMMARY

    Aegilopsspecies represent the most important gene pool for breeding bread wheat (Triticum aestivum). Thus, understanding the genome evolution, including chromosomal structural rearrangements and syntenic relationships amongAegilopsspecies or betweenAegilopsand wheat, is important for both basic genome research and practical breeding applications. In the present study, we attempted to develop subgenome D‐specific fluorescencein situhybridization (FISH) probes by selecting D‐specific oligonucleotides based on the reference genome of Chinese Spring. The oligo‐based chromosome painting probes consisted of approximately 26 000 oligos per chromosome and their specificity was confirmed in both diploid and polyploid species containing the D subgenome. Two previously reported translocations involving two D chromosomes have been confirmed in wheat varieties and their derived lines. We demonstrate that the oligo painting probes can be used not only to identify the translocations involving D subgenome chromosomes, but also to determine the precise positions of chromosomal breakpoints. Chromosome painting of 56 accessions ofAe. tauschiifrom different origins led us to identify two novel translocations: a reciprocal 3D‐7D translocation in two accessions and a complex 4D‐5D‐7D translocation in one accession. Painting probes were also used to analyze chromosomes from more diverseAegilopsspecies. These probes produced FISH signals in four different genomes. Chromosome rearrangements were identified inAegilops umbellulata,Aegilops markgrafii, andAegilops uniaristata, thus providing syntenic information that will be valuable for the application of these wild species in wheat breeding.

     
    more » « less
  4. null (Ed.)
    Several plant pathogenic Parastagonospora species have been identified infecting wheat and other cereals over the past 50 years. As new lineages were discovered, naming conventions grew unwieldy and the relationships with previously recognized species remained unclear. We used genome sequencing to clarify relationships among these species and provided new names for most of these species. Six of the nine described Parastagonospora species were recovered from wheat, with five of these species coming from Iran. Genome sequences revealed that three strains thought to be hybrids between P. nodorum and P. pseudonodorum were not actually hybrids, but rather represented rare gene introgressions between those species. Our data are consistent with the hypothesis that P. nodorum originated as a pathogen of wild grasses in the Fertile Crescent, then emerged as a wheat pathogen via host-tracking during the domestication of wheat in the same region. The discovery of a diverse array of Parastagonospora species infecting wheat in Iran suggests that new wheat pathogens could emerge from this region in the future. 
    more » « less
  5. Bread wheat (Triticum aestivum) is a major food crop and an important plant system for agricultural genetics research. However, due to the complexity and size of its allohexaploid genome, genomic resources are limited compared to other major crops. The IWGSC recently published a reference genome and associated annotation (IWGSC CS v1.0, Chinese Spring) that has been widely adopted and utilized by the wheat community. Although this reference assembly represents all three wheat subgenomes at chromosome-scale, it was derived from short reads, and thus is missing a substantial portion of the expected 16 Gbp of genomic sequence. We earlier published an independent wheat assembly (Triticum_aestivum_3.1, Chinese Spring) that came much closer in length to the expected genome size, although it was only a contig-level assembly lacking gene annotations. Here, we describe a reference-guided effort to scaffold those contigs into chromosome-length pseudomolecules, add in any missing sequence that was unique to the IWGSC CS v1.0 assembly, and annotate the resulting pseudomolecules with genes. Our updated assembly, Triticum_aestivum_4.0, contains 15.07 Gbp of non-gap sequence anchored to chromosomes, which is 1.2 Gbps more than the previous reference assembly. It includes 108,639 genes unambiguously localized to chromosomes, including over 2,000 genes that were previously unplaced. We also discovered more than 5,700 additional gene copies, facilitating the accurate annotation of functional gene duplications including at the Ppd-B1 photoperiod response locus. 
    more » « less