skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A common resequencing‐based genetic marker data set for global maize diversity
SUMMARY Maize (Zea maysssp.mays) populations exhibit vast ranges of genetic and phenotypic diversity. As sequencing costs have declined, an increasing number of projects have sought to measure genetic differences between and within maize populations using whole‐genome resequencing strategies, identifying millions of segregating single‐nucleotide polymorphisms (SNPs) and insertions/deletions (InDels). Unlike older genotyping strategies like microarrays and genotyping by sequencing, resequencing should, in principle, frequently identify and score common genetic variants. However, in practice, different projects frequently employ different analytical pipelines, often employ different reference genome assemblies and consistently filter for minor allele frequency within the study population. This constrains the potential to reuse and remix data on genetic diversity generated from different projects to address new biological questions in new ways. Here, we employ resequencing data from 1276 previously published maize samples and 239 newly resequenced maize samples to generate a single unified marker set of approximately 366 million segregating variants and approximately 46 million high‐confidence variants scored across crop wild relatives, landraces as well as tropical and temperate lines from different breeding eras. We demonstrate that the new variant set provides increased power to identify known causal flowering‐time genes using previously published trait data sets, as well as the potential to track changes in the frequency of functionally distinct alleles across the global distribution of modern maize.  more » « less
Award ID(s):
1826781
PAR ID:
10466604
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
The Plant Journal
Date Published:
Journal Name:
The Plant Journal
Volume:
113
Issue:
6
ISSN:
0960-7412
Page Range / eLocation ID:
1109 to 1121
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The development of next-generation sequencing (NGS) enabled a shift from array-based genotyping to directly sequencing genomic libraries for high-throughput genotyping. Even though whole-genome sequencing was initially too costly for routine analysis in large populations such as breeding or genetic studies, continued advancements in genome sequencing and bioinformatics have provided the opportunity to capitalize on whole-genome information. As new sequencing platforms can routinely provide high-quality sequencing data for sufficient genome coverage to genotype various breeding populations, a limitation comes in the time and cost of library construction when multiplexing a large number of samples. Here we describe a high-throughput whole-genome skim-sequencing (skim-seq) approach that can be utilized for a broad range of genotyping and genomic characterization. Using optimized low-volume Illumina Nextera chemistry, we developed a skim-seq method and combined up to 960 samples in one multiplex library using dual index barcoding. With the dual-index barcoding, the number of samples for multiplexing can be adjusted depending on the amount of data required, and could be extended to 3,072 samples or more. Panels of doubled haploid wheat lines (Triticum aestivum, CDC Stanley x CDC Landmark), wheat-barley (T.aestivumxHordeum vulgare) and wheat-wheatgrass (Triticum durum x Thinopyrum intermedium) introgression lines as well as known monosomic wheat stocks were genotyped using the skim-seq approach. Bioinformatics pipelines were developed for various applications where sequencing coverage ranged from 1 × down to 0.01 × per sample. Using reference genomes, we detected chromosome dosage, identified aneuploidy, and karyotyped introgression lines from the skim-seq data. Leveraging the recent advancements in genome sequencing, skim-seq provides an effective and low-cost tool for routine genotyping and genetic analysis, which can track and identify introgressions and genomic regions of interest in genetics research and applied breeding programs. 
    more » « less
  2. Gene expression and complex phenotypes are determined by the activity of cis-regulatory elements. However, an understanding of how extant genetic variants affect cis regulation remains limited. Here, we investigated the consequences of cis-regulatory diversity using single-cell genomics of more than 0.7 million nuclei across 172Zea mays(maize) inbreds. Our analyses pinpointed cis-regulatory elements distinct to domesticated maize and revealed how historical transposon activity has shaped the cis-regulatory landscape. Leveraging population genetics principles, we fine-mapped about 22,000 chromatin accessibility–associated genetic variants with widespread cell type–specific effects. Variants in TEOSINTE BRANCHED1/CYCLOIDEA/PROLIFERATING CELL FACTOR–binding sites were the most prevalent determinants of chromatin accessibility. Finally, integrating chromatin accessibility–associated variants, organismal trait variation, and population differentiation revealed how local adaptation has rewired regulatory networks in unique cellular contexts to alter maize flowering. 
    more » « less
  3. Abstract Optimizing leaf angle and other canopy architecture traits has helped modern maize (Zea maysL.) become adapted to higher planting densities over the last 60 years. Traditional investigations into genetic control of leaf angle have focused on one leaf or the average of multiple leaves; as a result, our understanding of genetic control across multiple canopy levels is still limited. To address this, genetic mapping across four canopy levels was conducted in the present study to investigate the genetic control of leaf angle across the canopy. We developed two populations of doubled haploid lines derived from three inbreds with distinct leaf angle phenotypes. These populations were genotyped with genotyping‐by‐sequencing and phenotyped for leaf angle at four different canopy levels over multiple years. To understand how leaf angle changes across the canopy, the four measurements were used to derive three additional traits. Composite interval mapping was conducted with the leaf‐specific measurements and the derived traits. A set of 59 quantitative trait loci (QTLs) were uncovered for seven traits, and two genomic regions were consistently detected across multiple canopy levels. Additionally, seven genomic regions were found to contain consistent QTLs with either relatively stable or dynamic effects at different canopy levels. Prioritizing the selection of QTLs with dynamic effects across the canopy will aid breeders in selecting maize hybrids with the ideal canopy architecture that continues to maximize yield on a per area basis under increasing planting densities. 
    more » « less
  4. Genetic diversity found in crop wild relatives is critical to preserve and utilize for crop improvement to achieve sustainable food production amid climate change and increased demand. We genetically characterized a large collection of 1,041Aegilopsaccessions distributed among 23 different species using more than 45K single nucleotide polymorphisms identified by genotyping-by-sequencing. The Wheat Genetics Resource Center (WGRC)Aegilopsgermplasm collection was curated through the identification of misclassified and redundant accessions. There were 49 misclassified and 28 sets of redundant accessions within the four diploid species. The curated germplasm sets now have improved utility for genetic studies and wheat improvement. We constructed a phylogenetic tree and principal component analysis cluster for allAegilopsspecies together, giving one of the most comprehensive views ofAegilops. TheSitopsissection and the U genomeAegilopsclade were further scrutinized with in-depth population analysis. The genetic relatedness among the pair ofAegilopsspecies provided strong evidence for the species evolution, speciation, and diversification. We inferred genome symbols for two speciesAe.neglectaandAe.columnarisbased on the sequence read mapping and the presence of segregating loci on the pertinent genomes as well as genetic clustering. The high genetic diversity observed amongAegilopsspecies indicated that the genus could play an even greater role in providing the critical need for untapped genetic diversity for future wheat breeding and improvement. To fully characterize theseAegilopsspecies, there is an urgent need to generate reference assemblies for these wild wheats, especially for the polyploidAegilops. 
    more » « less
  5. Genetic diversity found in crop wild relatives is critical to preserve and utilize for crop improvement to achieve sustainable food production amid climate change and increased demand. We genetically characterized a large collection of 1,041Aegilopsaccessions distributed among 23 different species using more than 45K single nucleotide polymorphisms identified by genotyping-by-sequencing. The Wheat Genetics Resource Center (WGRC)Aegilopsgermplasm collection was curated through the identification of misclassified and redundant accessions. There were 49 misclassified and 28 sets of redundant accessions within the four diploid species. The curated germplasm sets now have improved utility for genetic studies and wheat improvement. We constructed a phylogenetic tree and principal component analysis cluster for allAegilopsspecies together, giving one of the most comprehensive views ofAegilops. TheSitopsissection and the U genomeAegilopsclade were further scrutinized with in-depth population analysis. The genetic relatedness among the pair ofAegilopsspecies provided strong evidence for the species evolution, speciation, and diversification. We inferred genome symbols for two speciesAe.neglectaandAe.columnarisbased on the sequence read mapping and the presence of segregating loci on the pertinent genomes as well as genetic clustering. The high genetic diversity observed amongAegilopsspecies indicated that the genus could play an even greater role in providing the critical need for untapped genetic diversity for future wheat breeding and improvement. To fully characterize theseAegilopsspecies, there is an urgent need to generate reference assemblies for these wild wheats, especially for the polyploidAegilops. 
    more » « less