skip to main content

Title: A high-throughput skim-sequencing approach for genotyping, dosage estimation and identifying translocations

The development of next-generation sequencing (NGS) enabled a shift from array-based genotyping to directly sequencing genomic libraries for high-throughput genotyping. Even though whole-genome sequencing was initially too costly for routine analysis in large populations such as breeding or genetic studies, continued advancements in genome sequencing and bioinformatics have provided the opportunity to capitalize on whole-genome information. As new sequencing platforms can routinely provide high-quality sequencing data for sufficient genome coverage to genotype various breeding populations, a limitation comes in the time and cost of library construction when multiplexing a large number of samples. Here we describe a high-throughput whole-genome skim-sequencing (skim-seq) approach that can be utilized for a broad range of genotyping and genomic characterization. Using optimized low-volume Illumina Nextera chemistry, we developed a skim-seq method and combined up to 960 samples in one multiplex library using dual index barcoding. With the dual-index barcoding, the number of samples for multiplexing can be adjusted depending on the amount of data required, and could be extended to 3,072 samples or more. Panels of doubled haploid wheat lines (Triticum aestivum, CDC Stanley x CDC Landmark), wheat-barley (T.aestivumxHordeum vulgare) and wheat-wheatgrass (Triticum durum x Thinopyrum intermedium) introgression lines as well as known monosomic wheat stocks were genotyped using the skim-seq approach. Bioinformatics pipelines were developed for various applications where sequencing coverage ranged from 1 × down to 0.01 × per sample. Using reference genomes, we detected chromosome dosage, identified aneuploidy, and karyotyped introgression lines from the skim-seq data. Leveraging the recent advancements in genome sequencing, skim-seq provides an effective and low-cost tool for routine genotyping and genetic analysis, which can track and identify introgressions and genomic regions of interest in genetics research and applied breeding programs.

more » « less
Award ID(s):
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Reports
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract A-genome diploid wheats represent the earliest domesticated and cultivated wheat species in the Fertile Crescent and include the donor of the wheat A sub-genome. The A-genome species encompass the cultivated einkorn (Triticum monococcum L. subsp. monococcum), wild einkorn (T. monococcum L. subsp. aegilopoides (Link) Thell.), and Triticum urartu. We evaluated the collection of 930 accessions in the Wheat Genetics Resource Center (WGRC) using genotyping by sequencing and identified 13,860 curated single-nucleotide polymorphisms. Genomic analysis detected misclassified and genetically identical (>99%) accessions, with most of the identical accessions originating from the same or nearby locations. About 56% (n = 520) of the WGRC A-genome species collections were genetically identical, supporting the need for genomic characterization for effective curation and maintenance of these collections. Population structure analysis confirmed the morphology-based classifications of the accessions and reflected the species geographic distributions. We also showed that T. urartu is the closest A-genome diploid to the A-subgenome in common wheat (Triticum aestivum L.) through phylogenetic analysis. Population analysis within the wild einkorn group showed three genetically distinct clusters, which corresponded with wild einkorn races α, β, and γ described previously. The T. monococcum genome-wide FST scan identified candidate genomic regions harboring a domestication selection signature at the Non-brittle rachis 1 (Btr1) locus on the short arm of chromosome 3Am at ∼70 Mb. We established an A-genome core set (79 accessions) based on allelic diversity, geographical distribution, and available phenotypic data. The individual species core set maintained at least 79% of allelic variants in the A-genome collection and constituted a valuable genetic resource to improve wheat and domesticated einkorn in breeding programs. 
    more » « less
  2. Powdery mildew caused by Blumeria graminis f. sp. tritici (Bgt) is one of many severe diseases that threaten bread wheat (Triticum aestivum L.) yield and quality worldwide. The discovery and deployment of powdery mildew resistance genes (Pm) can prevent this disease epidemic in wheat. In a previous study, we transferred the powdery mildew resistance gene Pm57 from Aegilops searsii into common wheat and cytogenetically mapped the gene in a chromosome region with the fraction length (FL) 0.75–0.87, which represents 12% segment of the long arm of chromosome 2Ss#1. In this study, we performed RNA-seq using RNA extracted from leaf samples of three infected and mock-infected wheat-Ae. searsii 2Ss#1 introgression lines at 0, 12, 24, and 48 h after inoculation with Bgt isolates. Then we designed 79 molecular markers based on transcriptome sequences and physically mapped them to Ae. searsii chromosome 2Ss#1- in seven intervals. We used these markers to identify 46 wheat-Ae. searsii 2Ss#1 recombinants induced by ph1b, a deletion mutant of pairing homologous (Ph) genes. After analyzing the 46 ph1b-induced 2Ss#1L recombinants in the region where Pm57 is located with different Bgt-responses, we physically mapped Pm57 gene on the long arm of 2Ss#1 in a 5.13 Mb genomic region, which was flanked by markers X67593 (773.72 Mb) and X62492 (778.85 Mb). By comparative synteny analysis of the corresponding region on chromosome 2B in Chinese Spring (T. aestivum L.) with other model species, we identified ten genes that are putative plant defense-related (R) genes which includes six coiled-coil nucleotide-binding site-leucine-rich repeat (CNL), three nucleotide-binding site-leucine-rich repeat (NL) and a leucine-rich receptor-like repeat (RLP) encoding proteins. This study will lay a foundation for cloning of Pm57, and benefit the understanding of interactions between resistance genes of wheat and powdery mildew pathogens. 
    more » « less
  3. Breeding of agricultural crops adapted to climate change and resistant to diseases and pests is hindered by a limited gene pool because of domestication and thousands of years of human selection. One way to increase genetic variation is chromosome-mediated gene transfer from wild relatives by cross hybridization. In the case of wheat ( Triticum aestivum ), the species of genus Aegilops are a particularly attractive source of new genes and alleles. However, during the evolution of the Aegilops and Triticum genera, diversification of the D-genome lineage resulted in the formation of diploid C, M, and U genomes of Aegilops . The extent of structural genome alterations, which accompanied their evolution and speciation, and the shortage of molecular tools to detect Aegilops chromatin hamper gene transfer into wheat. To investigate the chromosome structure and help develop molecular markers with a known physical position that could improve the efficiency of the selection of desired introgressions, we developed single-gene fluorescence in situ hybridization (FISH) maps for M- and U-genome progenitors, Aegilops comosa and Aegilops umbellulata , respectively. Forty-three ortholog genes were located on 47 loci in Ae. comosa and on 52 loci in Ae. umbellulata using wheat cDNA probes. The results obtained showed that M-genome chromosomes preserved collinearity with those of wheat, excluding 2 and 6M containing an intrachromosomal rearrangement and paracentric inversion of 6ML, respectively. While Ae. umbellulata chromosomes 1, 3, and 5U maintained collinearity with wheat, structural reorganizations in 2, 4, 6, and 7U suggested a similarity with the C genome of Aegilops markgrafii . To develop molecular markers with exact physical positions on chromosomes of Aegilops , the single-gene FISH data were validated in silico using DNA sequence assemblies from flow-sorted M- and U-genome chromosomes. The sequence similarity search of cDNA sequences confirmed 44 out of the 47 single-gene loci in Ae. comosa and 40 of the 52 map positions in Ae. umbellulata . Polymorphic regions, thus, identified enabled the development of molecular markers, which were PCR validated using wheat- Aegilops disomic chromosome addition lines. The single-gene FISH-based approach allowed the development of PCR markers specific for cytogenetically mapped positions on Aegilops chromosomes, substituting as yet unavailable segregating map. The new knowledge and resources will support the efforts for the introgression of Aegilops genes into wheat and their cloning. 
    more » « less
  4. Abstract

    Wheat (Triticum aestivum) genetic maps are a key enabling tool for genetic studies. We used genotyping-by-sequencing-(GBS) derived markers to map recombinant inbred line (RIL) and doubled haploid (DH) populations from crosses of W7984 by Opata, and used the maps to explore features of recombination control. The RIL and DH populations, SynOpRIL and SynOpDH, were composed of 906 and 92 individuals, respectively. Two high-density genetic linkage framework maps were constructed of 2,842 and 2,961 cM, harboring 3,634 and 6,580 markers, respectively. Using imputation, we added 43,013 and 86,042 markers to the SynOpRIL and SynOpDH maps. We observed preferential recombination in telomeric regions and reduced recombination in pericentromeric regions. Recombination rates varied between subgenomes, with the D genomes of the two populations exhibiting the highest recombination rates of 0.26–0.27 cM/Mb. QTL mapping identified two additive and three epistatic loci associated with crossover number. Additionally, we used published POPSEQ data from SynOpDH to explore the structural variation in W7984 and Opata. We found that chromosome 5AS is missing from W7984. We also found 2,332 variations larger than 100 kb. Structural variants were more abundant in distal regions, and overlapped 9,196 genes. The two maps provide a resource for trait mapping and genomic-assisted breeding.

    more » « less
  5. Abstract

    High‐throughput phenotyping (HTP) with unoccupied aerial systems (UAS), consisting of unoccupied aerial vehicles (UAV; or drones) and sensor(s), is an increasingly promising tool for plant breeders and researchers. Enthusiasm and opportunities from this technology for plant breeding are similar to the emergence of genomic tools ∼30 years ago, and genomic selection more recently. Unlike genomic tools, HTP provides a variety of strategies in implementation and utilization that generate big data on the dynamic nature of plant growth formed by temporal interactions between growth and environment. This review lays out strategies deployed across four major staple crop species: cotton (Gossypium hirsutumL.), maize (Zea maysL.), soybean (Glycine maxL.), and wheat (Triticum aestivumL.). Each crop highlighted in this review demonstrates how UAS‐collected data are employed to automate and improve estimation or prediction of objective phenotypic traits. Each crop section includes four major topics: (a) phenotyping of routine traits, (b) phenotyping of previously infeasible traits, (c) sample cases of UAS application in breeding, and (d) implementation of phenotypic and phenomic prediction and selection. While phenotyping of routine agronomic and productivity traits brings advantages in time and resource optimization, the most potentially beneficial application of UAS data is in collecting traits that were previously difficult or impossible to quantify, improving selection efficiency of important phenotypes. In brief, UAS sensor technology can be used for measuring abiotic stress, biotic stress, crop growth and development, as well as productivity. These applications and the potential implementation of machine learning strategies allow for improved prediction, selection, and efficiency within breeding programs, making UAS HTP a potentially indispensable asset.

    more » « less