skip to main content


Title: Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction
Phenotypic evaluation and efficient utilization of germplasm collections can be time-intensive, laborious, and expensive. However, with the plummeting costs of next-generation sequencing and the addition of genomic selection to the plant breeder’s toolbox, we now can more efficiently tap the genetic diversity within large germplasm collections. In this study, we applied and evaluated genomic prediction’s potential to a set of 482 pea ( Pisum sativum L.) accessions—genotyped with 30,600 single nucleotide polymorphic (SNP) markers and phenotyped for seed yield and yield-related components—for enhancing selection of accessions from the USDA Pea Germplasm Collection. Genomic prediction models and several factors affecting predictive ability were evaluated in a series of cross-validation schemes across complex traits. Different genomic prediction models gave similar results, with predictive ability across traits ranging from 0.23 to 0.60, with no model working best across all traits. Increasing the training population size improved the predictive ability of most traits, including seed yield. Predictive abilities increased and reached a plateau with increasing number of markers presumably due to extensive linkage disequilibrium in the pea genome. Accounting for population structure effects did not significantly boost predictive ability, but we observed a slight improvement in seed yield. By applying the best genomic prediction model (e.g., RR-BLUP), we then examined the distribution of genotyped but nonphenotyped accessions and the reliability of genomic estimated breeding values (GEBV). The distribution of GEBV suggested that none of the nonphenotyped accessions were expected to perform outside the range of the phenotyped accessions. Desirable breeding values with higher reliability can be used to identify and screen favorable germplasm accessions. Expanding the training set and incorporating additional orthogonal information (e.g., transcriptomics, metabolomics, physiological traits, etc.) into the genomic prediction framework can enhance prediction accuracy.  more » « less
Award ID(s):
2019077
NSF-PAR ID:
10357567
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Genetics
Volume:
12
ISSN:
1664-8021
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Accelerating biomass improvement is a major goal ofMiscanthusbreeding. The development and implementation of genomic‐enabled breeding tools, like marker‐assisted selection (MAS) and genomic selection, has the potential to improve the efficiency ofMiscanthusbreeding. The present study conducted genome‐wide association (GWA) and genomic prediction of biomass yield and 14 yield‐components traits inMiscanthus sacchariflorus. We evaluated a diversity panel with 590 accessions ofM. sacchariflorusgrown across 4 years in one subtropical and three temperate locations and genotyped with 268,109 single‐nucleotide polymorphisms (SNPs). The GWA study identified a total of 835 significant SNPs and 674 candidate genes across all traits and locations. Of the significant SNPs identified, 280 were localized in mapped quantitative trait loci intervals and proximal to SNPs identified for similar traits in previously reportedMiscanthusstudies, providing additional support for the importance of these genomic regions for biomass yield. Our study gave insights into the genetic basis for yield‐component traits inM. sacchariflorusthat may facilitate marker‐assisted breeding for biomass yield. Genomic prediction accuracy for the yield‐related traits ranged from 0.15 to 0.52 across all locations and genetic groups. Prediction accuracies within the six genetic groupings ofM. saccharifloruswere limited due to low sample sizes. Nevertheless, the Korea/NE China/Russia (N = 237) genetic group had the highest prediction accuracy of all genetic groups (ranging 0.26–0.71), suggesting that with adequate sample sizes, there is strong potential for genomic selection within the genetic groupings ofM. sacchariflorus. This study indicated that MAS and genomic prediction will likely be beneficial for conducting population‐improvement ofM. sacchariflorus.

     
    more » « less
  2. Abstract BACKGROUND

    Pea (Pisum sativum) is a prevalent cool‐season crop that produces seeds valued for their high protein content. Modern cultivars have incorporated several traits that improved harvested yield. However, progress toward improving seed quality has received less emphasis, in part due to the lack of tools for easily and rapidly measuring seed traits. In this study we evaluated the accuracy of single‐seed near‐infrared spectroscopy (NIRS) for measuring pea‐seed weight, protein, and oil content. A total of 96 diverse pea accessions were analyzed using both single‐seed NIRS and wet chemistry methods. To demonstrate field relevance, the single‐seed NIRS protein prediction model was used to determine the impact of seed treatments and foliar fungicides on the protein content of harvested dry peas in a field trial.

    RESULTS

    External validation of partial least squares (PLS) regression models showed high prediction accuracy for protein and weight (R2= 0.94 for both) and less accuracy for oil (R2= 0.74). Single‐seed weight was weakly correlated with protein and oil content in contrast with previous reports. In the field study, the single‐seed NIRS predicted protein values were within 10 mg g−1of an independent analytical reference measurement and were sufficiently precise to detect small treatment effects.

    CONCLUSION

    The high accuracy of protein and weight estimation show that single‐seed NIRS could be used in the dual selection of high‐protein, high‐weight peas early in the breeding cycle, allowing for faster genetic advancement toward improved pea nutritional quality. © 2020 Society of Chemical Industry

     
    more » « less
  3. Summary

    Hybrid breeding is the main strategy for improving productivity in many crops, especially in rice and maize. Genomic hybrid breeding is a technology that uses whole‐genome markers to predict future hybrids. Predicted superior hybrids are then field evaluated and released as new hybrid cultivars after their superior performances are confirmed. This will increase the opportunity of selecting true superior hybrids with minimum costs. Here, we used genomic best linear unbiased prediction to perform hybrid performance prediction using an existing rice population of 1495 hybrids. Replicated 10‐fold cross‐validations showed that the prediction abilities on ten agronomic traits ranged from 0.35 to 0.92. Using the 1495 rice hybrids as a training sample, we predicted six agronomic traits of 100 hybrids derived from half diallel crosses involving 21 parents that are different from the parents of the hybrids in the training sample. The prediction abilities were relatively high, varying from 0.54 (yield) to 0.92 (grain length). We concluded that the current population of 1495 hybrids can be used to predict hybrids from seemingly unrelated parents. Eventually, we used this training population to predict all potential hybrids of cytoplasm male sterile lines from 3000 rice varieties from the 3K Rice Genome Project. Using a breeding index combining 10 traits, we identified the top and bottom 200 predicted hybrids. SNP genotypes of the training population and parameters estimated from this training population are available for general uses and further validation in genomic hybrid prediction of all potential hybrids generated from all varieties of rice.

     
    more » « less
  4. Abstract

    Tepary bean (Phaseolus acutifoliusA. Gray), indigenous to the arid climates of northern Mexico and the Southwest United States, diverged from common bean (Phaseolus vulgarisL.), approximately 2 million years ago and exhibits a wide range of resistance to biotic stressors. The tepary genome is highly syntenic to the common bean genome providing a foundation for discovery and breeding of agronomic traits between these two crop species. Although a limited number of adaptive traits from tepary bean have been introgressed into common bean, hybridization barriers between these two species required the development of bridging lines to alleviate this barrier. Thus, to fully utilize the extant tepary bean germplasm as both a crop and as a donor of adaptive traits, we developed a diversity panel of 422 cultivated, weedy, and wild tepary bean accessions which were then genotyped and phenotyped to enable population genetic analyses and genome‐wide association studies for their response to a range of biotic stressors. Population structure analyses of the panel revealed eight subpopulations and the differentiation of botanical varieties withinP. acutifolius. Genome‐wide association studies revealed loci and candidate genes underlying biotic stress resistance including quantitative trait loci for resistance to weevils, common bacterial blight, Fusarium wilt, and bean common mosaic necrosis virus that can be harnessed not only for tepary bean but also common bean improvement.

     
    more » « less
  5. Abstract

    Cotton bacterial blight (CBB), caused by the pathogenXanthomonas citrisubsp.malvacearum(Xcm), can inflict significant damage to cotton (Gossypium hirsutumL.) production. Previously, we identified and mapped the broad‐spectrum CBB‐resistant locusBB‐13on the long arm of chromosome D02 using array‐based single nucleotide polymorphisms (SNPs). In the current study, linked SNPs were converted into easily assayable Kompetitive Allele‐Specific PCR (KASP) markers to enable efficient detection and marker‐assisted selection of alleles at theBB‐13locus. The KASP marker's efficiency in detecting theBB‐13resistant gene was validated using an Upland cotton diversity panel of 72 accessions phenotyped withXcmrace 18. The KASP marker NCBB‐KASP4, derived from the CottonSNP63K array‐based marker i25755Gh that is closely associated withBB‐13, predicted the CBB response phenotypes with an error rate of 4.17% in the diversity panel. Additionally, two independent biparental recombinant inbred line populations segregating for resistance toXcmrace 18 were used for KASP marker validation and to test their utility in detecting the presence of theBB‐13locus in the resistant accession CABD3CABCH‐1‐89. NCBB‐KASP4, validated across breeding populations and broad germplasm, is a reliable KASP marker for detection and testing ofBB‐13locus in cotton. Further, diagnostic array‐based SNP marker i25755Gh's allele pattern and the potential CBB response is described for 875Gossypiumaccessions. These SNP‐based phenotypic predictions for 875 accessions along with disease response phenotypes toXcmrace 18 for 253 accessions provide a reference for CBB resistance in diverse cotton germplasm in the United States.

     
    more » « less