Title: Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction
Phenotypic evaluation and efficient utilization of germplasm collections can be time-intensive, laborious, and expensive. However, with the plummeting costs of next-generation sequencing and the addition of genomic selection to the plant breeder’s toolbox, we can now tap the genetic diversity within large germplasm collections more efficiently. In this study, we evaluated the potential of genomic prediction for enhancing the selection of accessions from the USDA Pea Germplasm Collection, using a set of 482 pea (Pisum sativum L.) accessions genotyped with 30,600 single nucleotide polymorphism (SNP) markers and phenotyped for seed yield and yield-related components. Genomic prediction models and several factors affecting predictive ability were evaluated in a series of cross-validation schemes across complex traits. Different genomic prediction models gave similar results: predictive ability across traits ranged from 0.23 to 0.60, and no model worked best across all traits. Increasing the training population size improved the predictive ability for most traits, including seed yield. Predictive ability increased and then reached a plateau as the number of markers increased, presumably because of extensive linkage disequilibrium in the pea genome. Accounting for population structure effects did not significantly boost predictive ability, although we observed a slight improvement for seed yield. By applying the best genomic prediction model (e.g., RR-BLUP), we then examined the distribution of genomic estimated breeding values (GEBV) for genotyped but nonphenotyped accessions and the reliability of those GEBV. The distribution of GEBV suggested that none of the nonphenotyped accessions were expected to perform outside the range of the phenotyped accessions. Desirable breeding values with higher reliability can be used to identify and screen favorable germplasm accessions.
Expanding the training set and incorporating additional orthogonal information (e.g., transcriptomics, metabolomics, physiological traits) into the genomic prediction framework can enhance prediction accuracy.
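The cross-validated predictive ability described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes simulated marker data (the function `simulate_panel` and all of its parameters are hypothetical), and it uses ridge regression with a fixed shrinkage parameter `lam`, which is related to RR-BLUP but does not estimate variance components from the data as an RR-BLUP fit normally would. Predictive ability is taken here as the Pearson correlation between predicted and observed phenotypes in held-out folds.

```python
import numpy as np


def ridge_marker_effects(Z, y, lam):
    """Estimate marker effects by ridge regression.

    Uses the n x n (dual) form, u = Zc' (Zc Zc' + lam I)^-1 (y - mean(y)),
    which is cheaper than the m x m form when markers outnumber accessions.
    """
    mu = y.mean()
    z_mean = Z.mean(axis=0)
    Zc = Z - z_mean  # center marker codes on the training set
    n = Z.shape[0]
    u = Zc.T @ np.linalg.solve(Zc @ Zc.T + lam * np.eye(n), y - mu)
    return mu, z_mean, u


def cv_predictive_ability(Z, y, lam, k=5, seed=0):
    """Mean held-out Pearson correlation over k random folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    correlations = []
    for test_idx in folds:
        train = np.ones(len(y), dtype=bool)
        train[test_idx] = False
        mu, z_mean, u = ridge_marker_effects(Z[train], y[train], lam)
        predicted = mu + (Z[test_idx] - z_mean) @ u
        correlations.append(np.corrcoef(predicted, y[test_idx])[0, 1])
    return float(np.mean(correlations))


def simulate_panel(n=300, m=400, n_qtl=50, h2=0.5, seed=1):
    """Hypothetical toy panel: independent 0/1/2 marker codes, additive QTL."""
    rng = np.random.default_rng(seed)
    Z = rng.integers(0, 3, size=(n, m)).astype(float)
    effects = np.zeros(m)
    effects[rng.choice(m, n_qtl, replace=False)] = rng.normal(size=n_qtl)
    g = Z @ effects  # true genetic values
    noise_sd = np.sqrt(g.var() * (1 - h2) / h2)
    y = g + rng.normal(scale=noise_sd, size=n)
    return Z, y


if __name__ == "__main__":
    Z, y = simulate_panel()
    print("predictive ability:", cv_predictive_ability(Z, y, lam=300.0))
```

Rerunning `cv_predictive_ability` on subsets of rows or columns of `Z` is one simple way to explore how training population size and marker number affect predictive ability, in the spirit of the comparisons reported above. The simulated markers are independent, so the plateau attributed to linkage disequilibrium in pea would not necessarily appear in this toy setting.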
Award ID(s):
2019077
PAR ID:
10357567
Journal Name:
Frontiers in Genetics
Volume:
12
ISSN:
1664-8021
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Developments in genomics and phenomics have provided valuable tools for use in cultivar development. Genomic prediction (GP) has been used in commercial soybean [Glycine max (L.) Merr.] breeding programs to predict grain yield and seed composition traits. Phenomic prediction (PP) is a rapidly developing field that holds the potential to be used for the selection of genotypes early in the growing season. The objectives of this study were to compare the performance of GP and PP for predicting soybean seed yield, protein, and oil. We additionally conducted genome-wide association studies (GWAS) to identify significant single-nucleotide polymorphisms (SNPs) associated with the traits of interest. The GWAS panel of 292 diverse accessions was grown in six environments in replicated trials. Spectral data were collected at two time points during the growing season. A genomic best linear unbiased prediction (GBLUP) model was trained on 269 accessions, while three separate machine learning (ML) models were trained on vegetation indices (VIs) and canopy traits. We observed that PP had a higher correlation coefficient than GP for seed yield, while GP had higher correlation coefficients for seed protein and oil contents. VIs with high feature importance were used as covariates in a new GBLUP model, and a new random forest model was trained with the inclusion of selected SNPs. These models did not outperform the original GP and PP models. These results show the capability of using ML for in-season predictions for specific traits in soybean breeding and provide insights on PP and GP inclusions in breeding programs.
  2. Abstract Tepary bean (Phaseolus acutifolius A. Gray), indigenous to the arid climates of northern Mexico and the Southwest United States, diverged from common bean (Phaseolus vulgaris L.) approximately 2 million years ago and exhibits a wide range of resistance to biotic stressors. The tepary genome is highly syntenic to the common bean genome, providing a foundation for discovery and breeding of agronomic traits between these two crop species. Although a limited number of adaptive traits from tepary bean have been introgressed into common bean, hybridization barriers between these two species required the development of bridging lines to alleviate this barrier. Thus, to fully utilize the extant tepary bean germplasm as both a crop and as a donor of adaptive traits, we developed a diversity panel of 422 cultivated, weedy, and wild tepary bean accessions which were then genotyped and phenotyped to enable population genetic analyses and genome-wide association studies for their response to a range of biotic stressors. Population structure analyses of the panel revealed eight subpopulations and the differentiation of botanical varieties within P. acutifolius. Genome-wide association studies revealed loci and candidate genes underlying biotic stress resistance including quantitative trait loci for resistance to weevils, common bacterial blight, Fusarium wilt, and bean common mosaic necrosis virus that can be harnessed not only for tepary bean but also common bean improvement.
  3. Abstract Plant architecture is a major determinant of planting density, which enhances productivity potential for crops per unit area. Genomic prediction is well positioned to expedite genetic gain of plant architectural traits since they are typically highly heritable. Additionally, the adaptation of genomic prediction models to query predictive abilities of markers tagging certain genomic regions could shed light on the genetic architecture of these traits. Here, we leveraged transcriptional networks from a prior study that contextually described developmental progression during tassel and leaf organogenesis in maize (Zea mays) to inform genomic prediction models for architectural traits. Since these developmental processes underlie tassel branching and leaf angle, 2 important agronomic architectural traits, we tested whether genes prioritized from these networks quantitatively contribute to the genetic architecture of these traits. We used genomic prediction models to evaluate the ability of markers in the vicinity of prioritized network genes to predict breeding values of tassel branching and leaf angle traits for 2 diversity panels in maize and diversity panels from sorghum (Sorghum bicolor) and rice (Oryza sativa). Predictive abilities of markers near these prioritized network genes were similar to those using whole-genome marker sets. Notably, markers near highly connected transcription factors from core network motifs in maize yielded predictive abilities that were significantly greater than expected by chance in not only maize but also closely related sorghum. We expect that these highly connected regulators are key drivers of architectural variation that are conserved across closely related cereal crop species. 
  4. Variation in complex traits is the result of contributions from many loci of small effect. Based on this principle, genomic prediction methods are used to make predictions of breeding value for an individual using genome-wide molecular markers. Genomic prediction models have been used in plant and animal breeding for almost two decades to increase rates of genetic improvement and reduce the length of artificial selection experiments. However, evolutionary genomics studies have been slow to incorporate this technique to select individuals for breeding in a conservation context or to learn more about the genetic architecture of traits, the genetic value of missing individuals, or the microevolution of breeding values. Here, we outline the utility of genomic prediction and provide an overview of the methodology. We highlight opportunities to apply genomic prediction in evolutionary genetics of wild populations and the best practices when using these methods on field-collected phenotypes.
  5. Quinoa (Chenopodium quinoa Willd.), a pseudocereal with high protein quality originating from the Andean region of South America, has broad genetic variation and adaptability to diverse agroecological conditions, contributing to the potential to serve as a global keystone protein crop in a changing climate. However, the germplasm resources currently available to facilitate quinoa expansion worldwide are restricted to a small portion of quinoa’s total genetic diversity, in part because of day-length sensitivity and issues related to seed sovereignty. This study aimed to characterize phenotypic relationships and variation within a quinoa world core collection. The 360 accessions were planted in a randomized complete block design with four replicates in each of two greenhouses in Pullman, WA during the summer of 2018. Phenological stages, plant height, and inflorescence characteristics were recorded. Seed yield, composition, thousand seed weight, nutritional composition, shape, size, and color were measured using a high-throughput phenotyping pipeline. Considerable variation existed among the germplasm. Crude protein content ranged from 11.24% to 17.81% (fixed at 14% moisture). We found that protein content was negatively correlated with yield and positively correlated with total amino acid content and days to harvest. Mean essential amino acid values met adult daily requirements but not infant requirements for leucine and lysine. Yield was positively correlated with thousand seed weight and seed area, and negatively correlated with ash content and days to harvest. The accessions clustered into four groups, with one group representing useful accessions for long-day breeding programs. The results of this study establish a practical resource for plant breeders to leverage as they strategically develop germplasm in support of the global expansion of quinoa.