skip to main content


Title: Association mapping across a multitude of traits collected in diverse environments in maize
Abstract

Classical genetic studies have identified many cases of pleiotropy where mutations in individual genes alter many different phenotypes. Quantitative genetic studies of natural genetic variants frequently examine one or a few traits, limiting their potential to identify pleiotropic effects of natural genetic variants. Widely adopted community association panels have been employed by plant genetics communities to study the genetic basis of naturally occurring phenotypic variation in a wide range of traits. High-density genetic marker data—18M markers—from 2 partially overlapping maize association panels comprising 1,014 unique genotypes grown in field trials across at least 7 US states and scored for 162 distinct trait data sets enabled the identification of of 2,154 suggestive marker-trait associations and 697 confident associations in the maize genome using a resampling-based genome-wide association strategy. The precision of individual marker-trait associations was estimated to be 3 genes based on a reference set of genes with known phenotypes. Examples were observed of both genetic loci associated with variation in diverse traits (e.g., above-ground and below-ground traits), as well as individual loci associated with the same or similar traits across diverse environments. Many significant signals are located near genes whose functions were previously entirely unknown or estimated purely via functional data on homologs. This study demonstrates the potential of mining community association panel data using new higher-density genetic marker sets combined with resampling-based genome-wide association tests to develop testable hypotheses about gene functions, identify potential pleiotropic effects of natural genetic variants, and study genotype-by-environment interaction.

 
more » « less
Award ID(s):
1557417
NSF-PAR ID:
10370299
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
GigaScience
Volume:
11
ISSN:
2047-217X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Mapping the genetic basis of complex traits is critical to uncovering the biological mechanisms that underlie disease and other phenotypes. Genome-wide association studies (GWAS) in humans and quantitative trait locus (QTL) mapping in model organisms can now explain much of the observed heritability in many traits, allowing us to predict phenotype from genotype. However, constraints on power due to statistical confounders in large GWAS and smaller sample sizes in QTL studies still limit our ability to resolve numerous small-effect variants, map them to causal genes, identify pleiotropic effects across multiple traits, and infer non-additive interactions between loci (epistasis). Here, we introduce barcoded bulk quantitative trait locus (BB-QTL) mapping, which allows us to construct, genotype, and phenotype 100,000 offspring of a budding yeast cross, two orders of magnitude larger than the previous state of the art. We use this panel to map the genetic basis of eighteen complex traits, finding that the genetic architecture of these traits involves hundreds of small-effect loci densely spaced throughout the genome, many with widespread pleiotropic effects across multiple traits. Epistasis plays a central role, with thousands of interactions that provide insight into genetic networks. By dramatically increasing sample size, BB-QTL mapping demonstrates the potential of natural variants in high-powered QTL studies to reveal the highly polygenic, pleiotropic, and epistatic architecture of complex traits. 
    more » « less
  2. Abstract Background

    Genome wide association (GWA) studies demonstrate linkages between genetic variants and traits of interest. Here, we tested associations between single nucleotide polymorphisms (SNPs) in rice (Oryza sativa) and two root hair traits, root hair length (RHL) and root hair density (RHD). Root hairs are outgrowths of single cells on the root epidermis that aid in nutrient and water acquisition and have also served as a model system to study cell differentiation and tip growth. Using lines from the Rice Diversity Panel-1, we explored the diversity of root hair length and density across four subpopulations of rice (aus,indica,temperate japonica, andtropical japonica). GWA analysis was completed using the high-density rice array (HDRA) and the rice reference panel (RICE-RP) SNP sets.

    Results

    We identified 18 genomic regions related to root hair traits, 14 of which related to RHD and four to RHL. No genomic regions were significantly associated with both traits. Two regions overlapped with previously identified quantitative trait loci (QTL) associated with root hair density in rice. We identified candidate genes in these regions and present those with previously published expression data relevant to root hair development. We re-phenotyped a subset of lines with extreme RHD phenotypes and found that the variation in RHD was due to differences in cell differentiation, not cell size, indicating genes in an associated genomic region may influence root hair cell fate. The candidate genes that we identified showed little overlap with previously characterized genes in rice andArabidopsis.

    Conclusions

    Root hair length and density are quantitative traits with complex and independent genetic control in rice. The genomic regions described here could be used as the basis for QTL development and further analysis of the genetic control of root hair length and density. We present a list of candidate genes involved in root hair formation and growth in rice, many of which have not been previously identified as having a relation to root hair growth. Since little is known about root hair growth in grasses, these provide a guide for further research and crop improvement.

     
    more » « less
  3. Abstract Genome-wide association studies (GWAS) are integral for studying genotype-phenotype relationships and gaining a deeper understanding of the genetic architecture underlying trait variation. A plethora of genetic associations between distinct loci and various traits have been successfully discovered and published for the model plant Arabidopsis thaliana. This success and the free availability of full genomes and phenotypic data for more than 1,000 different natural inbred lines led to the development of several data repositories. AraPheno (https://arapheno.1001genomes.org) serves as a central repository of population-scale phenotypes in A. thaliana, while the AraGWAS Catalog (https://aragwas.1001genomes.org) provides a publicly available, manually curated and standardized collection of marker-trait associations for all available phenotypes from AraPheno. In this major update, we introduce the next generation of both platforms, including new data, features and tools. We included novel results on associations between knockout-mutations and all AraPheno traits. Furthermore, AraPheno has been extended to display RNA-Seq data for hundreds of accessions, providing expression information for over 28 000 genes for these accessions. All data, including the imputed genotype matrix used for GWAS, are easily downloadable via the respective databases. 
    more » « less
  4. Abstract

    Introductions of invasive species to new environments often result in rapid rates of trait evolution. While in some cases these evolutionary transitions are adaptive and driven by natural selection, they can also result from patterns of genetic and phenotypic variation associated with the invasion history. Here, we examined the brown anole (Anolis sagrei), a widespread invasive lizard for which genetic data have helped trace the sources of non‐native populations. We focused on the dewlap, a complex signalling trait known to be subject to multiple selective pressures. We measured dewlap reflectance, pattern and size in 30 non‐native populations across the southeastern United States. As well, we quantified environmental variables known to influence dewlap signal effectiveness, such as canopy openness. Further, we used genome‐wide data to estimate genetic ancestry, perform association mapping and test for signatures of selection. We found that among‐population variation in dewlap characteristics was best explained by genetic ancestry. This result was supported by genome‐wide association mapping, which identified several ancestry‐specific loci associated with dewlap traits. Despite the strong imprint of this aspect of the invasion history on dewlap variation, we also detected significant relationships between dewlap traits and local environmental conditions. However, we found limited evidence that dewlap‐associated genetic variants have been subject to selection. Our study emphasizes the importance of genetic ancestry and admixture in shaping phenotypes during biological invasion, while leaving the role of selection unresolved, likely due to the polygenic genetic architecture of dewlaps and selection acting on many genes of small effect.

     
    more » « less
  5. ABSTRACT Genome-wide association studies (GWAS) can identify genetic variants responsible for naturally occurring and quantitative phenotypic variation. Association studies therefore provide a powerful complement to approaches that rely on de novo mutations for characterizing gene function. Although bacteria should be amenable to GWAS, few GWAS have been conducted on bacteria, and the extent to which nonindependence among genomic variants (e.g., linkage disequilibrium [LD]) and the genetic architecture of phenotypic traits will affect GWAS performance is unclear. We apply association analyses to identify candidate genes underlying variation in 20 biochemical, growth, and symbiotic phenotypes among 153 strains of Ensifer meliloti . For 11 traits, we find genotype-phenotype associations that are stronger than expected by chance, with the candidates in relatively small linkage groups, indicating that LD does not preclude resolving association candidates to relatively small genomic regions. The significant candidates show an enrichment for nucleotide polymorphisms (SNPs) over gene presence-absence variation (PAV), and for five traits, candidates are enriched in large linkage groups, a possible signature of epistasis. Many of the variants most strongly associated with symbiosis phenotypes were in genes previously identified as being involved in nitrogen fixation or nodulation. For other traits, apparently strong associations were not stronger than the range of associations detected in permuted data. In sum, our data show that GWAS in bacteria may be a powerful tool for characterizing genetic architecture and identifying genes responsible for phenotypic variation. However, careful evaluation of candidates is necessary to avoid false signals of association. IMPORTANCE Genome-wide association analyses are a powerful approach for identifying gene function. These analyses are becoming commonplace in studies of humans, domesticated animals, and crop plants but have rarely been conducted in bacteria. We applied association analyses to 20 traits measured in Ensifer meliloti , an agriculturally and ecologically important bacterium because it fixes nitrogen when in symbiosis with leguminous plants. We identified candidate alleles and gene presence-absence variants underlying variation in symbiosis traits, antibiotic resistance, and use of various carbon sources; some of these candidates are in genes previously known to affect these traits whereas others were in genes that have not been well characterized. Our results point to the potential power of association analyses in bacteria, but also to the need to carefully evaluate the potential for false associations. 
    more » « less