skip to main content


Title: Evaluating the power and limitations of genome-wide association studies in Caenorhabditis elegans
Abstract Quantitative genetics in Caenorhabditis elegans seeks to identify naturally segregating genetic variants that underlie complex traits. Genome-wide association studies scan the genome for individual genetic variants that are significantly correlated with phenotypic variation in a population, or quantitative trait loci. Genome-wide association studies are a popular choice for quantitative genetic analyses because the quantitative trait loci that are discovered segregate in natural populations. Despite numerous successful mapping experiments, the empirical performance of genome-wide association study has not, to date, been formally evaluated in C. elegans. We developed an open-source genome-wide association study pipeline called NemaScan and used a simulation-based approach to provide benchmarks of mapping performance in collections of wild C. elegans strains. Simulated trait heritability and complexity determined the spectrum of quantitative trait loci detected by genome-wide association studies. Power to detect smaller-effect quantitative trait loci increased with the number of strains sampled from the C. elegans Natural Diversity Resource. Population structure was a major driver of variation in mapping performance, with populations shaped by recent selection exhibiting significantly lower false discovery rates than populations composed of more divergent strains. We also recapitulated previous genome-wide association studies of experimentally validated quantitative trait variants. Our simulation-based evaluation of performance provides the community with critical context to pursue quantitative genetic studies using the C. elegans Natural Diversity Resource to elucidate the genetic basis of complex traits in C. elegans natural populations.  more » « less
Award ID(s):
1930382
NSF-PAR ID:
10356403
Author(s) / Creator(s):
; ; ;
Editor(s):
Macdonald, S
Date Published:
Journal Name:
G3 Genes|Genomes|Genetics
Volume:
12
Issue:
7
ISSN:
2160-1836
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Studies of model organisms have provided important insights into how natural genetic differences shape trait variation. These discoveries are driven by the growing availability of genomes and the expansive experimental toolkits afforded to researchers using these species. For example, Caenorhabditis elegans is increasingly being used to identify and measure the effects of natural genetic variants on traits using quantitative genetics. Since 2016, the C. elegans Natural Diversity Resource (CeNDR) has facilitated many of these studies by providing an archive of wild strains, genome-wide sequence and variant data for each strain, and a genome-wide association (GWA) mapping portal for the C. elegans community. Here, we present an updated platform, the Caenorhabditis Natural Diversity Resource (CaeNDR), that enables quantitative genetics and genomics studies across the three Caenorhabditis species: C. elegans, C. briggsae and C. tropicalis. The CaeNDR platform hosts several databases that are continually updated by the addition of new strains, whole-genome sequence data and annotated variants. Additionally, CaeNDR provides new interactive tools to explore natural variation and enable GWA mappings. All CaeNDR data and tools are accessible through a freely available web portal located at caendr.org.

     
    more » « less
  2. Phenotypic variation in diverse organism-level traits have been studied in Caenorhabditis elegans wild strains, but differences in gene expression and the underlying variation in regulatory mechanisms are largely unknown. Here, we use natural variation in gene expression to connect genetic variants to differences in organismal- level traits, including drug and toxicant responses. We performed transcriptomic analysis on 207 genetically distinct C. elegans wild strains to study natural regulatory variation of gene expression. Using this massive dataset, we performed genome-wide association mappings to investigate the genetic basis underlying gene expression variation and revealed complex genetic architectures. We found a large collection of hotspots enriched for expression quantitative trait loci across the genome. We further used mediation analysis to understand how gene expression variation could underlie organism-level phenotypic variation for a variety of complex traits. These results reveal the natural diversity in gene expression and possible regulatory mechanisms in this keystone model organism, highlighting the promise of gene expression variation in shaping phenotypic diversity. 
    more » « less
  3. Abstract

    Phenotypic variation in organism-level traits has been studied inCaenorhabditis eleganswild strains, but the impacts of differences in gene expression and the underlying regulatory mechanisms are largely unknown. Here, we use natural variation in gene expression to connect genetic variants to differences in organismal-level traits, including drug and toxicant responses. We perform transcriptomic analyses on 207 genetically distinctC. eleganswild strains to study natural regulatory variation of gene expression. Using this massive dataset, we perform genome-wide association mappings to investigate the genetic basis underlying gene expression variation and reveal complex genetic architectures. We find a large collection of hotspots enriched for expression quantitative trait loci across the genome. We further use mediation analysis to understand how gene expression variation could underlie organism-level phenotypic variation for a variety of complex traits. These results reveal the natural diversity in gene expression and possible regulatory mechanisms in this keystone model organism, highlighting the promise of using gene expression variation to understand how phenotypic diversity is generated.

     
    more » « less
  4. Abstract

    The genetic architecture of phenotypic traits can affect the mode and tempo of trait evolution. Human‐altered environments can impose strong natural selection, where successful evolutionary adaptation requires swift and large phenotypic shifts. In these scenarios, theory predicts that adaptation is due to a few adaptive variants of large effect, but empirical studies that have revealed the genetic architecture of rapidly evolved phenotypes are rare, especially for populations inhabiting polluted environments.Funduluskillifish have repeatedly evolved adaptive resistance to extreme pollution in urban estuaries. Prior studies, including genome scans for signatures of natural selection, have revealed some of the genes and pathways important for evolved pollution resistance, and provide context for the genotype–phenotype association studies reported here. We created multiple quantitative trait locus (QTL) mapping families using progenitors from four different resistant populations, and using RAD‐seq genetically mapped variation in sensitivity (developmental perturbations) following embryonic exposure to a model toxicant PCB‐126. We found that one to two large‐effect QTL loci accounted for resistance to PCB‐mediated developmental toxicity. QTLs harbored candidate genes that govern the regulation of aryl hydrocarbon receptor (AHR) signaling. One QTL locus was shared across all populations and another was shared across three populations. One QTL locus showed strong signatures of recent natural selection in the corresponding wild population but another QTL locus did not. Some candidate genes for PCB resistance inferred from genome scans in wild populations were identified as QTL, but some key candidate genes were not. We conclude that rapidly evolved resistance to the developmental defects normally caused by PCB‐126 is governed by few genes of large effect. However, other aspects of resistance beyond developmental phenotypes may be governed by additional loci, such that comprehensive resistance to PCB‐126, and to the mixtures of chemicals that distinguish urban estuaries more broadly, may be more genetically complex.

     
    more » « less
  5. Abstract

    Introductions of invasive species to new environments often result in rapid rates of trait evolution. While in some cases these evolutionary transitions are adaptive and driven by natural selection, they can also result from patterns of genetic and phenotypic variation associated with the invasion history. Here, we examined the brown anole (Anolis sagrei), a widespread invasive lizard for which genetic data have helped trace the sources of non‐native populations. We focused on the dewlap, a complex signalling trait known to be subject to multiple selective pressures. We measured dewlap reflectance, pattern and size in 30 non‐native populations across the southeastern United States. As well, we quantified environmental variables known to influence dewlap signal effectiveness, such as canopy openness. Further, we used genome‐wide data to estimate genetic ancestry, perform association mapping and test for signatures of selection. We found that among‐population variation in dewlap characteristics was best explained by genetic ancestry. This result was supported by genome‐wide association mapping, which identified several ancestry‐specific loci associated with dewlap traits. Despite the strong imprint of this aspect of the invasion history on dewlap variation, we also detected significant relationships between dewlap traits and local environmental conditions. However, we found limited evidence that dewlap‐associated genetic variants have been subject to selection. Our study emphasizes the importance of genetic ancestry and admixture in shaping phenotypes during biological invasion, while leaving the role of selection unresolved, likely due to the polygenic genetic architecture of dewlaps and selection acting on many genes of small effect.

     
    more » « less