skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Genome-Wide Association Study in Two Cohorts from a Multi-generational Mouse Advanced Intercross Line Highlights the Difficulty of Replication Due to Study-Specific Heterogeneity
Abstract There has been extensive discussion of the “Replication Crisis” in many fields, including genome-wide association studies (GWAS). We explored replication in a mouse model using an advanced intercross line (AIL), which is a multigenerational intercross between two inbred strains. We re-genotyped a previously published cohort of LG/J x SM/J AIL mice (F34; n = 428) using a denser marker set and genotyped a new cohort of AIL mice (F39-43; n = 600) for the first time. We identified 36 novel genome-wide significant loci in the F34 and 25 novel loci in the F39-43 cohort. The subset of traits that were measured in both cohorts (locomotor activity, body weight, and coat color) showed high genetic correlations, although the SNP heritabilities were slightly lower in the F39-43 cohort. For this subset of traits, we attempted to replicate loci identified in either F34 or F39-43 in the other cohort. Coat color was robustly replicated; locomotor activity and body weight were only partially replicated, which was inconsistent with our power simulations. We used a random effects model to show that the partial replications could not be explained by Winner’s Curse but could be explained by study-specific heterogeneity. Despite this heterogeneity, we performed a mega-analysis by combining F34 and F39-43 cohorts (n = 1,028), which identified four novel loci associated with locomotor activity and body weight. These results illustrate that even with the high degree of genetic and environmental control possible in our experimental system, replication was hindered by study-specific heterogeneity, which has broad implications for ongoing concerns about reproducibility.  more » « less
Award ID(s):
1910885
PAR ID:
10283142
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
G3 Genes|Genomes|Genetics
Volume:
10
Issue:
3
ISSN:
2160-1836
Page Range / eLocation ID:
951 to 965
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Matise, T (Ed.)
    Abstract Combining samples for genetic association is standard practice in human genetic analysis of complex traits, but is rarely undertaken in rodent genetics. Here, using 23 phenotypes and genotypes from two independent laboratories, we obtained a sample size of 3076 commercially available outbred mice and identified 70 loci, more than double the number of loci identified in the component studies. Fine-mapping in the combined sample reduced the number of likely causal variants, with a median reduction in set size of 51%, and indicated novel gene associations, including Pnpo, Ttll6, and GM11545 with bone mineral density, and Psmb9 with weight. However, replication at a nominal threshold of 0.05 between the two component studies was low, with less than one-third of loci identified in one study replicated in the second. In addition to overestimates in the effect size in the discovery sample (Winner’s Curse), we also found that heterogeneity between studies explained the poor replication, but the contribution of these two factors varied among traits. Leveraging these observations, we integrated information about replication rates, study-specific heterogeneity, and Winner’s Curse corrected estimates of power to assign variants to one of four confidence levels. Our approach addresses concerns about reproducibility and demonstrates how to obtain robust results from mapping complex traits in any genome-wide association study. 
    more » « less
  2. null (Ed.)
    High rates of dispersal can breakdown coadapted gene complexes. However, concentrated genomic architecture (i.e., genomic islands of divergence) can suppress recombination to allow evolution of local adaptations despite high gene flow. Pacific lamprey (Entosphenus tridentatus) is a highly dispersive anadromous fish. Observed trait diversity and evidence for genetic basis of traits suggests it may be locally adapted. We addressed whether concentrated genomic architecture could influence local adaptation for Pacific lamprey. Using two new whole genome assemblies and genotypes from 7,716 single nucleotide polymorphism (SNP) loci in 518 individuals from across the species range, we identified four genomic islands of divergence (on chromosomes 01, 02, 04, and 22). We determined robust phenotype-by-genotype relationships by testing multiple traits across geographic sites. These trait associations probably explain genomic divergence across the species’ range. We genotyped a subset of 302 broadly distributed SNPs in 2,145 individuals for association testing for adult body size, sexual maturity, migration distance and timing, adult swimming ability, and larval growth. Body size traits were strongly associated with SNPs on chromosomes 02 and 04. Moderate associations also implicated SNPs on chromosome 01 as being associated with variation in female maturity. Finally, we used candidate SNPs to extrapolate a heterogeneous spatiotemporal distribution of these predicted phenotypes based on independent data sets of larval and adult collections. These maturity and body size results guide future elucidation of factors driving regional optimization of these traits for fitness. Pacific lamprey is culturally important and imperiled. This research addresses biological uncertainties that challenge restoration efforts. 
    more » « less
  3. Abstract Urbanization is the dominant trend of global land use change. The replicated nature of environmental change associated with urbanization should drive parallel evolution, yet insight into the repeatability of evolutionary processes in urban areas has been limited by a lack of multi-city studies. Here we leverage community science data on coat color in > 60,000 eastern gray squirrels (Sciurus carolinensis) across 43 North American cities to test for parallel clines in melanism, a genetically based trait associated with thermoregulation and crypsis. We show the prevalence of melanism was positively associated with urbanization as measured by impervious cover. Urban–rural clines in melanism were strongest in the largest cities with extensive forest cover and weakest or absent in cities with warmer winter temperatures, where thermal selection likely limits the prevalence of melanism. Our results suggest that novel traits can evolve in a highly repeatable manner among urban areas, modified by factors intrinsic to individual cities, including their size, land cover, and climate. 
    more » « less
  4. Abstract MotivationMany variants identified by genome-wide association studies (GWAS) have been found to affect multiple traits, either directly or through shared pathways. There is currently a wealth of GWAS data collected in numerous phenotypes, and analyzing multiple traits at once can increase power to detect shared variant effects. However, traditional meta-analysis methods are not suitable for combining studies on different traits. When applied to dissimilar studies, these meta-analysis methods can be underpowered compared to univariate analysis. The degree to which traits share variant effects is often not known, and the vast majority of GWAS meta-analysis only consider one trait at a time. ResultsHere, we present a flexible method for finding associated variants from GWAS summary statistics for multiple traits. Our method estimates the degree of shared effects between traits from the data. Using simulations, we show that our method properly controls the false positive rate and increases power when an effect is present in a subset of traits. We then apply our method to the North Finland Birth Cohort and UK Biobank datasets using a variety of metabolic traits and discover novel loci. Availability and implementationOur source code is available at https://github.com/lgai/CONFIT. Supplementary informationSupplementary data are available at Bioinformatics online. 
    more » « less
  5. Testosterone exerts high affinity for the Transient Receptor Potential Melastatin 8 (TRPM8) Ca2+ channel. TRPM8 -/- male mice exhibit disrupted sexual behavior (e.g., indiscriminate approach, delayed satiety), possibly due to decreased ventral tegmental area dopamine neuron activity. It is hypothesized that TRPM8 null mutant mice (Jackson Laboratories) will exhibit disruptions across a range of motivationally-relevant behaviors, including spontaneous locomotor activation, detection of novel stimuli, sucrose preference, and sensitivity to the psychomotor stimulant amphetamine. Initial findings indicate that male TRPM8 mutant mice (n=6) exhibit decreased nocturnal locomotor activity (F(1,12)=23.41, p<0.001), increased behavioral anxiety in the light/dark task (t(10)=2.44, p<0.05; d=1.4), and behavioral despair in the forced swim task (t(10)=3.70, p<0.005; d=2.1). In contrast, these mice tended to prefer a low concentration (0.1%) of sucrose compared to wildtype males (n=6; t(10)=1.35, p=0.09; d=0.83). Tests for sensitivity to amphetamine are in progress. These data suggest a pivotal role for TRPM8 in motivated behavior. 
    more » « less