Title: Mathematical constraints on F_ST: multiallelic markers in arbitrarily many populations
Interpretations of values of the F_ST measure of genetic differentiation rely on an understanding of its mathematical constraints. Previously, it has been shown that F_ST values computed from a biallelic locus in a set of multiple populations and F_ST values computed from a multiallelic locus in a pair of populations are mathematically constrained as a function of the frequency of the allele that is most frequent across populations. We generalize from these cases to report here the mathematical constraint on F_ST given the frequency M of the most frequent allele at a multiallelic locus in a set of multiple populations. Using coalescent simulations of an island model of migration with an infinitely-many-alleles mutation model, we argue that the joint distribution of F_ST and M helps in disentangling the separate influences of mutation and migration on F_ST. Finally, we show that our results explain a puzzling pattern of microsatellite differentiation: the lower F_ST in an interspecific comparison between humans and chimpanzees than in the comparison of chimpanzee populations. We discuss the implications of our results for the use of F_ST. This article is part of the theme issue ‘Celebrating 50 years since Lewontin's apportionment of human diversity’.
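As a rough illustration of the two quantities the abstract relates, the sketch below computes F_ST and M from a table of allele frequencies. It assumes the common heterozygosity-based form F_ST = (H_T - H_S) / H_T with equally weighted populations; the record itself does not state the estimator used in the article, and the function name fst_and_m and its input layout are hypothetical.

```python
def fst_and_m(freqs):
    """Return (F_ST, M) for one locus.

    freqs: list of K populations, each a list of allele frequencies
    (same allele order in every population, each list summing to 1).
    Assumption: F_ST = (H_T - H_S) / H_T with equal population weights.
    """
    k = len(freqs)
    n_alleles = len(freqs[0])
    if any(len(pop) != n_alleles for pop in freqs):
        raise ValueError("all populations must list the same alleles")

    # Mean frequency of each allele across the K populations.
    mean_freq = [sum(pop[i] for pop in freqs) / k for i in range(n_alleles)]

    # H_S: mean within-population heterozygosity.
    h_s = 1.0 - sum(p * p for pop in freqs for p in pop) / k
    # H_T: heterozygosity of the pooled (mean) frequencies.
    h_t = 1.0 - sum(p * p for p in mean_freq)
    if h_t == 0.0:
        raise ValueError("F_ST is undefined when the locus is monomorphic")

    # M: frequency of the allele that is most frequent across populations.
    return (h_t - h_s) / h_t, max(mean_freq)


# Two populations fixed for different alleles: maximal differentiation.
print(fst_and_m([[1.0, 0.0], [0.0, 1.0]]))  # (1.0, 0.5)
```

Under this definition, identical frequencies in every population give F_ST = 0 whatever the value of M, so any constraint imposed by M acts on the upper end of the range of F_ST.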
Award ID(s):
2116322
PAR ID:
10329249
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Philosophical Transactions of the Royal Society B: Biological Sciences
Volume:
377
Issue:
1852
ISSN:
0962-8436
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Allele-sharing statistics for a genetic locus measure the dissimilarity between two populations as a mean of the dissimilarity between random pairs of individuals, one from each population. Owing to within-population variation in genotype, allele-sharing dissimilarities can have the property that they have a nonzero value when computed between a population and itself. We consider the mathematical properties of allele-sharing dissimilarities in a pair of populations, treating the allele frequencies in the two populations parametrically. Examining two formulations of allele-sharing dissimilarity, we obtain the distributions of within-population and between-population dissimilarities for pairs of individuals. We then mathematically explore the scenarios in which, for certain allele-frequency distributions, the within-population dissimilarity – the mean dissimilarity between randomly chosen members of a population – can exceed the dissimilarity between two populations. Such scenarios assist in explaining observations in population-genetic data that members of a population can be empirically more genetically dissimilar from each other on average than they are from members of another population. For a population pair, however, the mathematical analysis finds that at least one of the two populations always possesses smaller within-population dissimilarity than the value of the between-population dissimilarity. We illustrate the mathematical results with an application to human population-genetic data. 
  2. Abstract Recurrent mutation produces multiple copies of the same allele which may be co-segregating in a population. Yet, most analyses of allele-frequency or site-frequency spectra assume that all observed copies of an allele trace back to a single mutation. We develop a sampling theory for the number of latent mutations in the ancestry of a rare variant, specifically a variant observed in relatively small count in a large sample. Our results follow from the statistical independence of low-count mutations, which we show to hold for the standard neutral coalescent or diffusion model of population genetics as well as for more general coalescent trees. For populations of constant size, these counts are distributed like the number of alleles in the Ewens sampling formula. We develop a Poisson sampling model for populations of varying size and illustrate it using new results for site-frequency spectra in an exponentially growing population. We apply our model to a large data set of human SNPs and use it to explain dramatic differences in site-frequency spectra across the range of mutation rates in the human genome. 
  3. Fraser, Bonnie (Ed.)
    Abstract Selection on standing genetic variation may be effective enough to allow for adaptation to distinct niche environments within a single generation. Minor allele frequency changes at multiple, redundant loci of small effect can produce remarkable phenotypic shifts. Yet, demonstrating rapid adaptation via polygenic selection in the wild remains challenging. Here we harness natural replicate populations that experience similar selection pressures and harbor high within-, yet negligible among-population genetic variation. Such populations can be found among the teleost Fundulus heteroclitus that inhabits marine estuaries characterized by high environmental heterogeneity. We identify 10,861 single nucleotide polymorphisms in F. heteroclitus that belong to a single, panmictic population yet reside in environmentally distinct niches (one coastal basin and three replicate tidal ponds). By sampling at two time points within a single generation, we quantify both allele frequency change within niche subpopulations and spatial divergence among them. We observe few individually significant allele frequency changes yet find that the “number” of moderate changes exceeds the neutral expectation by 10–100%. We find allele frequency changes to be significantly concordant in both direction and magnitude among all niche subpopulations, suggestive of parallel selection. In addition, within-generation allele frequency changes generate subtle but significant divergence among niches, indicative of local adaptation. Although we cannot distinguish between selection and genotype-dependent migration as drivers of within-generation allele frequency changes, the trait(s) determining fitness and/or migration likelihood appear to be polygenic. In heterogeneous environments, polygenic selection and polygenic, genotype-dependent migration offer conceivable mechanisms for within-generation, local adaptation to distinct niches.
  4. Abstract The large geographic distribution of the eastern oyster, Crassostrea virginica, makes it an ideal species to test how populations have adapted to latitudinal gradients in temperature. Despite inhabiting distinct thermal regimes, populations of C. virginica near the species’ southern and northern geographic range show no population differences in their physiological response to temperature. In this study, we used comparative transcriptomics to understand how oysters from either end of the species’ range maintain enantiostasis across three acclimation temperatures (10, 20, and 30°C). With this approach, we identified genes that were differentially expressed in response to temperature between individuals of C. virginica collected from New Brunswick, Canada and Louisiana, USA. We observed a core set of genes whose expression responded to temperature in both populations, but also an even larger set of genes with expression patterns that were unique to each population. Intriguingly, the genes with population-specific responses to temperature had elevated F_ST and Ka/Ks ratios compared to the genome-wide average. In contrast, genes showing only a response to temperature were found to only have elevated F_ST values, suggesting that divergent F_ST may be due to selection on linked regulatory regions rather than positive selection on protein coding regions. Taken together, our results suggest that, despite coarse-scale physiological similarities, natural selection has shaped divergent gene expression responses to temperature in geographically separated populations of this broadly eurythermal marine invertebrate.
  5. Evolution by natural selection may be effective enough to allow for recurrent, rapid adaptation to distinct niche environments within a well-mixed population. For this to occur, selection must act on standing genetic variation such that mortality, i.e., genetic load, is minimized while polymorphism is maintained. Selection on multiple, redundant loci of small effect provides a potentially inexpensive solution. Yet, demonstrating adaptation via redundant, polygenic selection in the wild remains extremely challenging because low per-locus effect sizes and high genetic redundancy severely reduce statistical power. One approach to facilitate identification of loci underlying polygenic selection is to harness natural replicate populations experiencing similar selection pressures that harbor high within-, yet negligible among-population genetic variation. Such populations can be found among the teleost Fundulus heteroclitus. F. heteroclitus inhabits salt marsh estuaries that are characterized by high environmental heterogeneity, e.g., tidal ponds, creeks, and coastal basins. Here, we sample four of these heterogeneous niches (one coastal basin and three replicate tidal ponds) at two time points from among a single, panmictic F. heteroclitus population. We identify 10,861 single nucleotide polymorphisms using a genotyping-by-sequencing approach and quantify temporal allele frequency change within, as well as spatial divergence among, subpopulations residing in these niches. We find a significantly elevated number of concordant allele frequency changes among all subpopulations, suggesting ecosystem-wide adaptation to a common selection pressure. Remarkably, we also find an unexpected number of temporal allele frequency changes that generate fine-scale divergence among subpopulations, suggestive of local adaptation to distinct niche environments. Both patterns are characterized by a lack of large-effect loci yet an elevated total number of significant loci. Adaptation via redundant, polygenic selection offers a likely explanation for these patterns as well as a potential mechanism for polymorphism maintenance in the F. heteroclitus system.