skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.
Attention:The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 7:00 AM ET to 7:30 AM ET on Friday, April 24 due to maintenance. We apologize for the inconvenience.


Title: Leveraging ancient DNA to uncover signals of natural selection in Europe lost due to admixture or drift
Abstract Large ancient DNA (aDNA) studies offer the chance to examine genomic changes over time, providing direct insights into human evolution. While recent studies have used time-stratified aDNA for selection scans, most focus on single-locus methods. We conducted a multi-locus genotype scan on 708 samples spanning 7000 years of European history. We show that the G12 statistic, originally designed for unphased diploid data, can effectively detect selection in aDNA processed to create ‘pseudo-haplotypes’. In simulations and at known positive control loci (e.g., lactase persistence), G12 outperforms the allele frequency-based selection statistic, SweepFinder2, previously used on aDNA. Applying our approach, we identified 14 candidate regions of selection across four time periods, with half the signals detectable only in the earliest period. Our findings suggest that selective events in European prehistory, including from the onset of animal domestication, have been obscured by neutral processes like genetic drift and demographic shifts such as admixture.  more » « less
Award ID(s):
2240098
PAR ID:
10563726
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Nature Communications
Date Published:
Journal Name:
Nature Communications
Volume:
15
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Kim, Yuseob (Ed.)
    Abstract Natural selection leaves a spatial pattern along the genome, with a haplotype distribution distortion near the selected locus that fades with distance. Evaluating the spatial signal of a population-genetic summary statistic across the genome allows for patterns of natural selection to be distinguished from neutrality. Considering the genomic spatial distribution of multiple summary statistics is expected to aid in uncovering subtle signatures of selection. In recent years, numerous methods have been devised that consider genomic spatial distributions across summary statistics, utilizing both classical machine learning and deep learning architectures. However, better predictions may be attainable by improving the way in which features are extracted from these summary statistics. We apply wavelet transform, multitaper spectral analysis, and S-transform to summary statistic arrays to achieve this goal. Each analysis method converts one-dimensional summary statistic arrays to two-dimensional images of spectral analysis, allowing simultaneous temporal and spectral assessment. We feed these images into convolutional neural networks and consider combining models using ensemble stacking. Our modeling framework achieves high accuracy and power across a diverse set of evolutionary settings, including population size changes and test sets of varying sweep strength, softness, and timing. A scan of central European whole-genome sequences recapitulated well-established sweep candidates and predicted novel cancer-associated genes as sweeps with high support. Given that this modeling framework is also robust to missing genomic segments, we believe that it will represent a welcome addition to the population-genomic toolkit for learning about adaptive processes from genomic data. 
    more » « less
  2. Abstract Much research on the evolution of altruism via kin selection, group selection, and reciprocity focuses on the role of a single locus or quantitative trait. Very few studies have explored how linked selection, or selection at loci neighboring an altruism locus, impacts the evolution of altruism. While linked selection can decrease the efficacy of selection at neighboring loci, it might have other effects including promoting selection for altruism by increasing relatedness in regions of low recombination. Here, we used population genetic simulations to study how negative selection at linked loci, or background selection, affects the evolution of altruism. When altruism occurs between full siblings, we found that background selection interfered with selection on the altruistic allele, increasing its fixation probability when the altruistic allele was disfavored and reducing its fixation when the allele was favored. In other words, background selection has the same effect on altruistic genes in family‐structured populations as it does on other, nonsocial, genes. This contrasts with prior research showing that linked selective sweeps can favor the evolution of cooperation, and we discuss possibilities for resolving these contrasting results. 
    more » « less
  3. Abstract The human gut microbiome is composed of a highly diverse consortia of species that are continually evolving within and across hosts1,2. The ability to identify adaptations common to many human gut microbiomes would show not only shared selection pressures across hosts but also key drivers of functional differentiation of the microbiome that may affect community structure and host traits. However, the extent to which adaptations have spread across human gut microbiomes is relatively unknown. Here we develop a new selection scan statistic named the integrated linkage disequilibrium score (iLDS) that can detect sweeps of adaptive alleles spreading across host microbiomes by migration and horizontal gene transfer. Specifically, iLDS leverages signals of hitchhiking of deleterious variants with a beneficial variant. Application of the statistic to around 30 of the most prevalent commensal gut species from 24 human populations around the world showed more than 300 selective sweeps across species. We find an enrichment for selective sweeps at loci involved in carbohydrate metabolism, indicative of adaptation to host diet, and we find that the targets of selection differ significantly between industrialized populations and non-industrialized populations. One of these sweeps is at a locus known to be involved in the metabolism of maltodextrin—a synthetic starch that has recently become a widespread component of industrialized diets. In summary, our results indicate that recombination between strains fuels pervasive adaptive evolution among human gut commensal bacteria, and strongly implicate host diet and lifestyle as critical selection pressures. 
    more » « less
  4. ABSTRACT Ancient DNA (aDNA) analysis of archaeological dental calculus has provided a wealth of insights into ancient health, demography and lifestyles. However, the workflow for ancient metagenomics is still evolving, raising concerns about reproducibility. Few systematic investigations have examined how DNA extraction methods and library preparation protocols influence ancient oral microbiome recovery, despite evidence from modern populations suggesting that they do. This leaves a gap in our understanding of how wet‐lab protocols impact aDNA recovery from dental calculus. In this study, we apply two DNA extraction and two library preparation methods in the aDNA field on dental calculus samples from Hungary and Niger. Samples from each context have similar chronological ages, but differences in their levels of aDNA preservation are notable, providing additional insights into how the efficacy of wet‐lab protocols is impacted by sample preservation. Several metrics were employed to assess intra‐ and inter‐sample variability, such as DNA fragment length recovery, GC content, clonality, endogenous content, DNA deamination and microbial composition. Our findings indicate that both DNA extraction and library preparation protocols can considerably impact ancient DNA recovery from archaeological dental calculus. Furthermore, no single protocol consistently outperformed the others across all assessments, and the effectiveness of specific protocol combinations depended on the preservation of the sample. These findings highlight the challenges of meta‐analyses and underscore the need to account for technical variability. Lastly, our study raises the question of whether the field should strive to standardise methods for comparability or optimise protocols based on sample preservation and specific research objectives. 
    more » « less
  5. Abstract Key messageSelection over 70 years has led to almost complete fixation of a haplotype spanning ~ 250 Mbp of chomosome 5H in European two-rowed spring barleys, possibly originating from North Africa. AbstractPlant breeding and selection have shaped the genetic composition of modern crops over the past decades and centuries and have led to great improvements in agronomic and quality traits. Knowledge of the genetic composition of breeding germplasm is essential to make informed decisions in breeding programs. In this study, we characterized the structure and composition of 209 barley cultivars representative of the European two-rowed spring barley germplasm of the past 190 years. Utilizing high-density SNP marker data, we identified a distinct centromeric haplotype spanning a ~ 250 Mbp large region on chromosome 5H which likely was first introduced into the European breeding germplasm in the early to mid-twentieth century and has been non-recombining and under strong positive selection over the past 70 years. Almost all cultivars in our panel that were released after 2000 carry this new haplotype, suggesting that this region carries one or several genes conferring highly beneficial traits. Using the global barley collection of the German Federal ex situ gene bank at IPK Gatersleben, we found the new haplotype at high frequencies in six-rowed spring-type landraces from Northern Africa, from which it may have been introduced into modern European barley germplasm via southern European landraces. The presence of a 250 Mbp genomic region characterized by lack of recombination and high levels of fixation in modern barley germplasm has substantial implications for the genetic diversity of the modern barley germplasm and for barley breeding. 
    more » « less