Abstract Convergent evolution is often documented in organisms inhabiting isolated environments with distinct ecological conditions and similar selective regimes. Several Central America islands harbor dwarf Boa populations that are characterized by distinct differences in growth, mass, and craniofacial morphology, which are linked to the shared arboreal and feast-famine ecology of these island populations. Using high-density RADseq data, we inferred three dwarf island populations with independent origins and demonstrate that selection, along with genetic drift, has produced both divergent and convergent molecular evolution across island populations. Leveraging whole-genome resequencing data for 20 individuals and a newly annotated Boa genome, we identify four genes with evidence of phenotypically relevant protein-coding variation that differentiate island and mainland populations. The known roles of these genes involved in body growth (PTPRS, DMGDH, and ARSB), circulating fat and cholesterol levels (MYLIP), and craniofacial development (DMGDH and ARSB) in mammals link patterns of molecular evolution with the unique phenotypes of these island forms. Our results provide an important genome-wide example for quantifying expectations of selection and convergence in closely related populations. We also find evidence at several genomic loci that selection may be a prominent force of evolutionary change—even for small island populations for which drift is predicted to dominate. Overall, while phenotypically convergent island populations show relatively few loci under strong selection, infrequent patterns of molecular convergence are still apparent and implicate genes with strong connections to convergent phenotypes. 
                        more » 
                        « less   
                    
                            
                            Direct detection of natural selection in Bronze Age Britain
                        
                    
    
            We developed a novel method for efficiently estimating time-varying selection coefficients from genome-wide ancient DNA data. In simulations, our method accurately recovers selective trajectories and is robust to misspecification of population size. We applied it to a large data set of ancient and present-day human genomes from Britain and identified seven loci with genome-wide significant evidence of selection in the past 4500 yr. Almost all of them can be related to increased vitamin D or calcium levels, suggesting strong selective pressure on these or related phenotypes. However, the strength of selection on individual loci varied substantially over time, suggesting that cultural or environmental factors moderated the genetic response. Of 28 complex anthropometric and metabolic traits, skin pigmentation was the only one with significant evidence of polygenic selection, further underscoring the importance of phenotypes related to vitamin D. Our approach illustrates the power of ancient DNA to characterize selection in human populations and illuminates the recent evolutionary history of Britain. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 2052653
- PAR ID:
- 10403462
- Date Published:
- Journal Name:
- Genome Research
- Volume:
- 32
- Issue:
- 11-12
- ISSN:
- 1088-9051
- Page Range / eLocation ID:
- 2057 to 2067
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Abstract Large ancient DNA (aDNA) studies offer the chance to examine genomic changes over time, providing direct insights into human evolution. While recent studies have used time-stratified aDNA for selection scans, most focus on single-locus methods. We conducted a multi-locus genotype scan on 708 samples spanning 7000 years of European history. We show that the G12 statistic, originally designed for unphased diploid data, can effectively detect selection in aDNA processed to create ‘pseudo-haplotypes’. In simulations and at known positive control loci (e.g., lactase persistence), G12 outperforms the allele frequency-based selection statistic, SweepFinder2, previously used on aDNA. Applying our approach, we identified 14 candidate regions of selection across four time periods, with half the signals detectable only in the earliest period. Our findings suggest that selective events in European prehistory, including from the onset of animal domestication, have been obscured by neutral processes like genetic drift and demographic shifts such as admixture.more » « less
- 
            Abstract Multiple lines of genetic and archaeological evidence suggest that there were major demographic changes in the terminal Late Pleistocene epoch and early Holocene epoch of sub-Saharan Africa 1–4 . Inferences about this period are challenging to make because demographic shifts in the past 5,000 years have obscured the structures of more ancient populations 3,5 . Here we present genome-wide ancient DNA data for six individuals from eastern and south-central Africa spanning the past approximately 18,000 years (doubling the time depth of sub-Saharan African ancient DNA), increase the data quality for 15 previously published ancient individuals and analyse these alongside data from 13 other published ancient individuals. The ancestry of the individuals in our study area can be modelled as a geographically structured mixture of three highly divergent source populations, probably reflecting Pleistocene interactions around 80–20 thousand years ago, including deeply diverged eastern and southern African lineages, plus a previously unappreciated ubiquitous distribution of ancestry that occurs in highest proportion today in central African rainforest hunter-gatherers. Once established, this structure remained highly stable, with limited long-range gene flow. These results provide a new line of genetic evidence in support of hypotheses that have emerged from archaeological analyses but remain contested, suggesting increasing regionalization at the end of the Pleistocene epoch.more » « less
- 
            Buerkle, Alex (Ed.)The inference of positive selection in genomes is a problem of great interest in evolutionary genomics. By identifying putative regions of the genome that contain adaptive mutations, we are able to learn about the biology of organisms and their evolutionary history. Here we introduce a composite likelihood method that identifies recently completed or ongoing positive selection by searching for extreme distortions in the spatial distribution of the haplotype frequency spectrum along the genome relative to the genome-wide expectation taken as neutrality. Furthermore, the method simultaneously infers two parameters of the sweep: the number of sweeping haplotypes and the “width” of the sweep, which is related to the strength and timing of selection. We demonstrate that this method outperforms the leading haplotype-based selection statistics, though strong signals in low-recombination regions merit extra scrutiny. As a positive control, we apply it to two well-studied human populations from the 1000 Genomes Project and examine haplotype frequency spectrum patterns at the LCT and MHC loci. We also apply it to a data set of brown rats sampled in NYC and identify genes related to olfactory perception. To facilitate use of this method, we have implemented it in user-friendly open source software.more » « less
- 
            Abstract Biobanks that collect deep phenotypic and genomic data across many individuals have emerged as a key resource in human genetics. However, phenotypes in biobanks are often missing across many individuals, limiting their utility. We propose AutoComplete, a deep learning-based imputation method to impute or ‘fill-in’ missing phenotypes in population-scale biobank datasets. When applied to collections of phenotypes measured across ~300,000 individuals from the UK Biobank, AutoComplete substantially improved imputation accuracy over existing methods. On three traits with notable amounts of missingness, we show that AutoComplete yields imputed phenotypes that are genetically similar to the originally observed phenotypes while increasing the effective sample size by about twofold on average. Further, genome-wide association analyses on the resulting imputed phenotypes led to a substantial increase in the number of associated loci. Our results demonstrate the utility of deep learning-based phenotype imputation to increase power for genetic discoveries in existing biobank datasets.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    