Title: Fast and flexible estimation of effective migration surfaces
Spatial population genetic data often exhibits ‘isolation-by-distance,’ where genetic similarity tends to decrease as individuals become more geographically distant. The rate at which genetic similarity decays with distance is often spatially heterogeneous due to variable population processes like genetic drift, gene flow, and natural selection. Petkova et al. (2016) developed a statistical method called Estimating Effective Migration Surfaces (EEMS) for visualizing spatially heterogeneous isolation-by-distance on a geographic map. While EEMS is a powerful tool for depicting spatial population structure, it can suffer from slow runtimes. Here, we develop a related method called Fast Estimation of Effective Migration Surfaces (FEEMS). FEEMS uses a Gaussian Markov Random Field model in a penalized likelihood framework that allows for efficient optimization and output of effective migration surfaces. Further, the efficient optimization facilitates the inference of migration parameters per edge in the graph, rather than per node (as in EEMS). With simulations, we show conditions under which FEEMS can accurately recover effective migration surfaces with complex gene-flow histories, including those with anisotropy. We apply FEEMS to population genetic data from North American gray wolves and show it performs favorably in comparison to EEMS, with solutions obtained orders of magnitude faster. Overall, FEEMS expands the ability of users to quickly visualize and interpret spatial structure in their data.
Award ID(s):
1654076
NSF-PAR ID:
10331198
Journal Name:
eLife
Volume:
10
ISSN:
2050-084X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Human commensal species such as rodent pests are often widely distributed across cities and threaten both infrastructure and public health. Spatially explicit population genomic methods provide insights into movements for cryptic pests that drive evolutionary connectivity across multiple spatial scales. We examined spatial patterns of neutral genomewide variation in brown rats (Rattus norvegicus) across Manhattan, New York City (NYC), using 262 samples and 61,401 SNPs to understand (i) relatedness among nearby individuals and the extent of spatial genetic structure in a discrete urban landscape; (ii) the geographic origin of NYC rats, using a large, previously published data set of global rat genotypes; and (iii) heterogeneity in gene flow across the city, particularly deviations from isolation by distance. We found that rats separated by ≤200 m exhibit strong spatial autocorrelation (r = .3, p = .001) and the effects of localized genetic drift extend to a range of 1,400 m. Across Manhattan, rats exhibited a homogeneous population origin from rats that likely invaded from Great Britain. While traditional approaches identified a single evolutionary cluster with clinal structure across Manhattan, recently developed methods (e.g., fineSTRUCTURE, sPCA, EEMS) provided evidence of reduced dispersal across the island's less residential Midtown region resulting in fine‐scale genetic structuring (FST = 0.01) and two evolutionary clusters (Uptown and Downtown Manhattan). Thus, while some urban populations of human commensals may appear to be continuously distributed, landscape heterogeneity within cities can drive differences in habitat quality and dispersal, with implications for the spatial distribution of genomic variation, population management and the study of widely distributed pests.

     
  2. Abstract

    Aim

    A central aim of biogeography is to understand how biodiversity is generated and maintained across landscapes. Here, we establish phylogenetic and population genetic patterns in a widespread reptile to quantify the influence of historical biogeography and current environmental variation on patterns of genetic diversity.

    Location

    Western North America.

    Taxon

    Western terrestrial garter snake, Thamnophis elegans.

    Methods

    We used double‐digest RADseq to estimate phylogenetic relationships and characterize population genetic structure across the three widespread subspecies of T. elegans: T. e. vagrans (wandering garter snake), T. e. elegans (mountain garter snake) and T. e. terrestris (coast garter snake). We assessed patterns of dispersal and vicariance across biogeographic regions using ancestral area reconstruction (AAR) and deviations from isolation‐by‐distance across the landscape using estimated effective migration surfaces (EEMS). We identified environmental variables potentially shaping local adaptation in regional lineages using genetic‐environment association (GEA) analyses.

    Results

    We recovered three well‐differentiated genetic groups that correspond to the three subspecies. AAR analyses inferred the eastern Cascade Range as the ancestral area, with dispersal to both the east and west across western North America. Populations of T. e. elegans displayed a latitudinal gradient in genetic variation across the Sierra Nevada and northern California, while populations of T. e. terrestris show discrete genetic breaks consistent with well‐known biogeographic barriers. Lastly, GEA analyses identified allele frequency shifts at loci associated with a common set of environmental variables in both T. e. elegans and T. e. terrestris.

    Main Conclusion

    T. elegans is composed of distinct evolutionary lineages, each with its own geographic range and history of diversification. T. e. elegans and T. e. terrestris show unique patterns of diversification as populations dispersed from east to west while adapting to the new environments they colonized. Historical events, landscape features and environmental variation have all contributed to patterns of differentiation in T. elegans.

     
  3. The estimation of malaria parasite migration can play a vital role in informing elimination strategies by pinpointing regions with higher parasite migration that act as transmission sources, and that could be the focus of elimination interventions. Gene flow simulation methods such as Estimated Effective Migration Surfaces (EEMS) and Migration and Population-Size Surfaces (MAPS) use a Markov Chain Monte Carlo simulation-based approach to visualize a species' migration and diversity. These methods utilize georeferenced genomic data and present output in the form of migration contour maps. Despite their potential, there is uncertainty in EEMS and MAPS outputs when sampling locations are sparse, an aspect that remains under-explored in current research. We present a framework designed to systematically assess the impact of sample locations and sample size on migration contours in gene flow simulations that goes beyond the posterior probability map available in EEMS. We test our framework using publicly available genomic data collected from Cambodia and border regions of Thailand, Vietnam, and Laos during 2008-2013. The methodology leverages kernel density estimation and topological skeletons in conjunction with other spatial analysis methods to quantify the impact of sparse sample locations on gene flow simulations. Multiple sample resolutions were tested against a baseline resolution, and the findings highlight how migration contours vary with sampling resolution and how our approach can be applied to guide the production and mapping of reliable migration contours. Our research provides valuable insights about both the reliability and precision of model outputs when employing gene flow simulation techniques (e.g., EEMS and MAPS) to estimate malaria parasite migration.
The findings revealed that by employing our approach, we were able to maintain approximately 67% consistency between the contours and the reference dataset, even when utilizing only half of the sample locations. This knowledge will improve both the reliability and precision of these model outputs in future studies. 
  4. Abstract Pouched lamprey (Geotria australis) or kanakana/piharau is a culturally and ecologically significant jawless fish that is distributed throughout Aotearoa New Zealand. Despite its importance, much remains unknown about historical relationships and gene flow between populations of this enigmatic species within New Zealand. To help inform management, we assembled a draft Geotria australis genome and completed the first comprehensive population genomics analysis of pouched lamprey within New Zealand using targeted gene sequencing (Cyt-b and COI) and restriction site-associated DNA sequencing (RADSeq) methods. Employing 16,000 genome-wide single nucleotide polymorphisms (SNPs) derived from RADSeq (n=186) and sequence data from Cyt-b (766 bp, n=94) and COI (589 bp, n=20), we reveal low levels of structure across 10 sampling locations spanning the species range within New Zealand. F-statistics, outlier analyses, and STRUCTURE suggest a single panmictic population, and Mantel and EEMS tests reveal no significant isolation by distance. This implies either ongoing gene flow among populations or recent shared ancestry among New Zealand pouched lamprey. We can now use the information gained from these genetic tools to assist managers with monitoring effective population size, managing potential diseases, and conservation measures such as artificial propagation programs. We further demonstrate the general utility of these genetic tools for acquiring information about elusive species. 
  5. Abstract

    Metapopulation‐structured species can be negatively affected when landscape fragmentation impairs connectivity. We investigated the effects of urbanization on genetic diversity and gene flow for two sympatric amphibian species, spotted salamanders (Ambystoma maculatum) and wood frogs (Lithobates sylvaticus), across a large (>35,000 km²) landscape in Maine, USA, containing numerous natural and anthropogenic gradients. Isolation‐by‐distance (IBD) patterns differed between the species. Spotted salamanders showed a linear and relatively high variance relationship between genetic and geographic distances (r = .057, p < .001), whereas wood frogs exhibited a strongly nonlinear and lower variance relationship (r = 0.429, p < .001). Scale dependence analysis of IBD found gene flow has its most predictable influence (strongest IBD correlations) at distances up to 9 km for spotted salamanders and up to 6 km for wood frogs. Estimated effective migration surfaces revealed contrasting patterns of high and low genetic diversity and gene flow between the two species. Population isolation, quantified as the mean IBD residuals for each population, was associated with local urbanization and less genetic diversity in both species. The influence of geographic proximity and urbanization on population connectivity was further supported by distance‐based redundancy analysis and multiple matrix regression with randomization. Resistance surface modeling found interpopulation connectivity to be influenced by developed land cover, light roads, interstates, and topography for both species, plus secondary roads and rivers for wood frogs.
Our results highlight the influence of anthropogenic landscape features within the context of natural features and broad spatial genetic patterns, in turn supporting the premise that while urbanization significantly restricts interpopulation connectivity for wood frogs and spotted salamanders, specific landscape elements have unique effects on these two sympatric species.

     