Title: ONeSAMP 3.0: estimation of effective population size via single nucleotide polymorphism data from one population
Abstract The genetic effective size (Ne) is arguably one of the most important characteristics of a population as it impacts the rate of loss of genetic diversity. Methods that estimate Ne are important in population and conservation genetic studies as they quantify the risk of a population being inbred or lacking genetic diversity. Yet there are very few methods that can estimate the Ne from data from a single population and without extensive information about the genetics of the population, such as a linkage map, or a reference genome of the species of interest. We present ONeSAMP 3.0, an algorithm for estimating Ne from single nucleotide polymorphism data collected from a single population sample using approximate Bayesian computation and local linear regression. We demonstrate the utility of this approach using simulated Wright–Fisher populations, and empirical data from five endangered Channel Island fox (Urocyon littoralis) populations to evaluate the performance of ONeSAMP 3.0 compared to a commonly used Ne estimator. Our results show that ONeSAMP 3.0 is broadly applicable to natural populations and is flexible enough that future versions could easily include summary statistics appropriate for a suite of biological and sampling conditions. ONeSAMP 3.0 is publicly available under the GNU General Public License at https://github.com/AaronHong1024/ONeSAMP_3.
Award ID(s):
2013998
PAR ID:
10533833
Publisher / Repository:
Oxford University Press
Journal Name:
G3: Genes, Genomes, Genetics
ISSN:
2160-1836
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. ABSTRACT Although patterns of population genomic variation are well‐studied in animals, there remains room for studies that focus on non‐model taxa with unique biologies. Here we characterise and attempt to explain such patterns in mygalomorph spiders, which are generally sedentary, often occur as spatially clustered demes and show remarkable longevity. Genome‐wide single nucleotide polymorphism (SNP) data were collected for 500 individuals across a phylogenetically representative sample of taxa. We inferred genetic populations within focal taxa using a phylogenetically informed clustering approach, and characterised patterns of diversity and differentiation within‐ and among these genetic populations, respectively. Using phylogenetic comparative methods we asked whether geographical range sizes and ecomorphological variables (behavioural niche and body size) significantly explain patterns of diversity and differentiation. Specifically, we predicted higher genetic diversity in genetic populations with larger geographical ranges, and in small‐bodied taxa. We also predicted greater genetic differentiation in small‐bodied taxa, and in burrowing taxa. We recovered several significant predictors of genetic diversity, but not genetic differentiation. However, we found generally high differentiation across genetic populations for all focal taxa, and a consistent signal for isolation‐by‐distance irrespective of behavioural niche or body size. We hypothesise that high population genetic structuring, likely reflecting combined dispersal limitation and microhabitat specificity, is a shared trait for all mygalomorphs. Few studies have found ubiquitous genetic structuring for an entire ancient and species‐rich animal clade. 
  2. Abstract Island biotas provide unparalleled opportunities to examine evolutionary processes. Founder effects and bottlenecks, e.g., typically decrease genetic diversity in island populations, while selection for reduced dispersal can increase population structure. Given that support for these generalities mostly comes from single-species analyses, assemblage-level comparisons are needed to clarify how (i) colonization affects the gene pools of interacting insular organisms, and (ii) patterns of genetic differentiation vary within assemblages of organisms. Here, we use genome-wide sequence data from ultraconserved elements (UCEs) to compare the genetic diversity and population structure of mainland and island populations of nine ant species in coastal southern California. As expected, island populations (from Santa Cruz Island) had lower expected heterozygosity and Watterson’s theta compared to mainland populations (from the Lompoc Valley). Island populations, however, exhibited smaller genetic distances among samples, indicating less population subdivision. Within the focal assemblage, pairwise Fst values revealed pronounced interspecific variation in mainland-island differentiation, which increases with gyne body size. Our results reveal population differences across an assemblage of interacting species and illuminate general patterns of insularization in ants. Compared to single-species studies, our analysis of nine conspecific population pairs from the same island-mainland system offers a powerful approach to studying fundamental evolutionary processes. 
  3. Abstract Camelina (Camelina sativa), an allohexaploid species, is an emerging aviation biofuel crop that has been the focus of resurgent interest in recent decades. To guide future breeding and crop improvement efforts, the community requires a deeper comprehension of subgenome dominance, often noted in allopolyploid species, alongside an understanding of the genetic diversity and population structure of material present within breeding programs. We conducted population genetic analyses of a C. sativa diversity panel, leveraging a new genome, to estimate nucleotide diversity and population structure, and analyzed for patterns of subgenome expression dominance among different organs. Our analyses confirm that C. sativa has relatively low genetic diversity and show that the SG3 subgenome has substantially lower genetic diversity compared to the other two subgenomes. Despite the low genetic diversity, our analyses identified 13 distinct subpopulations including two distinct wild populations and others putatively representing founders in existing breeding populations. When analyzing for subgenome composition of long non-coding RNAs, which are known to play important roles in (a)biotic stress tolerance, we found that the SG3 subgenome contained significantly more lincRNAs compared to other subgenomes. Similarly, transcriptome analyses revealed that expression dominance of SG3 is not as strong as previously reported and may not be universal across all organ types. From a global analysis, SG3 was significantly more highly expressed only in flower, flower bud, and fruit organs, which is an important discovery given that the crop yield is associated with these organs. Collectively, these results will be valuable for guiding future breeding efforts in camelina.
  4. Abstract Phylodynamics is an area of population genetics that uses genetic sequence data to estimate past population dynamics. Modern state‐of‐the‐art Bayesian nonparametric methods for recovering population size trajectories of unknown form use either change‐point models or Gaussian process priors. Change‐point models suffer from computational issues when the number of change‐points is unknown and needs to be estimated. Gaussian process‐based methods lack local adaptivity and cannot accurately recover trajectories that exhibit features such as abrupt changes in trend or varying levels of smoothness. We propose a novel, locally adaptive approach to Bayesian nonparametric phylodynamic inference that has the flexibility to accommodate a large class of functional behaviors. Local adaptivity results from modeling the log‐transformed effective population size a priori as a horseshoe Markov random field, a recently proposed statistical model that blends together the best properties of the change‐point and Gaussian process modeling paradigms. We use simulated data to assess model performance, and find that our proposed method results in reduced bias and increased precision when compared to contemporary methods. We also use our models to reconstruct past changes in genetic diversity of human hepatitis C virus in Egypt and to estimate population size changes of ancient and modern steppe bison. These analyses show that our new method captures features of the population size trajectories that were missed by the state‐of‐the‐art methods. 
  5. Abstract The integration of ecological niche modelling into phylogeographic analyses has allowed for the identification and testing of potential refugia under a hypothesis‐based framework, where the expected patterns of higher genetic diversity in refugial populations and evidence of range expansion of nonrefugial populations are corroborated with empirical data. In this study, we focus on a montane‐restricted cryophilic harvestman, Sclerobunus robustus, distributed throughout the heterogeneous Southern Rocky Mountains and Intermontane Plateau of southwestern North America. We identified hypothetical refugia using ecological niche models (ENMs) across three time periods, corroborated these refugia with population genetic methods using double‐digest RAD‐seq data and conducted population‐level phylogenetic and divergence dating analyses. ENMs identify two large temporally persistent regions in the mid‐latitude highlands. Genetic patterns support these two hypothesized refugia with higher genetic diversity within refugial populations and evidence for range expansion in populations found outside hypothesized refugia. Phylogenetic analyses identify five to six genetically divergent, geographically cohesive clades of S. robustus. Divergence dating analyses suggest that these separate refugia date to the Pliocene and that divergence between clades pre‐dates the late Pleistocene glacial cycles, while diversification within clades was likely driven by these cycles. Population genetic analyses reveal effects of both isolation by distance (IBD) and isolation by environment (IBE), with IBD more important in the continuous mountainous portion of the distribution, while IBE was stronger in the populations inhabiting the isolated sky islands of the south. Using model‐based coalescent approaches, we find support for postdivergence migration between clades from separate refugia.