Title: Nuclear eDNA estimates population allele frequencies and abundance in experimental mesocosms and field samples
Abstract

Advances in environmental DNA (eDNA) methodologies have led to improvements in the ability to detect species and communities in aquatic environments, yet the majority of studies emphasize biological diversity at the species level by targeting variable sites within the mitochondrial genome. Here, we demonstrate that eDNA approaches also have the capacity to detect intraspecific diversity in the nuclear genome, allowing for assessments of population‐level allele frequencies and estimates of the number of genetic contributors in an eDNA sample. Using a panel of microsatellite loci developed for the round goby (Neogobius melanostomus), we tested the similarity between eDNA‐based and individual tissue‐based estimates of allele frequencies from experimental mesocosms and in a field‐based trial. Subsequently, we used a likelihood‐based DNA mixture framework to estimate the number of unique genetic contributors in eDNA samples and in simulated mixtures of alleles. In both mesocosm and field samples, allele frequencies from eDNA were highly correlated with allele frequencies from genotyped round goby tissue samples, indicating nuclear markers can be reliably amplified from water samples. DNA mixture analyses were able to estimate the number of genetic contributors from mesocosm eDNA samples and simulated mixtures of DNA from up to 58 individuals, with the degree of positive or negative bias dependent on the filtering scheme of low‐frequency alleles. With this study we document the application of eDNA and multiple amplicon‐based methods to obtain intraspecific nuclear genetic information and estimate the absolute abundance of a species in eDNA samples. With proper validation, this approach has the potential to advance noninvasive survey methods to characterize populations and detect population‐level genetic diversity.
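The comparison between eDNA-based and tissue-based allele frequencies described above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes that the eDNA frequency of an allele can be approximated by its share of sequencing reads at a locus, that tissue genotypes are diploid, and that similarity is summarized with a Pearson correlation. All function names, allele sizes, and read counts below are hypothetical.

```python
from collections import Counter
from math import sqrt


def tissue_allele_freqs(genotypes):
    """Allele frequencies from diploid genotypes given as (allele, allele) tuples."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def edna_allele_freqs(read_counts):
    """Assumption: an allele's eDNA frequency is its share of reads at the locus."""
    total = sum(read_counts.values())
    return {allele: n / total for allele, n in read_counts.items()}


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def compare_freqs(tissue, edna):
    """Correlate frequencies over the union of alleles; missing alleles count as 0."""
    alleles = sorted(set(tissue) | set(edna))
    x = [tissue.get(a, 0.0) for a in alleles]
    y = [edna.get(a, 0.0) for a in alleles]
    return pearson_r(x, y)


# Hypothetical single-locus data: allele sizes in base pairs.
genotypes = [(120, 124), (120, 120), (124, 128), (120, 128)]
reads = {120: 510, 124: 240, 128: 250}

tissue = tissue_allele_freqs(genotypes)
edna = edna_allele_freqs(reads)
r = compare_freqs(tissue, edna)
print(f"Pearson r between tissue and eDNA allele frequencies: {r:.3f}")
```

In practice, read proportions can be distorted by PCR artifacts such as microsatellite stutter, so a real analysis would likely filter or correct low-frequency alleles before computing any correlation.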

 
NSF-PAR ID: 10405001
Publisher / Repository: Wiley-Blackwell
Journal Name: Molecular Ecology
Volume: 30
Issue: 3
ISSN: 0962-1083
Page Range / eLocation ID: p. 685-697
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Abstract

    Wild specimens are often collected in challenging field conditions, where samples may be contaminated with the DNA of conspecific individuals. This contamination can result in false genotype calls, which are difficult to detect, but may also cause inaccurate estimates of heterozygosity, allele frequencies and genetic differentiation. Marine broadcast spawners are especially problematic, because population genetic differentiation is low and samples are often collected in bulk and sometimes from active spawning aggregations. Here, we used contaminated and clean Pacific herring (Clupea pallasi) samples to test (a) the efficacy of bleach decontamination, (b) the effect of decontamination on RAD genotypes and (c) the consequences of contaminated samples on population genetic analyses. We collected fin tissue samples from actively spawning (and thus contaminated) wild herring and nonspawning (uncontaminated) herring. Samples were soaked for 10 min in bleach or left untreated, and extracted DNA was used to prepare DNA libraries using a restriction site‐associated DNA (RAD) approach. Our results demonstrate that intraspecific DNA contamination affects patterns of individual and population variability, causes an excess of heterozygotes and biases estimates of population structure. Bleach decontamination was effective at removing intraspecific DNA contamination and compatible with RAD sequencing, producing high‐quality sequences, reproducible genotypes and low levels of missing data. Although sperm contamination may be specific to broadcast spawners, intraspecific contamination of samples may be common and difficult to detect from high‐throughput sequencing data and can impact downstream analyses.

     
  2. Abstract

    A recent focus in community ecology has been on how within‐species variability shapes interspecific niche partitioning. Primate color vision offers a rich system in which to explore this issue. Most neotropical primates exhibit intraspecific variation in color vision due to allelic variation at the middle‐to‐long‐wavelength opsin gene on the X chromosome. Studies of opsin polymorphisms have typically sampled primates from different sites, limiting the ability to relate this genetic diversity to niche partitioning. We surveyed genetic variation in color vision of five primate species, belonging to all three families of the primate infraorder Platyrrhini, found in the Yasuní Biosphere Reserve in Ecuador. The frugivorous spider monkeys and woolly monkeys (Ateles belzebuth and Lagothrix lagotricha poeppigii, family Atelidae) each had two opsin alleles, and more than 75% of individuals carried the longest‐wavelength (553–556 nm) allele. Among the other species, Saimiri sciureus macrodon (family Cebidae) and Pithecia aequatorialis (family Pitheciidae) had three alleles, while Plecturocebus discolor (family Pitheciidae) had four alleles—the largest number yet identified in a wild population of titi monkeys. For all three non‐atelid species, the middle‐wavelength (545 nm) allele was the most common. Overall, we identified genetic evidence of fourteen different visual phenotypes—seven types of dichromats and seven trichromats—among the five sympatric taxa. The differences we found suggest that interspecific competition among primates may influence intraspecific frequencies of opsin alleles. The diversity we describe invites detailed study of foraging behavior of different vision phenotypes to learn how they may contribute to niche partitioning.

     
  3. Abstract

    Environmental DNA (eDNA) sampling—the detection of genetic material in the environment to infer species presence—has rapidly grown as a tool for sampling aquatic animal communities. A potentially powerful feature of environmental sampling is that all taxa within the habitat shed DNA and so may be detectable, creating opportunity for whole‐community assessments. However, animal DNA in the environment tends to be comparatively rare, making it necessary to enrich for genetic targets from focal taxa prior to sequencing. Current metabarcoding approaches for enrichment rely on bulk amplification using conserved primer annealing sites, which can result in skewed relative sequence abundance and failure to detect some taxa because of PCR bias. Here, we test capture enrichment via hybridization as an alternative strategy for target enrichment using a series of experiments on environmental samples and laboratory‐generated, known‐composition DNA mixtures. Capture enrichment resulted in detecting multiple species in both kinds of samples, and postcapture relative sequence abundance accurately reflected initial relative template abundance. However, further optimization is needed to permit reliable species detection at the very low‐DNA quantities typical of environmental samples (<0.1 ng DNA). We estimate that our capture protocols are comparable to, but less sensitive than, current PCR‐based eDNA analyses.

     
  4. INTRODUCTION: A major challenge in genomics is discerning which bases among billions alter organismal phenotypes and affect health and disease risk. Evidence of past selective pressure on a base, whether highly conserved or fast evolving, is a marker of functional importance. Bases that are unchanged in all mammals may shape phenotypes that are essential for organismal health. Bases that are evolving quickly in some species, or changed only in species that share an adaptive trait, may shape phenotypes that support survival in specific niches. Identifying bases associated with exceptional capacity for cellular recovery, such as in species that hibernate, could inform therapeutic discovery.

    RATIONALE: The power and resolution of evolutionary analyses scale with the number and diversity of species compared. By analyzing genomes for hundreds of placental mammals, we can detect which individual bases in the genome are exceptionally conserved (constrained) and likely to be functionally important in both coding and noncoding regions. By including species that represent all orders of placental mammals and aligning genomes using a method that does not require designating humans as the reference species, we explore unusual traits in other species.

    RESULTS: Zoonomia’s mammalian comparative genomics resources are the most comprehensive and statistically well-powered produced to date, with a protein-coding alignment of 427 mammals and a whole-genome alignment of 240 placental mammals representing all orders. We estimate that at least 10.7% of the human genome is evolutionarily conserved relative to neutrally evolving repeats and identify about 101 million significantly constrained single bases (false discovery rate < 0.05). We cataloged 4552 ultraconserved elements at least 20 bases long that are identical in more than 98% of the 240 placental mammals. Many constrained bases have no known function, illustrating the potential for discovery using evolutionary measures. Eighty percent are outside protein-coding exons, and half have no functional annotations in the Encyclopedia of DNA Elements (ENCODE) resource. Constrained bases tend to vary less within human populations, which is consistent with purifying selection. Species threatened with extinction have few substitutions at constrained sites, possibly because severely deleterious alleles have been purged from their small populations. By pairing Zoonomia’s genomic resources with phenotype annotations, we find genomic elements associated with phenotypes that differ between species, including olfaction, hibernation, brain size, and vocal learning. We associate genomic traits, such as the number of olfactory receptor genes, with physical phenotypes, such as the number of olfactory turbinals. By comparing hibernators and nonhibernators, we implicate genes involved in mitochondrial disorders, protection against heat stress, and longevity in this physiologically intriguing phenotype. Using a machine learning–based approach that predicts tissue-specific cis-regulatory activity in hundreds of species using data from just a few, we associate changes in noncoding sequence with traits for which humans are exceptional: brain size and vocal learning.

    CONCLUSION: Large-scale comparative genomics opens new opportunities to explore how genomes evolved as mammals adapted to a wide range of ecological niches and to discover what is shared across species and what is distinctively human. High-quality data for consistently defined phenotypes are necessary to realize this potential. Through partnerships with researchers in other fields, comparative genomics can address questions in human health and basic biology while guiding efforts to protect the biodiversity that is essential to these discoveries.

    Figure caption: Comparing genomes from 240 species to explore the evolution of placental mammals. Our new phylogeny (black lines) has alternating gray and white shading, which distinguishes mammalian orders (labeled around the perimeter). Rings around the phylogeny annotate species phenotypes. Seven species with diverse traits are illustrated, with black lines marking their branch in the phylogeny. Sequence conservation across species is described at the top left. Image credit: K. Morrill.
  5. Copy number variants (CNVs) are regions of the genome that vary in integer copy number. CNVs, which comprise both amplifications and deletions of DNA sequence, have been identified across all domains of life, from bacteria and archaea to plants and animals. CNVs are an important source of genetic diversity, and can drive rapid adaptive evolution and progression of heritable and somatic human diseases, such as cancer. However, despite their evolutionary importance and clinical relevance, CNVs remain understudied compared to single-nucleotide variants (SNVs). This is a consequence of the inherent difficulties in detecting CNVs at low-to-intermediate frequencies in heterogeneous populations of cells. Here, we discuss molecular methods used to detect CNVs, the limitations associated with using these techniques, and the application of new and emerging technologies that present solutions to these challenges. The goal of this short review and perspective is to highlight aspects of CNV biology that are understudied and define avenues for further research that address specific gaps in our knowledge of these complex alleles. We describe our recently developed method for CNV detection in which a fluorescent gene functions as a single-cell CNV reporter and present key findings from our evolution experiments in Saccharomyces cerevisiae. Using a CNV reporter, we found that CNVs are generated at a high rate and undergo selection with predictable dynamics across independently evolving replicate populations. Many CNVs appear to be generated through DNA replication-based processes that are mediated by the presence of short, interrupted, inverted-repeat sequences. Our results have important implications for the role of CNVs in evolutionary processes and the molecular mechanisms that underlie CNV formation. We discuss the possible extension of our method to other applications, including tracking the dynamics of CNVs in models of human tumors. 