skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on March 1, 2026

Title: Inference from eDNA-based field distributions vs laboratory analysis of isolated strains: physiological performance of non-marine Antarctic biota
Abstract Environmental DNA (eDNA) is frequently used to infer distributions of microorganisms in Antarctica. Their distributions relative to environmental variables are, in turn, sometimes used to infer their physiological range (and a relationship between the two is generally assumed for conservation purposes). We sought to determine whether ecological inferences based on distributions accurately reflect tolerances of the organisms concerned, using 249 legacy non-marine samples from a latitudinal gradient between 72 and 86°S, Antarctica. A cyanobacterium, a heterotrophic bacterium, two eukaryotic algae, two fungi, and a moss were isolated into culture, and their field distributions inferred using eDNA analysis of the samples above. Tolerances of each organism with respect to environmental predictors were then inferred from the eDNA distribution and metadata using Generalised Additive Models. We then measured growth of the cultured isolates in response to a set of these predictors. Laboratory responses were then compared to inferences from the eDNA/metadata. Predictions from eDNA/metadata agreed with the results of physiological laboratory experiments for strains that were detected at high taxonomic resolution in the field samples. However, errors were never completely eliminated, and direct contradictions occurred when strains were represented at lower taxonomic resolution in the field data. We found that accurate ecological inference from eDNA studies would be best achieved via maximising both taxonomic resolution (through marker choice/read length) and ecological signal (through careful sampling design and rigorous metadata collection).  more » « less
Award ID(s):
2133685 2224760
PAR ID:
10585054
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Springer
Date Published:
Journal Name:
Polar Biology
Volume:
48
Issue:
1
ISSN:
0722-4060
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Ecosystems globally are under threat from ongoing anthropogenic environmental change. Effective conservation management requires more thorough biodiversity surveys that can reveal system‐level patterns and that can be applied rapidly across space and time. Using modern ecological models and community science, we integrate environmental DNA and Earth observations to produce a time snapshot of regional biodiversity patterns and provide multi‐scalar community‐level characterization. We collected 278 samples in spring 2017 from coastal, shrub, and lowland forest sites in California, a complex ecosystem and biodiversity hotspot. We recovered 16,118 taxonomic entries from eDNA analyses and compiled associated traditional observations and environmental data to assess how well they predicted alpha, beta, and zeta diversity. We found that local habitat classification was diagnostic of community composition and distinct communities and organisms in different kingdoms are predicted by different environmental variables. Nonetheless, gradient forest models of 915 families recovered by eDNA analysis and using BIOCLIM variables, Sentinel‐2 satellite data, human impact, and topographical features as predictors, explained 35% of the variance in community turnover. Elevation, sand percentage, and photosynthetic activities (NDVI32) were the top predictors. In addition to this signal of environmental filtering, we found a positive relationship between environmentally predicted families and their numbers of biotic interactions, suggesting environmental change could have a disproportionate effect on community networks. Together, these analyses show that coupling eDNA with environmental predictors including remote sensing data has capacity to test proposed Essential Biodiversity Variables and create new landscape biodiversity baselines that span the tree of life. 
    more » « less
  2. Abstract MotivationEnvironmental DNA (eDNA), as a rapidly expanding research field, stands to benefit from shared resources including sampling protocols, study designs, discovered sequences, and taxonomic assignments to sequences. High-quality community shareable eDNA resources rely heavily on comprehensive metadata documentation that captures the complex workflows covering field sampling, molecular biology lab work, and bioinformatic analyses. There are limited sources that provide documentation of database development on comprehensive metadata for eDNA and these workflows and no open-source software. ResultsWe present medna-metadata, an open-source, modular system that aligns with Findable, Accessible, Interoperable, and Reusable guiding principles that support scholarly data reuse and the database and application development of a standardized metadata collection structure that encapsulates critical aspects of field data collection, wet lab processing, and bioinformatic analysis. Medna-metadata is showcased with metabarcoding data from the Gulf of Maine (Polinski et al., 2019). Availability and implementationThe source code of the medna-metadata web application is hosted on GitHub (https://github.com/Maine-eDNA/medna-metadata). Medna-metadata is a docker-compose installable package. Documentation can be found at https://medna-metadata.readthedocs.io/en/latest/?badge=latest. The application is implemented in Python, PostgreSQL and PostGIS, RabbitMQ, and NGINX, with all major browsers supported. A demo can be found at https://demo.metadata.maine-edna.org/. Supplementary informationSupplementary data are available at Bioinformatics online. 
    more » « less
  3. The environmental preferences of many microbes remain undetermined. This is the case for bacterial pH preferences, which can be difficult to predict a priori despite the importance of pH as a factor structuring bacterial communities in many systems. We compiled data on bacterial distributions from five datasets spanning pH gradients in soil and freshwater systems (1470 samples), quantified the pH preferences of bacterial taxa across these datasets, and compiled genomic data from representative bacterial taxa. While taxonomic and phylogenetic information were generally poor predictors of bacterial pH preferences, we identified genes consistently associated with pH preference across environments. We then developed and validated a machine learning model to estimate bacterial pH preferences from genomic information alone, a model that could aid in the selection of microbial inoculants, improve species distribution models, or help design effective cultivation strategies. More generally, we demonstrate the value of combining biogeographic and genomic data to infer and predict the environmental preferences of diverse bacterial taxa. 
    more » « less
  4. PremiseHybrid capture with high‐throughput sequencing (Hyb‐Seq) is a powerful tool for evolutionary studies. The applicability of an Asteraceae family‐specific Hyb‐Seq probe set and the outcomes of different phylogenetic analyses are investigated here. MethodsHyb‐Seq data from 112 Asteraceae samples were organized into groups at different taxonomic levels (tribe, genus, and species). For each group, data sets of non‐paralogous loci were built and proportions of parsimony informative characters estimated. The impacts of analyzing alternative data sets, removing long branches, and type of analysis on tree resolution and inferred topologies were investigated in tribe Cichorieae. ResultsAlignments of the Asteraceae family‐wide Hyb‐Seq locus set were parsimony informative at all taxonomic levels. Levels of resolution and topologies inferred at shallower nodes differed depending on the locus data set and the type of analysis, and were affected by the presence of long branches. DiscussionThe approach used to build a Hyb‐Seq locus data set influenced resolution and topologies inferred in phylogenetic analyses. Removal of long branches improved the reliability of topological inferences in maximum likelihood analyses. The Astereaceae Hyb‐Seq probe set is applicable at multiple taxonomic depths, which demonstrates that probe sets do not necessarily need to be lineage‐specific. 
    more » « less
  5. Gorokhova, Elena (Ed.)
    Environmental DNA (eDNA) is an increasingly useful method for detecting pelagic animals in the ocean but typically requires large water volumes to sample diverse assemblages. Ship-based pelagic sampling programs that could implement eDNA methods generally have restrictive water budgets. Studies that quantify how eDNA methods perform on low water volumes in the ocean are limited, especially in deep-sea habitats with low animal biomass and poorly described species assemblages. Using 12S rRNA and COI gene primers, we quantified assemblages comprised of micronekton, coastal forage fishes, and zooplankton from low volume eDNA seawater samples (n = 436, 380–1800 mL) collected at depths of 0–2200 m in the southern California Current. We compared diversity in eDNA samples to concurrently collected pelagic trawl samples (n = 27), detecting a higher diversity of vertebrate and invertebrate groups in the eDNA samples. Differences in assemblage composition could be explained by variability in size-selectivity among methods and DNA primer suitability across taxonomic groups. The number of reads and amplicon sequences variants (ASVs) did not vary substantially among shallow (<200 m) and deep samples (>600 m), but the proportion of invertebrate ASVs that could be assigned a species-level identification decreased with sampling depth. Using hierarchical clustering, we resolved horizontal and vertical variability in marine animal assemblages from samples characterized by a relatively low diversity of ecologically important species. Low volume eDNA samples will quantify greater taxonomic diversity as reference libraries, especially for deep-dwelling invertebrate species, continue to expand. 
    more » « less