skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Characterizing organisms from three domains of life with universal primers from throughout the global ocean
Abstract We introduce the Global rRNA Universal Metabarcoding Plankton database (GRUMP), which consists of 1194 samples that were collected from 2003–2020 and cover extensive latitudinal and longitudinal transects, as well as depth profiles in all major ocean basins. DNA from unfractionated (>0.2 µm) seawater samples was amplified using the 515Y/926 R universal three-domain rRNA gene primers, simultaneously quantifying the relative abundance of amplicon sequencing variants (ASVs) from bacteria, archaea, eukaryotic nuclear 18S, and eukaryotic plastid 16S. Thus, the ratio between taxa in one sample is directly comparable to the ratio in any other GRUMP sample, regardless of gene copy number differences. This obviates a problem in prior global studies that used size-fractionation and different rRNA gene primers for bacteria, archaea, and eukaryotes, precluding comparisons across size fractions or domains. On average, bacteria contributed 71%, eukaryotes 19%, and archaea 8% to rRNA gene abundance, though eukaryotes contributed 32% at latitudes >40°. GRUMP is publicly available on the Simons Collaborative Marine Atlas Project (CMAP), promoting the global comparison of marine microbial dynamics.  more » « less
Award ID(s):
2125142
PAR ID:
10611804
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more » ; ; ; ; ; ; « less
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Data
Volume:
12
Issue:
1
ISSN:
2052-4463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Gilbert, Jack A. (Ed.)
    ABSTRACT Small subunit rRNA (SSU rRNA) amplicon sequencing can quantitatively and comprehensively profile natural microbiomes, representing a critically important tool for studying diverse global ecosystems. However, results will only be accurate if PCR primers perfectly match the rRNA of all organisms present. To evaluate how well marine microorganisms across all 3 domains are detected by this method, we compared commonly used primers with >300 million rRNA gene sequences retrieved from globally distributed marine metagenomes. The best-performing primers compared to 16S rRNA of bacteria and archaea were 515Y/926R and 515Y/806RB, which perfectly matched over 96% of all sequences. Considering cyanobacterial and chloroplast 16S rRNA, 515Y/926R had the highest coverage (99%), making this set ideal for quantifying marine primary producers. For eukaryotic 18S rRNA sequences, 515Y/926R also performed best (88%), followed by V4R/V4RB (18S rRNA specific; 82%)—demonstrating that the 515Y/926R combination performs best overall for all 3 domains. Using Atlantic and Pacific Ocean samples, we demonstrate high correspondence between 515Y/926R amplicon abundances (generated for this study) and metagenomic 16S rRNA (median R 2 = 0.98, n  = 272), indicating amplicons can produce equally accurate community composition data compared with shotgun metagenomics. Our analysis also revealed that expected performance of all primer sets could be improved with minor modifications, pointing toward a nearly completely universal primer set that could accurately quantify biogeochemically important taxa in ecosystems ranging from the deep sea to the surface. In addition, our reproducible bioinformatic workflow can guide microbiome researchers studying different ecosystems or human health to similarly improve existing primers and generate more accurate quantitative amplicon data. IMPORTANCE PCR amplification and sequencing of marker genes is a low-cost technique for monitoring prokaryotic and eukaryotic microbial communities across space and time but will work optimally only if environmental organisms match PCR primer sequences exactly. In this study, we evaluated how well primers match globally distributed short-read oceanic metagenomes. Our results demonstrate that primer sets vary widely in performance, and that at least for marine systems, rRNA amplicon data from some primers lack significant biases compared to metagenomes. We also show that it is theoretically possible to create a nearly universal primer set for diverse saline environments by defining a specific mixture of a few dozen oligonucleotides, and present a software pipeline that can guide rational design of primers for any environment with available meta’omic data. 
    more » « less
  2. Summary Universal primers for SSU rRNA genes allow profiling of natural communities by simultaneously amplifying templates from Bacteria, Archaea, and Eukaryota in a single PCR reaction. Despite the potential to show relative abundance for all rRNA genes, universal primers are rarely used, due to various concerns including amplicon length variation and its effect on bioinformatic pipelines. We thus developed 16S and 18S rRNA mock communities and a bioinformatic pipeline to validate this approach. Using these mocks, we show that universal primers (515Y/926R) outperformed eukaryote‐specific V4 primers in observed versus expected abundance correlations (slope = 0.88 vs. 0.67–0.79), and mock community members with single mismatches to the primer were strongly underestimated (threefold to eightfold). Using field samples, both primers yielded similar 18S beta‐diversity patterns (Mantel test,p < 0.001) but differences in relative proportions of many rarer taxa. To test for length biases, we mixed mock communities (16S + 18S) before PCR and found a twofold underestimation of 18S sequences due to sequencing bias. Correcting for the twofold underestimation, we estimate that, in Southern California field samples (1.2–80 μm), there were averages of 35% 18S, 28% chloroplast 16S, and 37% prokaryote 16S rRNA genes. These data demonstrate the potential for universal primers to generate comprehensive microbiome profiles. 
    more » « less
  3. Abstract Community dynamics are central in microbial ecology, yet we lack studies comparing diversity patterns among marine protists and prokaryotes over depth and multiple years. Here, we characterized microbes at the San-Pedro Ocean Time series (2005–2018), using SSU rRNA gene sequencing from two size fractions (0.2–1 and 1–80 μm), with a universal primer set that amplifies from both prokaryotes and eukaryotes, allowing direct comparisons of diversity patterns in a single set of analyses. The 16S + 18S rRNA gene composition in the small size fraction was mostly prokaryotic (>92%) as expected, but the large size fraction unexpectedly contained 46–93% prokaryotic 16S rRNA genes. Prokaryotes and protists showed opposite vertical diversity patterns; prokaryotic diversity peaked at mid-depth, protistan diversity at the surface. Temporal beta-diversity patterns indicated prokaryote communities were much more stable than protists. Although the prokaryotic communities changed monthly, the average community stayed remarkably steady over 14 years, showing high resilience. Additionally, particle-associated prokaryotes were more diverse than smaller free-living ones, especially at deeper depths, contributed unexpectedly by abundant and diverse SAR11 clade II. Eukaryotic diversity was strongly correlated with the diversity of particle-associated prokaryotes but not free-living ones, reflecting that physical associations result in the strongest interactions, including symbioses, parasitism, and decomposer relationships. 
    more » « less
  4. The microbiomes of tropical corals are actively studied using 16S rRNA gene amplicons to understand microbial roles in coral health, metabolism, and disease resistance. However, due to the prokaryotic origins of mitochondria, primers targeting bacterial and archaeal 16S rRNA genes may also amplify homologous 12S mitochondrial rRNA genes from the host coral, associated microbial eukaryotes, and encrusting organisms. Standard microbial bioinformatics pipelines attempt to identify and remove these sequences by comparing them to reference taxonomies. However, commonly used tools have severely under-annotated mitochondrial sequences in 1440 coral microbiomes from the Global Coral Microbiome Project, preventing annotation of over 95% of reads in some samples. This issue persists when using Greengenes or SILVA prokaryotic reference taxonomies, and in other hosts, including 16S studies of vertebrates, and of marine sponges. Worse, mitochondrial under-annotation varies between coral families and across coral compartments, biasing comparisons of  - and  -diversity. By supplementing existing reference taxonomies with over 3000 animal mitochondrial rRNA gene sequences, we resolved roughly 97% of unique unclassified sequences as mitochondrial. These additional sequences did not cause a false elevation in mitochondrial annotations in mock communities with known compositions. We recommend using these extended taxonomies for coral microbiome analysis and whenever eukaryotic contamination may be a concern. 
    more » « less
  5. Tropical environments with unique abiotic and biotic factors—such as salt ponds, mangroves, and coral reefs—are often in close proximity. The heterogeneity of these environments is reflected in community shifts over short distances, resulting in high biodiversity. While phytoplankton assemblages physically associated with corals, particularly their symbionts, are well studied, less is known about phytoplankton diversity across tropical aquatic environments. We assess shifts in phytoplankton community composition along inshore to offshore gradients by sequencing and analyzing 16S rRNA gene amplicons using primers targeting the V1-V2 region that capture plastids from eukaryotic phytoplankton and cyanobacteria, as well as heterotrophic bacteria. Microbial alpha diversity computed from 16S V1-V2 amplicon sequence variant (ASV) data from 282 samples collected in and around Curaçao, in the Southern Caribbean Sea, varied more within the dynamic salt ponds, salterns, and mangroves, compared to the seemingly stable above-reef, off-reef, and open sea environments. Among eukaryotic phytoplankton, stramenopiles often exhibited the highest relative abundances in mangrove, above-reef, off-reef, and open sea environments, where cyanobacteria also showed high relative abundances. Within stramenopiles, diatom amplicons dominated in salt ponds and mangroves, while dictyochophytes and pelagophytes prevailed above reefs and offshore. Green algae and cryptophytes were also present, and the former exhibited transitions following the gradient from inland to offshore. Chlorophytes and prasinophyte Class IV dominated in salt ponds, while prasinophyte Class II, including Micromonas commoda and Ostreococcus Clade OII, had the highest relative abundances of green algae in mangroves, above-reef, off-reef, and the open sea. To improve Class II prasinophyte classification, we sequenced 18S rRNA gene amplicons from the V4 region in 41 samples which were used to interrelate plastid-based results with information on uncultured prasinophyte species from prior 18S rRNA gene-based studies. This highlighted the presence of newly described Ostreococcus bengalensis and two Micromonas candidate species. Network analyses identified co-occurrence patterns between individual phytoplankton groups, including cyanobacteria, and heterotrophic bacteria. Our study reveals multiple uncultured and novel lineages within green algae and dictyochophytes in tropical marine habitats. Collectively, the algal diversity patterns and potential co-occurrence relationships observed in connection to physicochemical and spatial influences help provide a baseline against which future change can be assessed. 
    more » « less