Title: Global biogeography of N2-fixing microbes: nifH amplicon database and analytics workflow
Abstract. Marine dinitrogen (N2) fixation is a globally significant biogeochemical process carried out by a specialized group of prokaryotes (diazotrophs), yet our understanding of their ecology is constantly evolving. Although marine N2 fixation is often ascribed to cyanobacterial diazotrophs, indirect evidence suggests that non-cyanobacterial diazotrophs (NCDs) might also be important. One widely used approach for understanding diazotroph diversity and biogeography is polymerase chain reaction (PCR) amplification of a portion of the nifH gene, which encodes a structural component of the N2-fixing enzyme complex, nitrogenase. An array of bioinformatic tools exists to process nifH amplicon data; however, the lack of standardized practices has hindered cross-study comparisons. This has led to a missed opportunity to more thoroughly assess diazotroph diversity and biogeography, as well as their potential contributions to the marine N cycle. To address these knowledge gaps, a bioinformatic workflow was designed that standardizes the processing of nifH amplicon datasets originating from high-throughput sequencing (HTS). Multiple datasets are efficiently and consistently processed with a specialized DADA2 pipeline to identify amplicon sequence variants (ASVs). A series of customizable post-pipeline stages then detect and discard spurious nifH sequences and annotate the subsequent quality-filtered nifH ASVs using multiple reference databases and classification approaches. This newly developed workflow was used to reprocess nearly all publicly available nifH amplicon HTS datasets from marine studies and to generate a comprehensive nifH ASV database containing 9383 ASVs aggregated from 21 studies that represent the diazotrophic populations in the global ocean. For each sample, the database includes physical and chemical metadata obtained from the Simons Collaborative Marine Atlas Project (CMAP). 
Here we demonstrate the utility of this database for revealing global biogeographical patterns of prominent diazotroph groups and highlight the influence of sea surface temperature. The workflow and nifH ASV database provide a robust framework for studying marine N2 fixation and diazotrophic diversity captured by nifH amplicon HTS. Future datasets that target understudied ocean regions can be added easily, and users can tune parameters and studies included for their specific focus. The workflow and database are available, respectively, on GitHub (https://github.com/jdmagasin/nifH-ASV-workflow, last access: 21 January 2025; Morando et al., 2024c) and Figshare (https://doi.org/10.6084/m9.figshare.23795943.v2; Morando et al., 2024b).
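The aggregation step described in the abstract (ASVs from many studies combined into one database) can be illustrated with a minimal sketch. This is not code from the published workflow, which relies on a DADA2 pipeline; the nested-dictionary layout, the function names `merge_asv_tables` and `relative_abundance`, and the toy sequences are all hypothetical and chosen only for illustration. The one idea it borrows from the text is that an ASV is identified by its exact sequence, so identical sequences from different studies collapse into a single entry.

```python
from collections import defaultdict


def merge_asv_tables(study_tables):
    """Combine per-study count tables into one table keyed by ASV sequence.

    study_tables (hypothetical layout):
        {study_id: {sample_id: {asv_sequence: read_count}}}
    Returns {asv_sequence: {(study_id, sample_id): read_count}}.
    """
    merged = defaultdict(dict)
    for study_id, samples in study_tables.items():
        for sample_id, counts in samples.items():
            for seq, n in counts.items():
                if n > 0:  # skip ASVs absent from this sample
                    merged[seq][(study_id, sample_id)] = n
    return dict(merged)


def relative_abundance(merged):
    """Convert read counts to per-sample fractions of total reads."""
    totals = defaultdict(int)
    for per_sample in merged.values():
        for key, n in per_sample.items():
            totals[key] += n
    return {
        seq: {key: n / totals[key] for key, n in per_sample.items()}
        for seq, per_sample in merged.items()
    }


# Toy input: two invented studies sharing one ASV sequence ("ATGC").
toy = {
    "studyA": {"s1": {"ATGC": 90, "ATGG": 10}},
    "studyB": {"s2": {"ATGC": 5, "TTGC": 15}},
}
merged = merge_asv_tables(toy)
rel = relative_abundance(merged)
```

Keying on the sequence rather than on a per-study label is what makes cross-study comparison possible in this sketch: the shared ASV ends up with one row that spans samples from both studies.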
Award ID(s):
2023498
PAR ID:
10589086
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
ESSD
Date Published:
Journal Name:
Earth System Science Data
Volume:
17
Issue:
2
ISSN:
1866-3516
Page Range / eLocation ID:
393 to 422
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract. Marine diazotrophs convert dinitrogen (N2) gas into bioavailable nitrogen (N), supporting life in the global ocean. In 2012, the first version of the global oceanic diazotroph database (version 1) was published. Here, we present an updated version of the database (version 2), significantly increasing the number of in situ diazotrophic measurements from 13 565 to 55 286. Data points for N2 fixation rates, diazotrophic cell abundance, and nifH gene copy abundance have increased by 184 %, 86 %, and 809 %, respectively. Version 2 includes two new data sheets for the nifH gene copy abundance of non-cyanobacterial diazotrophs and cell-specific N2 fixation rates. The measurements of N2 fixation rates approximately follow a log-normal distribution in both version 1 and version 2. However, version 2 considerably extends both the left and right tails of the distribution. Consequently, when estimating global oceanic N2 fixation rates using the geometric means of different ocean basins, version 1 and version 2 yield similar rates (43–57 versus 45–63 Tg N yr−1; ranges based on one geometric standard error). In contrast, when using arithmetic means, version 2 suggests a significantly higher rate of 223±30 Tg N yr−1 (mean ± standard error; same hereafter) compared to version 1 (74±7 Tg N yr−1). Specifically, substantial rate increases are estimated for the South Pacific Ocean (88±23 versus 20±2 Tg N yr−1), primarily driven by measurements in the southwestern subtropics, and for the North Atlantic Ocean (40±9 versus 10±2 Tg N yr−1). Moreover, version 2 estimates the N2 fixation rate in the Indian Ocean to be 35±14 Tg N yr−1, which could not be estimated using version 1 due to limited data availability. Furthermore, a comparison of N2 fixation rates obtained through different measurement methods at the same months, locations, and depths reveals that the conventional 15N2 bubble method yields lower rates in 69 % of cases compared to the new 15N2 dissolution method. 
This updated version of the database can facilitate future studies in marine ecology and biogeochemistry. The database is stored at the Figshare repository (https://doi.org/10.6084/m9.figshare.21677687; Shao et al., 2022). 
  2. Abstract Exploring the diversity of diazotrophs is key to understanding their role in supplying fixed nitrogen that supports marine productivity. A nested PCR assay using the universal primer set nifH1-nifH4, which targets the nitrogenase (nifH) gene, is a widely used approach for studying marine diazotrophs by amplicon sequencing. Metagenomics, direct sequencing of DNA without PCR, has provided complementary views of the diversity of marine diazotrophs. A significant fraction of the metagenome-derived nifH sequences (e.g. Planctomycete- and Proteobacteria-affiliated) were reported to have nucleotide mismatches with the nifH1-nifH4 primers, leading to the suggestion that nifH amplicon sequencing does not detect specific diazotrophic taxa and underrepresents diazotroph diversity. Here, we report that these mismatches are mostly located in a single-base at the 5′-end of the nifH4 primer, which does not impact detection of the nifH genes. This is demonstrated by the presence of nifH genes that contain the nucleotide mismatches in a recent compilation of global ocean nifH amplicon datasets, with high relative abundances detected in a variety of samples. While the metagenome- and metatranscriptome-derived nifH genes accounted for 4.4% of the total amplicon sequence variants from the global ocean nifH amplicon database, the corresponding amplicon sequence variants can have high relative abundances (accounting for 47% of the reads in the database). These analyses underscore that nifH amplicon sequencing using the nifH1-nifH4 primers is an important tool for studying diversity of marine diazotrophs, particularly as a complement to metagenomics which can provide taxonomic and metabolic information for some dominant groups. 
  3. Abstract Biological dinitrogen (N2) fixation supplies nitrogen to the oceans, supporting primary productivity, and is carried out by some bacteria and archaea referred to as diazotrophs. Cyanobacteria are conventionally considered to be the major contributors to marine N2 fixation, but non-cyanobacterial diazotrophs (NCDs) have been shown to be distributed throughout ocean ecosystems. However, the biogeochemical significance of marine NCDs has not been demonstrated. This review synthesizes multiple datasets, drawing from cultivation-independent molecular techniques and data from extensive oceanic expeditions, to provide a comprehensive view into the diversity, biogeography, ecophysiology, and activity of marine NCDs. A NCD nifH gene catalog was compiled containing sequences from both PCR-based and PCR-free methods, identifying taxa for future studies. NCD abundances from a novel database of NCD nifH-based abundances were colocalized with environmental data, unveiling distinct distributions and environmental drivers of individual taxa. Mechanisms that NCDs may use to fuel and regulate N2 fixation in response to oxygen and fixed nitrogen availability are discussed, based on a metabolic analysis of recently available Tara Oceans expedition data. The integration of multiple datasets provides a new perspective that enhances understanding of the biology, ecology, and biogeography of marine NCDs and provides tools and directions for future research. 
  4. Dinitrogen (N2) fixation is carried out by specialized microbes, called diazotrophs, and is a major source of nitrogen supporting primary production in oligotrophic oceans. One of the best-characterized diazotroph habitats is the North Pacific Subtropical Gyre (NPSG), where warm, chronically N-limited surface waters promote year-round N2 fixation. At Station ALOHA (A Long-Term Oligotrophic Habitat Assessment) in the NPSG, N2 fixation is typically ascribed to conspicuous, filamentous cyanobacterial diazotrophs (Trichodesmium and Richelia), unicellular free-living Crocosphaera, and the UCYN-A/haptophyte symbiosis, based on using microscopy and quantitative PCR (qPCR). However, the diazotroph community in this ecosystem is diverse and includes non-cyanobacterial diazotrophs (NCDs). We investigated the diversity, depth distributions, and seasonality of diazotroph communities at Stn. ALOHA using high throughput sequencing (HTS) of nifH gene fragments from samples collected throughout the euphotic zone (0-175 m) at near-monthly intervals from June 2013 to July 2016. The UCYN-A symbioses and Trichodesmium sp. consistently had the highest relative abundances and seasonal patterns that corroborated qPCR-based analyses. Other prevalent community members included a new Crocosphaera-like species, and several NCDs affiliated with γ- and δ-proteobacteria. Notably, some of the NCDs appear to be stable components of the community at Stn. ALOHA, having also been reported in prior studies. Depth and temporal patterns in microdiversity within two major diazotroph groups (Trichodesmium and UCYN-A) suggested that sub-populations are adapted to time- and depth-dependent environmental variation. A network analysis of the upper euphotic (0-75 m) HTS data identified two modules that reflect a diazotroph community structure with seasonal turnover between UCYN-A/Gamma A, and Trichodesmium/Crocosphaera. 
It also reveals the seasonality of several important cyanobacteria and NCDs about which little is known, including a putative δ-proteobacterial phylotype originally discovered at Stn. ALOHA. Collectively, these results underscore the importance of coupling nifH gene HTS with other molecular techniques to obtain a comprehensive view of diazotroph community composition in the marine environment and reveal several understudied diazotroph groups that may contribute to N2 fixation in the NPSG. 
  5. Abstract Dinitrogen (N2) fixation represents a key source of reactive nitrogen in marine ecosystems. While the process has been rather well-explored in low latitudes of the Atlantic and Pacific Oceans, other higher latitude regions and particularly the Indian Ocean have been chronically overlooked. Here, we characterize N2 fixation and diazotroph community composition across nutrient and trace metals gradients spanning the multifrontal system separating the oligotrophic waters of the Indian Ocean subtropical gyre from the high nutrient low chlorophyll waters of the Southern Ocean. We found a sharp contrasting distribution of diazotroph groups across the frontal system. Notably, cyanobacterial diazotrophs dominated north of fronts, driving high N2 fixation rates (up to 13.96 nmol N l−1 d−1) with notable peaks near the South African coast. South of the fronts non-cyanobacterial diazotrophs prevailed without significant N2 fixation activity being detected. Our results provide new crucial insights into high latitude diazotrophy in the Indian Ocean, which should contribute to improved climate model parameterization and enhanced constraints on global net primary productivity projections. 
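Item 1 above reports that extending both tails of an approximately log-normal rate distribution leaves geometric-mean estimates nearly unchanged while raising arithmetic-mean estimates substantially. This follows from a standard property: if ln(x) is normal with mean μ and standard deviation σ, the geometric mean of x tends to exp(μ), whereas the arithmetic mean tends to exp(μ + σ²/2), so widening the distribution (larger σ) inflates only the arithmetic mean. The sketch below illustrates this with synthetic numbers; the values of μ and σ, the sample size, and the function name `summarize` are assumptions made for the demonstration and are not taken from the database.

```python
import random
import statistics


def summarize(sigma, n=10_000, seed=0):
    """Draw n synthetic log-normal 'rates' with ln(rate) ~ Normal(0, sigma)
    and return (arithmetic_mean, geometric_mean).

    The parameters are illustrative assumptions, not fitted to real data.
    """
    rng = random.Random(seed)
    rates = [rng.lognormvariate(0.0, sigma) for _ in range(n)]
    return statistics.fmean(rates), statistics.geometric_mean(rates)


# Same centre (mu = 0), but the second sample has much heavier tails.
narrow_arith, narrow_geo = summarize(sigma=1.0)
wide_arith, wide_geo = summarize(sigma=2.0)

print(f"sigma=1: arithmetic={narrow_arith:.2f}, geometric={narrow_geo:.2f}")
print(f"sigma=2: arithmetic={wide_arith:.2f}, geometric={wide_geo:.2f}")
```

In this toy setting both geometric means stay close to exp(0) = 1, while the arithmetic mean of the wider sample is several times larger, which is qualitatively the pattern described for version 1 versus version 2.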