
Title: Complementary Metagenomic Approaches Improve Reconstruction of Microbial Diversity in a Forest Soil
ABSTRACT Soil ecosystems harbor diverse microorganisms and yet remain only partially characterized as neither single-cell sequencing nor whole-community sequencing offers a complete picture of these complex communities. Thus, the genetic and metabolic potential of this “uncultivated majority” remains underexplored. To address these challenges, we applied a pooled-cell-sorting-based mini-metagenomics approach and compared the results to bulk metagenomics. Informatic binning of these data produced 200 mini-metagenome assembled genomes (sorted-MAGs) and 29 bulk metagenome assembled genomes (MAGs). The sorted and bulk MAGs increased the known phylogenetic diversity of soil taxa by 7.2% with respect to the Joint Genome Institute IMG/M database and showed clade-specific sequence recruitment patterns across diverse terrestrial soil metagenomes. Additionally, sorted-MAGs expanded the rare biosphere not captured through MAGs from bulk sequences, exemplified through phylogenetic and functional analyses of members of the phylum Bacteroidetes. Analysis of 67 Bacteroidetes sorted-MAGs showed conserved patterns of carbon metabolism across four clades. These results indicate that mini-metagenomics enables genome-resolved investigation of predicted metabolism and demonstrates the utility of combining metagenomics methods to tap into the diversity of heterogeneous microbial assemblages.
IMPORTANCE Microbial ecologists have historically used cultivation-based approaches as well as amplicon sequencing and shotgun metagenomics to characterize microbial diversity in soil. However, challenges persist in the study of microbial diversity, including the recalcitrance of the majority of microorganisms to laboratory cultivation and limited sequence assembly from highly complex samples. The uncultivated majority thus remains a reservoir of untapped genetic diversity. To address some of the challenges associated with bulk metagenomics as well as low throughput of single-cell genomics, we applied flow cytometry-enabled mini-metagenomics to capture expanded microbial diversity from forest soil and compare it to soil bulk metagenomics. Our resulting data from this pooled-cell sorting approach combined with bulk metagenomics revealed increased phylogenetic diversity through novel soil taxa and rare biosphere members. In-depth analysis of genomes within the highly represented Bacteroidetes phylum provided insights into conserved and clade-specific patterns of carbon metabolism.
Jansson, Janet K.
Sponsoring Org:
National Science Foundation
More Like this
  1. Kent, Angela D. (Ed.)
    ABSTRACT Methylmercury is a potent bioaccumulating neurotoxin that is produced by specific microorganisms that methylate inorganic mercury. Methylmercury production in diverse anaerobic bacteria and archaea was recently linked to the hgcAB genes. However, the full phylogenetic and metabolic diversity of mercury-methylating microorganisms has not been fully unraveled due to the limited number of cultured experimentally verified methylators and the limitations of primer-based molecular methods. Here, we describe the phylogenetic diversity and metabolic flexibility of putative mercury-methylating microorganisms by hgcAB identification in publicly available isolate genomes and metagenome-assembled genomes (MAGs) as well as novel freshwater MAGs. We demonstrate that putative mercury methylators are much more phylogenetically diverse than previously known and that hgcAB distribution among genomes is most likely due to several independent horizontal gene transfer events. The microorganisms we identified possess diverse metabolic capabilities spanning carbon fixation, sulfate reduction, nitrogen fixation, and metal resistance pathways. We identified 111 putative mercury methylators in a set of previously published permafrost metatranscriptomes and demonstrated that different methylating taxa may contribute to hgcA expression at different depths. Overall, we provide a framework for illuminating the microbial basis of mercury methylation using genome-resolved metagenomics and metatranscriptomics to identify putative methylators based upon hgcAB presence and describe their putative functions in the environment.
    IMPORTANCE Accurately assessing the production of bioaccumulative neurotoxic methylmercury by characterizing the phylogenetic diversity, metabolic functions, and activity of methylators in the environment is crucial for understanding constraints on the mercury cycle. Much of our understanding of methylmercury production is based on cultured anaerobic microorganisms within the Deltaproteobacteria, Firmicutes, and Euryarchaeota. Advances in next-generation sequencing technologies have enabled large-scale cultivation-independent surveys of diverse and poorly characterized microorganisms from numerous ecosystems. We used genome-resolved metagenomics and metatranscriptomics to highlight the vast phylogenetic and metabolic diversity of putative mercury methylators and their depth-discrete activities in thawing permafrost. This work underscores the importance of using genome-resolved metagenomics to survey specific putative methylating populations of a given mercury-impacted ecosystem.
  2. Abstract Background

    Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling.


    We present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~3 h with 40 CPU threads to process ~100 genomes and corresponding metagenomic reads, within which the most compute-demanding part, hmmsearch, takes ~45 min, while it takes ~5 h to complete hmmsearch for ~3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut.


    METABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 at

  3. Abstract Background

    Climate change will result in more frequent droughts that can impact soil-inhabiting microbiomes (rhizobiomes) in the agriculturally vital North American perennial grasslands. Rhizobiomes have contributed to enhancing drought resilience and stress resistance properties in plant hosts. In the predicted events of more future droughts, how the changing rhizobiome under environmental stress can impact the plant host resilience needs to be deciphered. There is also an urgent need to identify and recover candidate microorganisms along with their functions, involved in enhancing plant resilience, enabling the successful development of synthetic communities.


    In this study, we used the combination of cultivation and high-resolution genomic sequencing of bacterial communities recovered from the rhizosphere of a tallgrass prairie foundation grass, Andropogon gerardii. We cultivated the plant host-associated microbes under artificial drought-induced conditions and identified the microbe(s) that might play a significant role in the rhizobiome of Andropogon gerardii under drought conditions. Phylogenetic analysis of the non-redundant metagenome-assembled genomes (MAGs) identified a bacterial genome of interest, MAG-Pseudomonas. Further metabolic pathway and pangenome analyses recovered genes and pathways in MAG-Pseudomonas related to stress responses, including ACC deaminase, and to nitrogen transformation, including assimilatory nitrate reductase, which might be associated with enhanced drought tolerance and growth for Andropogon gerardii.


    Our data indicated that the metagenome-assembled MAG-Pseudomonas has the functional potential to contribute to the plant host’s growth during stressful conditions. Our study also suggested the nitrogen transformation potential of MAG-Pseudomonas that could impact Andropogon gerardii growth in a positive way. The cultivation of MAG-Pseudomonas sets the foundation to construct a successful synthetic community for Andropogon gerardii. To conclude, stress resilience mediated through the ACC deaminase gene and nitrogen transformation potential through assimilatory nitrate reductase in MAG-Pseudomonas could place this microorganism as an important candidate of the rhizobiome aiding the plant host resilience under environmental stress. This study, therefore, provided insights into MAG-Pseudomonas and its potential to optimize plant productivity under ever-changing climatic patterns, especially in frequent drought conditions.

  4. Deep subsurface environments are decoupled from Earth’s surface processes yet diverse, active, and abundant microbial communities thrive in these isolated environments. Microbes inhabiting the deep biosphere face unique challenges such as electron donor/acceptor limitations, pore space/fracture network limitations, and isolation from other microbes within the formation. Of the few systems that have been characterized, it is apparent that nutrient limitations likely facilitate diverse microbe-microbe interactions (i.e., syntrophic, symbiotic, or parasitic) and that these interactions drive biogeochemical cycling of major elements. Here we describe microbial communities living in low temperature, chemically reduced brines at the Soudan Underground Mine State Park, United States. The Soudan Iron mine intersects a massive hematite formation at the southern extent of the Canadian Shield. Fractured rock aquifer brines continuously flow from exploratory boreholes drilled circa 1960 and are enriched in deuterium compared to the global meteoric values, indicating brines have had little contact with surface derived waters, and continually degas low molecular weight hydrocarbons (C1-C4). Microbial enrichments suggest that once brines exit the boreholes, oxidation of the hydrocarbons occurs. Amplicon sequencing shows these borehole communities are low in diversity and dominated by the Firmicutes and Proteobacteria phyla. From the metagenome assemblies, we recovered approximately thirty genomes with estimated completion over 50%. Analysis of genome taxonomy generally followed the amplicon data, and highlights that several of the genomes represent novel families and genera. Metabolic reconstruction shows two carbon-fixation pathways were dominant, the Wood-Ljungdahl (acetogenesis) and Calvin-Benson-Bassham (via RuBisCo), indicating that inorganic carbon likely enters into the microbial foodweb with differing carbon fractionation potentials. Interestingly, methanogenesis is likely driven by Methanolobus and suggests cycling of methylated compounds and not H2/CO2 or acetate. Furthermore, the abundance of sulfate in brines suggests cryptic sulfur cycling may occur, as we detect possible sulfate reducing and thiosulfate oxidizing microorganisms. Finally, a majority of the microorganisms identified contain genes that would allow them to participate in several element cycles, highlighting that in these deep isolated systems metabolic flexibility may be an important life history trait.
  5. Abstract With advances in DNA sequencing and miniaturized molecular biology workflows, rapid and affordable sequencing of single-cell genomes has become a reality. Compared to 16S rRNA gene surveys and shotgun metagenomics, large-scale application of single-cell genomics to whole microbial communities provides an integrated snapshot of community composition and function, directly links mobile elements to their hosts, and enables analysis of population heterogeneity of the dominant community members. To that end, we sequenced nearly 500 single-cell genomes from a low diversity hot spring sediment sample from Dewar Creek, British Columbia, and compared this approach to 16S rRNA gene amplicon and shotgun metagenomics applied to the same sample. We found that the broad taxonomic profiles were similar across the three sequencing approaches, though several lineages were missing from the 16S rRNA gene amplicon dataset, likely the result of primer mismatches. At the functional level, we detected a large array of mobile genetic elements present in the single-cell genomes but absent from the corresponding same species metagenome-assembled genomes. Moreover, we performed a single-cell population genomic analysis of the three most abundant community members, revealing differences in population structure based on mutation and recombination profiles. While the average pairwise nucleotide identities were similar across the dominant species-level lineages, we observed differences in the extent of recombination between these dominant populations. Most intriguingly, the creek’s Hydrogenobacter sp. population appeared to be so recombinogenic that it more closely resembled a sexual species than a clonally evolving microbe. Together, this work demonstrates that a randomized single-cell approach can be useful for the exploration of previously uncultivated microbes from community composition to population structure.