Inferring linkages between microbial metabolism and dissolved organic matter (DOM) across environmental gradients is a promising avenue to improve biogeochemical predictions at large spatial scales. Despite decades of metagenomic studies identifying microbial functional trait-environment patterns at small spatial scales, general patterns at continental or global scales that may improve large-scale models remain unresolved. Recent influx of multi-omics datasets that represent diverse environmental conditions has enabled scalable analyses linking microbial metabolic niche breadths with key environmental processes, such as carbon and nutrient transformations.Here, we leveraged publicly available microbial metagenome assembled genomes (MAGs) derived from the Worldwide Hydrobiogeochemistry Observation Network for Dynamic River Systems (WHONDRS) data paired with metabolomic (FTICR-MS) and sediment chemistry data to link microbial metabolic potential with organic chemistry. We annotated 1,384 MAGs representing 65 sites using the R tool microTrait, which categorizes functional traits under the YAS (growth yield-resource acquisition-stress tolerance) framework. Following Hutchinsonian niche theory, we modeled microbial trait combinations as n-dimensional hypervolumes and observed trait-DOM patterns at the continental scale, showing microbial functional tradeoffs along gradients of organic carbon. We expect that at the continental scale, microbial trait profiles will be distinct across climatic regions, and that niche breadth (i.e. the size of individual hypervolumes in trait space) will correlate with DOM/metabolite diversity. The results of this work will distill generalizable patterns of microbe-DOM availability and diversity at large spatial scales, thus identifying information to improve current biogeochemical models.
more »
« less
METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
Abstract BackgroundAdvances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling. ResultsWe present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~ 3 h with 40 CPU threads to process ~ 100 genomes and corresponding metagenomic reads within which the most compute-demanding part of hmmsearch takes ~ 45 min, while it takes ~ 5 h to complete hmmsearch for ~ 3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut. ConclusionMETABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 athttps://github.com/AnantharamanLab/METABOLIC.
more »
« less
- Award ID(s):
- 2047598
- PAR ID:
- 10367464
- Publisher / Repository:
- Springer Science + Business Media
- Date Published:
- Journal Name:
- Microbiome
- Volume:
- 10
- Issue:
- 1
- ISSN:
- 2049-2618
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract BackgroundRock-dwelling microorganisms are key players in ecosystem functioning of Antarctic ice free-areas. Yet, little is known about their diversity and ecology, and further still, viruses in these communities have been largely unexplored despite important roles related to host metabolism and nutrient cycling. To begin to address this, we present a large-scale viral catalog from Antarctic rock microbial communities. ResultsWe performed metagenomic analyses on rocks from across Antarctica representing a broad range of environmental and spatial conditions, and which resulted in a predicted viral catalog comprising > 75,000 viral operational taxonomic units (vOTUS). We found largely undescribed, highly diverse and spatially structured virus communities which had predicted auxiliary metabolic genes (AMGs) with functions indicating that they may be potentially influencing bacterial adaptation and biogeochemistry. ConclusionThis catalog lays the foundation for expanding knowledge of virosphere diversity, function, spatial ecology, and dynamics in extreme environments. This work serves as a step towards exploring adaptability of microbial communities in the face of a changing climate.more » « less
-
Gralnick, Jeffrey A. (Ed.)ABSTRACT Reconstructing microbial genomes from metagenomic short-read data can be challenging due to the unknown and uneven complexity of microbial communities. This complexity encompasses highly diverse populations, which often includes strain variants. Reconstructing high-quality genomes is a crucial part of the metagenomic workflow, as subsequent ecological and metabolic inferences depend on their accuracy, quality, and completeness. In contrast to microbial communities in other ecosystems, there has been no systematic assessment of genome-centric metagenomic workflows for drinking water microbiomes. In this study, we assessed the performance of a combination of assembly and binning strategies for time series drinking water metagenomes that were collected over 6 months. The goal of this study was to identify the combination of assembly and binning approaches that result in high-quality and -quantity metagenome-assembled genomes (MAGs), representing most of the sequenced metagenome. Our findings suggest that the metaSPAdes coassembly strategies had the best performance, as they resulted in larger and less fragmented assemblies, with at least 85% of the sequence data mapping to contigs greater than 1 kbp. Furthermore, a combination of metaSPAdes coassembly strategies and MetaBAT2 produced the highest number of medium-quality MAGs while capturing at least 70% of the metagenomes based on read recruitment. Utilizing different assembly/binning approaches also assists in the reconstruction of unique MAGs from closely related species that would have otherwise collapsed into a single MAG using a single workflow. Overall, our study suggests that leveraging multiple binning approaches with different metaSPAdes coassembly strategies may be required to maximize the recovery of good-quality MAGs. IMPORTANCE Drinking water contains phylogenetic diverse groups of bacteria, archaea, and eukarya that affect the esthetic quality of water, water infrastructure, and public health. Taxonomic, metabolic, and ecological inferences of the drinking water microbiome depend on the accuracy, quality, and completeness of genomes that are reconstructed through the application of genome-resolved metagenomics. Using time series metagenomic data, we present reproducible genome-centric metagenomic workflows that result in high-quality and -quantity genomes, which more accurately signifies the sequenced drinking water microbiome. These genome-centric metagenomic workflows will allow for improved taxonomic and functional potential analysis that offers enhanced insights into the stability and dynamics of drinking water microbial communities.more » « less
-
Abstract The introduction of high-throughput chromosome conformation capture (Hi-C) into metagenomics enables reconstructing high-quality metagenome-assembled genomes (MAGs) from microbial communities. Despite recent advances in recovering eukaryotic, bacterial, and archaeal genomes using Hi-C contact maps, few of Hi-C-based methods are designed to retrieve viral genomes. Here we introduce ViralCC, a publicly available tool to recover complete viral genomes and detect virus-host pairs using Hi-C data. Compared to other Hi-C-based methods, ViralCC leverages the virus-host proximity structure as a complementary information source for the Hi-C interactions. Using mock and real metagenomic Hi-C datasets from several different microbial ecosystems, including the human gut, cow fecal, and wastewater, we demonstrate that ViralCC outperforms existing Hi-C-based binning methods as well as state-of-the-art tools specifically dedicated to metagenomic viral binning. ViralCC can also reveal the taxonomic structure of viruses and virus-host pairs in microbial communities. When applied to a real wastewater metagenomic Hi-C dataset, ViralCC constructs a phage-host network, which is further validated using CRISPR spacer analyses. ViralCC is an open-source pipeline available athttps://github.com/dyxstat/ViralCC.more » « less
-
While recent efforts to catalogue Earth’s microbial diversity have focused upon surface and marine habitats, 12–20 % of Earth’s biomass is suggested to exist in the terrestrial deep subsurface, compared to ~1.8 % in the deep subseafloor. Metagenomic studies of the terrestrial deep subsurface have yielded a trove of divergent and functionally important microbiomes from a range of localities. However, a wider perspective of microbial diversity and its relationship to environmental conditions within the terrestrial deep subsurface is still required. Our meta-analysis reveals that terrestrial deep subsurface microbiota are dominated byBetaproteobacteria, GammaproteobacteriaandFirmicutes, probably as a function of the diverse metabolic strategies of these taxa. Evidence was also found for a common small consortium of prevalentBetaproteobacteriaandGammaproteobacteriaoperational taxonomic units across the localities. This implies a core terrestrial deep subsurface community, irrespective of aquifer lithology, depth and other variables, that may play an important role in colonizing and sustaining microbial habitats in the deep terrestrial subsurface. Anin silicocontamination-aware approach to analysing this dataset underscores the importance of downstream methods for assuring that robust conclusions can be reached from deep subsurface-derived sequencing data. Understanding the global panorama of microbial diversity and ecological dynamics in the deep terrestrial subsurface provides a first step towards understanding the role of microbes in global subsurface element and nutrient cycling.more » « less