Abstract Profiling the taxonomic and functional composition of microbes using metagenomic (MG) and metatranscriptomic (MT) sequencing is advancing our understanding of microbial functions. However, the sensitivity and accuracy of microbial classification using genome– or core protein-based approaches, especially the classification of eukaryotic organisms, is limited by the availability of genomes and the resolution of sequence databases. To address this, we propose the MicroFisher, a novel approach that applies multiple hypervariable marker genes to profile fungal communities from MGs and MTs. This approach utilizes the hypervariable regions of ITS and large subunit (LSU) rRNA genes for fungal identification with high sensitivity and resolution. Simultaneously, we propose a computational pipeline (MicroFisher) to optimize and integrate the results from classifications using multiple hypervariable markers. To test the performance of our method, we applied MicroFisher to the synthetic community profiling and found high performance in fungal prediction and abundance estimation. In addition, we also used MGs from forest soil and MTs of root eukaryotic microbes to test our method and the results showed that MicroFisher provided more accurate profiling of environmental microbiomes compared to other classification tools. Overall, MicroFisher serves as a novel pipeline for classification of fungal communities from MGs and MTs.
more »
« less
Learning from the unknown: exploring the range of bacterial functionality
Abstract Determining the repertoire of a microbe's molecular functions is a central question in microbial biology. Modern techniques achieve this goal by comparing microbial genetic material against reference databases of functionally annotated genes/proteins or known taxonomic markers such as 16S rRNA. Here, we describe a novel approach to exploring bacterial functional repertoires without reference databases. Our Fusion scheme establishes functional relationships between bacteria and assigns organisms to Fusion-taxa that differ from otherwise defined taxonomic clades. Three key findings of our work stand out. First, bacterial functional comparisons outperform marker genes in assigning taxonomic clades. Fusion profiles are also better for this task than other functional annotation schemes. Second, Fusion-taxa are robust to addition of novel organisms and are, arguably, able to capture the environment-driven bacterial diversity. Finally, our alignment-free nucleic acid-based Siamese Neural Network model, created using Fusion functions, enables finding shared functionality of very distant, possibly structurally different, microbial homologs. Our work can thus help annotate functional repertoires of bacterial organisms and further guide our understanding of microbial communities.
more »
« less
- Award ID(s):
- 1553289
- PAR ID:
- 10464502
- Publisher / Repository:
- Oxford University Press
- Date Published:
- Journal Name:
- Nucleic Acids Research
- Volume:
- 51
- Issue:
- 19
- ISSN:
- 0305-1048
- Format(s):
- Medium: X Size: p. 10162-10175
- Size(s):
- p. 10162-10175
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Bacterial diversity can be overwhelming. There is an ever-expanding number of bacterial taxa being discovered, but many of these taxa remain uncharacterized with unknown traits and environmental preferences. This diversity makes it challenging to interpret ecological patterns in microbiomes and understand why individual taxa, or assemblages, may vary across space and time. While we can use information from the rapidly growing databases of bacterial genomes to infer traits, we still need an approach to organize what we know, or think we know, about bacterial taxa to match taxonomic and phylogenetic information to trait inferences. Inspired by the periodic table of the elements, we have constructed a ‘periodic table’ of bacterial taxa to organize and visualize monophyletic groups of bacteria based on the distributions of key traits predicted from genomic data. By analyzing 50,745 genomes across 31 bacterial phyla, we used the Haar-like wavelet transformation, a model-free transformation of trait data, to identify clades of bacteria which are nearly uniform with respect to six selected traits - oxygen tolerance, autotrophy, chlorophototrophy, maximum potential growth rate, GC content and genome size. The identified functionally uniform clades of bacteria are presented in a concise ‘periodic table’-like format to facilitate identification and exploration of bacterial lineages in trait space. While our approach could be improved and expanded in the future, we demonstrate its utility for integrating phylogenetic information with genome-derived trait values to improve our understanding of the bacterial diversity found in environmental and host-associated microbiomes.more » « less
-
Abstract Food webs govern interactions among organisms and drive energy fluxes within ecosystems. With an increasing appreciation for the role of symbiotic microbes in host metabolism and development, it is imperative to understand the extent to which microbes conform to, and potentially influence, canonical food web efficiencies and structures. Here, we investigate whether bacteria and their taxa and functional genes are compositionally nested within a simple model food web hierarchy, and the extent to which this is predicted by the trophic position of the host. Using shotgun and amplicon sequencing of discrete food web compartments within replicate tank bromeliads, we find that both taxonomy and function are compositionally nested and largely mirror the pyramid-shaped distribution of food webs. Further, nearly the entirety of bacterial taxa and functional genes associated with hosts are contained within host-independent environmental samples. Community composition of bacterial taxa did not significantly correlate with that of functional genes, indicating a high likelihood of functional redundancy. Whereas bacterial taxa were shaped by both location and trophic position of their host, functional genes were not spatially structured. Our work illustrates the advantages of applying food web ecology to predict patterns of overlapping microbiome composition among unrelated hosts and distinct habitats. Because bacterial symbionts are critical components of host metabolic potential, this result raises important questions about whether bacterial consortia are shaped by the same energetic constraints as hosts, and whether they play an active role in food web efficiency.more » « less
-
Hird, Sarah M. (Ed.)The gut microbiome provides vital functions for mammalian hosts, yet research on its variability and function across adult life spans and multiple generations is limited in large mammalian carnivores. Here, we used 16S rRNA gene and metagenomic high-throughput sequencing to profile the bacterial taxonomic composition, genomic diversity, and metabolic function of fecal samples collected from 12 wild spotted hyenas ( Crocuta crocuta ) residing in the Masai Mara National Reserve, Kenya, over a 23-year period spanning three generations. The metagenomic data came from four of these hyenas and spanned two 2-year periods. With these data, we determined the extent to which host factors predicted variation in the gut microbiome and identified the core microbes present in the guts of hyenas. We also investigated novel genomic diversity in the mammalian gut by reporting the first metagenome-assembled genomes (MAGs) for hyenas. We found that gut microbiome taxonomic composition varied temporally, but despite this, a core set of 14 bacterial genera were identified. The strongest predictors of the microbiome were host identity and age, suggesting that hyenas possess individualized microbiomes and that these may change with age during adulthood. The gut microbiome functional profiles of the four adult hyenas were also individual specific and were associated with prey abundance, indicating that the functions of the gut microbiome vary with host diet. We recovered 149 high-quality MAGs from the hyenas’ guts; some MAGs were classified as taxa previously reported for other carnivores, but many were novel and lacked species-level matches to genomes in existing reference databases. IMPORTANCE There is a gap in knowledge regarding the genomic diversity and variation of the gut microbiome across a host’s life span and across multiple generations of hosts in wild mammals. Using two types of sequencing approaches, we found that although gut microbiomes were individualized and temporally variable among hyenas, they correlated similarly to large-scale changes in the ecological conditions experienced by their hosts. We also recovered 149 high-quality MAGs from the hyena gut, greatly expanding the microbial genome repertoire known for hyenas, carnivores, and wild mammals in general. Some MAGs came from genera abundant in the gastrointestinal tracts of canid species and other carnivores, but over 80% of MAGs were novel and from species not previously represented in genome databases. Collectively, our novel body of work illustrates the importance of surveying the gut microbiome of nonmodel wild hosts, using multiple sequencing methods and computational approaches and at distinct scales of analysis.more » « less
-
Abstract Study of life history strategies may help predict the performance of microorganisms in nature by organizing the complexity of microbial communities into groups of organisms with similar strategies. Here, we tested the extent that one common application of life history theory, the copiotroph-oligotroph framework, could predict the relative population growth rate of bacterial taxa in soils from four different ecosystems. We measured the change of in situ relative growth rate to added glucose and ammonium using both 18O–H2O and 13C quantitative stable isotope probing to test whether bacterial taxa sorted into copiotrophic and oligotrophic groups. We saw considerable overlap in nutrient responses across most bacteria regardless of phyla, with many taxa growing slowly and few taxa that grew quickly. To define plausible life history boundaries based on in situ relative growth rates, we applied Gaussian mixture models to organisms’ joint 18O–13C signatures and found that across experimental replicates, few taxa could consistently be assigned as copiotrophs, despite their potential for fast growth. When life history classifications were assigned based on average relative growth rate at varying taxonomic levels, finer resolutions (e.g., genus level) were significantly more effective in capturing changes in nutrient response than broad taxonomic resolution (e.g., phylum level). Our results demonstrate the difficulty in generalizing bacterial life history strategies to broad lineages, and even to single organisms across a range of soils and experimental conditions. We conclude that there is a continued need for the direct measurement of microbial communities in soil to advance ecologically realistic frameworks.more » « less
An official website of the United States government
