Abstract The introduction of high-throughput chromosome conformation capture (Hi-C) into metagenomics enables reconstructing high-quality metagenome-assembled genomes (MAGs) from microbial communities. Despite recent advances in recovering eukaryotic, bacterial, and archaeal genomes using Hi-C contact maps, few of Hi-C-based methods are designed to retrieve viral genomes. Here we introduce ViralCC, a publicly available tool to recover complete viral genomes and detect virus-host pairs using Hi-C data. Compared to other Hi-C-based methods, ViralCC leverages the virus-host proximity structure as a complementary information source for the Hi-C interactions. Using mock and real metagenomic Hi-C datasets from several different microbial ecosystems, including the human gut, cow fecal, and wastewater, we demonstrate that ViralCC outperforms existing Hi-C-based binning methods as well as state-of-the-art tools specifically dedicated to metagenomic viral binning. ViralCC can also reveal the taxonomic structure of viruses and virus-host pairs in microbial communities. When applied to a real wastewater metagenomic Hi-C dataset, ViralCC constructs a phage-host network, which is further validated using CRISPR spacer analyses. ViralCC is an open-source pipeline available athttps://github.com/dyxstat/ViralCC.
more »
« less
Integrating biological knowledge for mechanistic inference in the host-associated microbiome
Advances in high-throughput technologies have enhanced our ability to describe microbial communities as they relate to human health and disease. Alongside the growth in sequencing data has come an influx of resources that synthesize knowledge surrounding microbial traits, functions, and metabolic potential with knowledge of how they may impact host pathways to influence disease phenotypes. These knowledge bases can enable the development of mechanistic explanations that may underlie correlations detected between microbial communities and disease. In this review, we survey existing resources and methodologies for the computational integration of broad classes of microbial and host knowledge. We evaluate these knowledge bases in their access methods, content, and source characteristics. We discuss challenges of the creation and utilization of knowledge bases including inconsistency of nomenclature assignment of taxa and metabolites across sources, whether the biological entities represented are rooted in ontologies or taxonomies, and how the structure and accessibility limit the diversity of applications and user types. We make this information available in a code and data repository at:https://github.com/lozuponelab/knowledge-source-mappings. Addressing these challenges will allow for the development of more effective tools for drawing from abundant knowledge to find new insights into microbial mechanisms in disease by fostering a systematic and unbiased exploration of existing information.
more »
« less
- Award ID(s):
- 2022138
- PAR ID:
- 10522037
- Publisher / Repository:
- Frontiers
- Date Published:
- Journal Name:
- Frontiers in Microbiology
- Volume:
- 15
- ISSN:
- 1664-302X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
ABSTRACT Phosphorylation is a substantial posttranslational modification of proteins that refers to adding a phosphate group to the amino acid side chain after translation process in the ribosome. It is vital to coordinate cellular functions, such as regulating metabolism, proliferation, apoptosis, subcellular trafficking, and other crucial physiological processes. Phosphorylation prediction in a microbial organism can assist in understanding pathogenesis and host–pathogen interaction, drug and antibody design, and antimicrobial agent development. Experimental methods for predicting phosphorylation sites are costly, slow, and tedious. Hence low‐cost and high‐speed computational approaches are highly desirable. This paper presents a new deep learning tool called DeepPhoPred for predicting microbial phospho‐serine (pS), phospho‐threonine (pT), and phospho‐tyrosine (pY) sites. DeepPhoPred incorporates a two‐headed convolutional neural network architecture with the squeeze and excitation blocks followed by fully connected layers that jointly learn significant features from the peptide's structural and evolutionary information to predict phosphorylation sites. Our empirical results demonstrate that DeepPhoPred significantly outperforms the existing microbial phosphorylation site predictors with its highly efficient deep‐learning architecture. DeepPhoPred as a standalone predictor, all its source codes, and our employed datasets are publicly available athttps://github.com/faisalahm3d/DeepPhoPred.more » « less
-
Faust, Karoline (Ed.)ABSTRACT Microbes commonly organize into communities consisting of hundreds of species involved in complex interactions with each other. 16S ribosomal RNA (16S rRNA) amplicon profiling provides snapshots that reveal the phylogenies and abundance profiles of these microbial communities. These snapshots, when collected from multiple samples, can reveal the co-occurrence of microbes, providing a glimpse into the network of associations in these communities. However, the inference of networks from 16S data involves numerous steps, each requiring specific tools and parameter choices. Moreover, the extent to which these steps affect the final network is still unclear. In this study, we perform a meticulous analysis of each step of a pipeline that can convert 16S sequencing data into a network of microbial associations. Through this process, we map how different choices of algorithms and parameters affect the co-occurrence network and identify the steps that contribute substantially to the variance. We further determine the tools and parameters that generate robust co-occurrence networks and develop consensus network algorithms based on benchmarks with mock and synthetic data sets. The Microbial Co-occurrence Network Explorer, or MiCoNE (available athttps://github.com/segrelab/MiCoNE) follows these default tools and parameters and can help explore the outcome of these combinations of choices on the inferred networks. We envisage that this pipeline could be used for integrating multiple data sets and generating comparative analyses and consensus networks that can guide our understanding of microbial community assembly in different biomes. IMPORTANCEMapping the interrelationships between different species in a microbial community is important for understanding and controlling their structure and function. The surge in the high-throughput sequencing of microbial communities has led to the creation of thousands of data sets containing information about microbial abundances. These abundances can be transformed into co-occurrence networks, providing a glimpse into the associations within microbiomes. However, processing these data sets to obtain co-occurrence information relies on several complex steps, each of which involves numerous choices of tools and corresponding parameters. These multiple options pose questions about the robustness and uniqueness of the inferred networks. In this study, we address this workflow and provide a systematic analysis of how these choices of tools affect the final network and guidelines on appropriate tool selection for a particular data set. We also develop a consensus network algorithm that helps generate more robust co-occurrence networks based on benchmark synthetic data sets.more » « less
-
Skin microbial communities are an essential part of host health and can play a role in mitigating disease. Host and environmental factors can shape and alter these microbial communities and, therefore, we need to understand to what extent these factors influence microbial communities and how this can impact disease dynamics. Microbial communities have been studied in amphibian systems due to skin microbial communities providing some resistance to the amphibian chytrid fungus, Batrachochytrium dendrobatidis . However, we are only starting to understand how host and environmental factors shape these communities for amphibians. In this study, we examined whether amphibian skin bacterial communities differ among host species, host infection status, host developmental stage, and host habitat. We collected skin swabs from tadpoles and adults of three Ranid frog species ( Lithobates spp.) at the Mianus River Gorge Preserve in Bedford, New York, USA, and used 16S rRNA gene amplicon sequencing to determine bacterial community composition. Our analysis suggests amphibian skin bacterial communities change across host developmental stages, as has been documented previously. Additionally, we found that skin bacterial communities differed among Ranid species, with skin communities on the host species captured in streams or bogs differing from the communities of the species captured on land. Thus, habitat use of different species may drive differences in host-associated microbial communities for closely-related host species.more » « less
-
Abstract Warmer winters with less snowfall are increasing the frequency of soil freeze–thaw cycles across temperate regions. Soil microbial responses to freeze–thaw cycles vary and some of this variation may be explained by microbial conditioning to prior winter conditions, yet such linkages remain largely unexplored. We investigated how differences in temperature history influenced microbial community composition and activity in response to freeze–thaw cycles.We collected soil microbial communities that developed under colder (high elevation) and warmer (low elevation) temperature regimes in spruce‐fir forests, then added each of these soil microbial communities to a sterile bulk‐soil in a laboratory microcosm experiment. The inoculated high‐elevation cold and low‐elevation warm microcosms were subjected to diurnal freeze–thaw cycles or constant above‐freezing temperature for 9 days. Then, all microcosms were subjected to a 7‐day above‐freezing recovery period.Overall, we found that the high‐elevation cold community had, relative to the low‐elevation warm community, a smaller reduction in microbial respiration (CO2flux) during freeze–thaw cycles. Further, the high‐elevation cold community, on average, experienced lower freeze–thaw‐induced bacterial mortality than the warm community and may have partly acclimated to freeze–thaw cycles via increased lipid membrane fluidity. Respiration of both microbial communities quickly recovered following the end of the freeze–thaw treatment period and there were no changes in soil extractable carbon or nitrogen.Our results provide evidence that past soil temperature conditions may influence the responses of soil microbial communities to freeze–thaw cycles. The microbial community that developed under a colder temperature regime was more tolerant of freeze–thaw cycles than the community that developed under a warmer temperature regime, although both communities displayed some level of resilience. Taken together, our data suggest that microbial communities conditioned to less extreme winter soil temperatures may be most vulnerable to rapid changes in freeze–thaw regimes as winters warm, but they also may be able to quickly recover if mortality is low. Read the freePlain Language Summaryfor this article on the Journal blog.more » « less
An official website of the United States government

