skip to main content


Title: C3NA: correlation and consensus-based cross-taxonomy network analysis for compositional microbial data
Abstract Background

Studying the co-occurrence network structure of microbial samples is one of the critical approaches to understanding the perplexing and delicate relationship between the microbe, host, and diseases. It is also critical to develop a tool for investigating co-occurrence networks and differential abundance analyses to reveal the disease-related taxa–taxa relationship. In addition, it is also necessary to tighten the co-occurrence network into smaller modules to increase the ability for functional annotation and interpretability of  these taxa-taxa relationships.  Also, it is critical to retain the phylogenetic relationship among the taxa to identify differential abundance patterns, which can be used to resolve contradicting functions reported by different studies.

Results

In this article, we present Correlation and Consensus-based Cross-taxonomy Network Analysis (C3NA), a user-friendly R package for investigating compositional microbial sequencing data to identify and compare co-occurrence patterns across different taxonomic levels. C3NA contains two interactive graphic user interfaces (Shiny applications), one of them dedicated to the comparison between two diagnoses, e.g., disease versus control. We used C3NA to analyze two well-studied diseases, colorectal cancer, and Crohn’s disease. We discovered clusters of study and disease-dependent taxa that overlap with known functional taxa studied by other discovery studies and differential abundance analyses.

Conclusion

C3NA offers a new microbial data analyses pipeline for refined and enriched taxa–taxa co-occurrence network analyses, and the usability was further expanded via the built-in Shiny applications for interactive investigation.

 
more » « less
NSF-PAR ID:
10379315
Author(s) / Creator(s):
;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
BMC Bioinformatics
Volume:
23
Issue:
1
ISSN:
1471-2105
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background

    Histone post-translational modifications (PTMs) play an important role in our system by regulating the structure of chromatin and therefore contribute to the regulation of gene and protein expression. Irregularities in histone PTMs can lead to a variety of different diseases including various forms of cancer. Histone modifications are analyzed using high resolution mass spectrometry, which generate large amounts of data that requires sophisticated bioinformatics tools for analysis and visualization. PTMViz is designed for downstream differential abundance analysis and visualization of both protein and/or histone modifications.

    Results

    PTMViz provides users with data tables and visualization plots of significantly differentiated proteins and histone PTMs between two sample groups. All the data is packaged into interactive data tables and graphs using the Shiny platform to help the user explore the results in a fast and efficient manner to assess if changes in the system are due to protein abundance changes or epigenetic changes. In the example data provided, we identified several proteins differentially regulated in the dopaminergic pathway between mice treated with methamphetamine compared to a saline control. We also identified histone post-translational modifications including histone H3K9me, H3K27me3, H4K16ac, and that were regulated due to drug exposure.

    Conclusions

    Histone modifications play an integral role in the regulation of gene expression. PTMViz provides an interactive platform for analyzing proteins and histone post-translational modifications from mass spectrometry data in order to quickly identify differentially expressed proteins and PTMs.

     
    more » « less
  2. Abstract Motivation

    High-throughput sequencing technologies, in particular RNA sequencing (RNA-seq), have become the basic practice for genomic studies in biomedical research. In addition to studying genes individually, for example, through differential expression analysis, investigating co-ordinated expression variations of genes may help reveal the underlying cellular mechanisms to derive better understanding and more effective prognosis and intervention strategies. Although there exists a variety of co-expression network based methods to analyze microarray data for this purpose, instead of blindly extending these methods for microarray data that may introduce unnecessary bias, it is crucial to develop methods well adapted to RNA-seq data to identify the functional modules of genes with similar expression patterns.

    Results

    We have developed a fully Bayesian covariate-dependent negative binomial factor analysis (dNBFA) method—dNBFA—for RNA-seq count data, to capture coordinated gene expression changes, while considering effects from covariates reflecting different influencing factors. Unlike existing co-expression network based methods, our proposed model does not require multiple ad-hoc choices on data processing, transformation, as well as co-expression measures and can be directly applied to RNA-seq data. Furthermore, being capable of incorporating covariate information, the proposed method can tackle setups with complex confounding factors in different experiment designs. Finally, the natural model parameterization removes the need for a normalization preprocessing step, as commonly adopted to compensate for the effect of sequencing-depth variations. Efficient Bayesian inference of model parameters is derived by exploiting conditional conjugacy via novel data augmentation techniques. Experimental results on several real-world RNA-seq datasets on complex diseases suggest dNBFA as a powerful tool for discovering the gene modules with significant differential expression and meaningful biological insight.

    Availability and implementation

    dNBFA is implemented in R language and is available at https://github.com/siamakz/dNBFA.

     
    more » « less
  3. Abstract

    The epidermis of Chondrichthyan fishes consists of dermal denticles with production of minimal but protein-rich mucus that collectively, influence the attachment and biofilm development of microbes, facilitating a unique epidermal microbiome. Here, we use metagenomics to provide the taxonomic and functional characterization of the epidermal microbiome of theTriakis semifasciata(leopard shark) at three time-points collected across 4 years to identify links between microbial groups and host metabolism. Our aims include (1) describing the variation of microbiome taxa over time and identifying recurrent microbiome members (present across all time-points); (2) investigating the relationship between the recurrent and flexible taxa (those which are not found consistently across time-points); (3) describing the functional compositions of the microbiome which may suggest links with the host metabolism; and (4) identifying whether metabolic processes are shared across microbial genera or are unique to specific taxa. Microbial members of the microbiome showed high similarity between all individuals (Bray–Curtis similarity index = 82.7, where 0 = no overlap, 100 = total overlap) with the relative abundance of those members varying across sampling time-points, suggesting flexibility of taxa in the microbiome. One hundred and eighty-eight genera were identified as recurrent, includingPseudomonas,Erythrobacter,Alcanivorax,Marinobacter, andSphingopxisbeing consistently abundant across time-points, whileLimnobacterandXyellaexhibited switching patterns with high relative abundance in 2013,SphingobiumandSphingomonain 2015, andAltermonas,Leeuwenhoekiella,Gramella, andMaribacterin 2017. Of the 188 genera identified as recurrent, the top 19 relatively abundant genera formed three recurrent groups. The microbiome also displayed high functional similarity between individuals (Bray–Curtis similarity index = 97.6) with gene function composition remaining consistent across all time-points. These results show that while the presence of microbial genera exhibits consistency across time-points, their abundances do fluctuate. Microbial functions however remain stable across time-points; thus, we suggest the leopard shark microbiomes exhibit functional redundancy. We show coexistence of microbes hosted in elasmobranch microbiomes that encode genes involved in utilizing nitrogen, but not fixing nitrogen, degrading urea, and resistant to heavy metal.

     
    more » « less
  4. Reguera, Gemma (Ed.)
    ABSTRACT Mucosal defenses are crucial in animals for protection against pathogens and predators. Host defense peptides (antimicrobial peptides, AMPs) as well as skin-associated microbes are key components of mucosal immunity, particularly in amphibians. We integrate microbiology, molecular biology, network-thinking, and proteomics to understand how host and microbially derived products on amphibian skin (referred to as the mucosome) serve as pathogen defenses. We studied defense mechanisms against chytrid pathogens, Batrachochytrium dendrobatidis (Bd) and B. salamandrivorans (Bsal), in four salamander species with different Batrachochytrium susceptibilities. Bd infection was quantified using qPCR, mucosome function (i.e., ability to kill Bd or Bsal zoospores in vitro ), skin bacterial communities using 16S rRNA gene amplicon sequencing, and the role of Bd-inhibitory bacteria in microbial networks across all species. We explored the presence of candidate-AMPs in eastern newts and red-backed salamanders. Eastern newts had the highest Bd prevalence and mucosome function, while red-back salamanders had the lowest Bd prevalence and mucosome function, and two-lined salamanders and seal salamanders were intermediates. Salamanders with highest Bd infection intensity showed greater mucosome function. Bd infection prevalence significantly decreased as putative Bd-inhibitory bacterial richness and relative abundance increased on hosts. In co-occurrence networks, some putative Bd-inhibitory bacteria were found as hub-taxa, with red-backs having the highest proportion of protective hubs and positive associations related to putative Bd-inhibitory hub bacteria. We found more AMP candidates on salamanders with lower Bd susceptibility. These findings suggest that salamanders possess distinct innate mechanisms that affect chytrid fungi. IMPORTANCE How host mucosal defenses interact, and influence disease outcome is critical in understanding host defenses against pathogens. A more detailed understanding is needed of the interactions between the host and the functioning of its mucosal defenses in pathogen defense. This study investigates the variability of chytrid susceptibility in salamanders and the innate defenses each species possesses to mediate pathogens, thus advancing the knowledge toward a deeper understanding of the microbial ecology of skin-associated bacteria and contributing to the development of bioaugmentation strategies to mediate pathogen infection and disease. This study improves the understanding of complex immune defense mechanisms in salamanders and highlights the potential role of the mucosome to reduce the probability of Bd disease development and that putative protective bacteria may reduce likelihood of Bd infecting skin. 
    more » « less
  5. Abstract Motivation

    Differential network analysis is an important way to understand network rewiring involved in disease progression and development. Building differential networks from multiple ‘omics data provides insight into the holistic differences of the interactive system under different patient-specific groups. DINGO was developed to infer group-specific dependencies and build differential networks. However, DINGO and other existing tools are limited to analyze data arising from a single platform, and modeling each of the multiple ‘omics data independently does not account for the hierarchical structure of the data.

    Results

    We developed the iDINGO R package to estimate group-specific dependencies and make inferences on the integrative differential networks, considering the biological hierarchy among the platforms. A Shiny application has also been developed to facilitate easier analysis and visualization of results, including integrative differential networks and hub gene identification across platforms.

    Availability and implementation

    R package is available on CRAN (https://cran.r-project.org/web/packages/iDINGO) and Shiny application at https://github.com/MinJinHa/iDINGO.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less