skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Multiscale adaptive differential abundance analysis in microbial compositional data
Abstract MotivationDifferential abundance analysis is an essential and commonly used tool to characterize the difference between microbial communities. However, identifying differentially abundant microbes remains a challenging problem because the observed microbiome data are inherently compositional, excessive sparse, and distorted by experimental bias. Besides these major challenges, the results of differential abundance analysis also depend largely on the choice of analysis unit, adding another practical complexity to this already complicated problem. ResultsIn this work, we introduce a new differential abundance test called the MsRDB test, which embeds the sequences into a metric space and integrates a multiscale adaptive strategy for utilizing spatial structure to identify differentially abundant microbes. Compared with existing methods, the MsRDB test can detect differentially abundant microbes at the finest resolution offered by data and provide adequate detection power while being robust to zero counts, compositional effect, and experimental bias in the microbial compositional dataset. Applications to both simulated and real microbial compositional datasets demonstrate the usefulness of the MsRDB test. Availability and implementationAll analyses can be found under https://github.com/lakerwsl/MsRDB-Manuscript-Code.  more » « less
Award ID(s):
2113458
PAR ID:
10407450
Author(s) / Creator(s):
;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Bioinformatics
Volume:
39
Issue:
4
ISSN:
1367-4811
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary Differential abundance tests for compositional data are essential and fundamental in various biomedical applications, such as single-cell, bulk RNA-seq and microbiome data analysis. However, because of the compositional constraint and the prevalence of zero counts in the data, differential abundance analysis on compositional data remains a complicated and unsolved statistical problem. This article proposes a new differential abundance test, the robust differential abundance test, to address these challenges. Compared with existing methods, the robust differential abundance test is simple and computationally efficient, is robust to prevalent zero counts in compositional datasets, can take the data’s compositional nature into account, and has a theoretical guarantee of controlling false discoveries in a general setting. Furthermore, in the presence of observed covariates, the robust differential abundance test can work with covariate-balancing techniques to remove potential confounding effects and draw reliable conclusions. The proposed test is applied to several numerical examples, and its merits are demonstrated using both simulated and real datasets. 
    more » « less
  2. Abstract BackgroundStudying the co-occurrence network structure of microbial samples is one of the critical approaches to understanding the perplexing and delicate relationship between the microbe, host, and diseases. It is also critical to develop a tool for investigating co-occurrence networks and differential abundance analyses to reveal the disease-related taxa–taxa relationship. In addition, it is also necessary to tighten the co-occurrence network into smaller modules to increase the ability for functional annotation and interpretability of  these taxa-taxa relationships.  Also, it is critical to retain the phylogenetic relationship among the taxa to identify differential abundance patterns, which can be used to resolve contradicting functions reported by different studies. ResultsIn this article, we present Correlation and Consensus-based Cross-taxonomy Network Analysis (C3NA), a user-friendly R package for investigating compositional microbial sequencing data to identify and compare co-occurrence patterns across different taxonomic levels. C3NA contains two interactive graphic user interfaces (Shiny applications), one of them dedicated to the comparison between two diagnoses, e.g., disease versus control. We used C3NA to analyze two well-studied diseases, colorectal cancer, and Crohn’s disease. We discovered clusters of study and disease-dependent taxa that overlap with known functional taxa studied by other discovery studies and differential abundance analyses. ConclusionC3NA offers a new microbial data analyses pipeline for refined and enriched taxa–taxa co-occurrence network analyses, and the usability was further expanded via the built-in Shiny applications for interactive investigation. 
    more » « less
  3. Srivastava, Abhishek (Ed.)
    IntroductionMarine particles form in the ocean surface sink through the water column into the deep ocean, sequestering carbon. Microorganisms inhabit and consume carbon in these particles. The East Pacific Rise (EPR) harbors both an Oxygen Deficient Zone (ODZ) and a non-buoyant plume region formed from hydrothermal vents located on the ocean floor, allowing us to explore relationships between microbial community and particle size between a range of environments. MethodsIn this study, we quantified microbial diversity using a fractionation method that separated particles into seven fine scale fractions (0.2–1.2, 1.2–5, 5–20, 20–53, 53–180,180–500, >500 μm), and included a spike-in standard for sequencing the 16S rRNA gene. Size fractionated organic carbon into the same fractions enabled the calculation of bacterial 16S rRNA copies per μg C and per liter. ResultsThere was a large increase in the bacterial 16S rRNA copies/ug C and copies/L on particles >180 μm between the upper water column and the deep water column. Though the total concentration of organic C in particles decreased in the deep water column, the density of bacteria on large particles increased at depth. The microbial community varied statistically significantly as a function of particle size and depth. Quantitative abundance estimates found that ostensibly obligate free-living microbes, such as SAR11 and Thaumarcheota, were more abundant in the free-living fraction but also common and abundant in the particulate size fractions. Conversely, ostensibly obligate particle attached bacteria such as members of Bacteroidetes and Planctomycetes, while most abundant on particles, were also present in the free living fraction. Total bacterial abundance, and the abundance of many taxonomic groups, increased in the ODZ region, particularly in the free-living fraction. Contrastingly, in the non-buoyant plume, there were highly abundant bacteria in the 5–20 and 20–53 μm fractions but reduced bacteria present in the 53–180 and 180–500 μm fractions. ConclusionQuantitative examination of microbial communities highlights the distribution of microbial taxa unburdened by compositional effects. These data are congruent with existing models which suggest high levels of exchange between particle-attached and free-living communities. 
    more » « less
  4. Abstract Aims and backgroundThe resurrection plantMyrothamnus flabellifoliatolerates complete desiccation and is a great model for studying how plants cope with extreme drought. Root-associated microbes play a major role in stress tolerance and are an attractive target for enhancing drought tolerance in staple crops. However, how these dynamics play out under the most extreme water limitation remains underexplored. This study aimed to identify bacterial and fungal communities that tolerate extreme drought stress in the bulk soil, rhizosphere, and endosphere ofM. flabellifolia. MethodsHigh-throughput amplicon sequencing was used to characterise the microbial communities associated withM. flabellifolia. ResultsThe bacterial phyla that were most abundant across all compartments wereAcidobacteriota, Actinobacteriota, Chloroflexota, Planctomycetota,andPseudomonadota, while the most abundant fungal phyla wereAscomycotaandBasidiomycota. Although the bulk soil hosted multiple beneficial root-associated microbes, the rhizosphere compartment showed the highest functional diversity of bacteria and fungi. In contrast, the endosphere exhibited a low abundance and diversity of microbes. These findings share consistent with the theory thatM. flabellifoliarecruits soil microbes from the bulk to the rhizosphere and finally to the endosphere. It is possible that these microbes could promote drought tolerance in associated plant tissues. ConclusionWe find that compartments act as the major driver of microbial diversity, but the soil physicochemical factors also influence microbial composition. These results suggest that the root-associated microbiome ofM. flabellifoliais highly structured and may aid in plant function. 
    more » « less
  5. Abstract BackgroundThe processes that shape microbial biogeography are not well understood, and concepts that apply to macroorganisms, like dispersal barriers, may not affect microorganisms in the same predictable ways. To better understand how known macro-scale biogeographic processes can be applied at micro-scales,we examined seagrass associated microbiota on either side of Wallace’s line to determine the influence of this cryptic dispersal boundary on the community structure of microorganisms. Communities were examined from twelve locations throughout Indonesia on either side of this theoretical line. ResultsWe found significant differences in microbial community structure on either side of this boundary (R2 = 0.09;P = 0.001), and identified seven microbial genera as differentially abundant on either side of the line, six of these were more abundant in the West, with the other more strongly associated with the East. Genera found to be differentially abundant had significantly smaller minimum cell dimensions (GLM: t923 = 59.50,P < 0.001) than the overall community. ConclusionDespite the assumed excellent dispersal ability of microbes, we were able to detect significant differences in community structure on either side of this cryptic biogeographic boundary. Samples from the two closest islands on opposite sides of the line, Bali and Komodo, were more different from each other than either was to its most distant island on the same side. We suggest that limited dispersal across this barrier coupled with habitat differences are primarily responsible for the patterns observed. The cryptic processes that drive macroorganism community divergence across this region may also play a role in the bigeographic patterns of microbiota. 
    more » « less