skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Melody: meta-analysis of microbiome association studies for discovering generalizable microbial signatures
Abstract Standard protocols for meta-analysis of association studies are inadequate for microbiome data due to their complex compositional structure, leading to inaccurate and unstable microbial signature selection. To address this issue, we introduce Melody, a framework that generates, harmonizes, and combines study-specific summary association statistics to powerfully and robustly identify microbial signatures in meta-analysis. Comprehensive and realistic simulations demonstrate that Melody substantially outperforms existing approaches in prioritizing true signatures. In the meta-analyses of five studies on colorectal cancer and eight studies on the gut metabolome, we showcase the superior stability, reliability, and predictive performance of Melody-identified signatures.  more » « less
Award ID(s):
2054346
PAR ID:
10628853
Author(s) / Creator(s):
; ;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
Genome Biology
Volume:
26
Issue:
1
ISSN:
1474-760X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Immune checkpoint inhibitors (ICIs) have revolutionized melanoma treatment, yet patient responses remain highly variable, underscoring the need for predictive biomarkers. Emerging evidence suggests that gut microbiome composition influences ICI efficacy, though findings remain inconsistent across studies. Here, we present a meta-analysis of seven melanoma-associated microbiome cohorts (N=678) using a standardized computational pipeline to integrate microbial species, biosynthetic gene clusters (BGCs), and functional pathways. We identifyFaecalibacteriumSGB15346 as a key species enriched in responders, alongside RiPP biosynthetic class and pathways involved in short-chain fatty acid fermentation. Conversely, dTDP-sugar biosynthesis correlates with non-response. Our results highlight microbial signatures and metabolic pathways associated with ICI outcomes, offering potential targets for microbiome-based interventions in personalized immunotherapy. 
    more » « less
  2. Abstract Data integration is a powerful tool for facilitating a comprehensive and generalizable understanding of microbial communities and their association with outcomes of interest. However, integrating data sets from different studies remains a challenging problem because of severe batch effects, unobserved confounding variables, and high heterogeneity across data sets. We propose a new data integration method called MetaDICT, which initially estimates the batch effects by weighting methods in causal inference literature and then refines the estimation via a novel shared dictionary learning. Compared with existing methods, MetaDICT can better avoid the overcorrection of batch effects and preserve biological variation when there exist unobserved confounding variables or data sets are highly heterogeneous across studies. Furthermore, MetaDICT can generate comparable embedding at both taxa and sample levels that can be used to unravel the hidden structure of the integrated data and improve the integrative analysis. Applications to synthetic and real microbiome data sets demonstrate the robustness and effectiveness of MetaDICT in integrative analysis. Using MetaDICT, we characterize microbial interaction, identify generalizable microbial signatures, and enhance the accuracy of disease prediction in an integrative analysis of colorectal cancer metagenomics studies. 
    more » « less
  3. Abstract BackgroundExposure to aminoglycosides, a class of potent bactericidal antibiotics naturally produced by soil microorganisms and commonly used in agriculture, has the potential to cause shifts in the population dynamics of microorganisms that impact plant and soil health. In particular, aminoglycoside exposure could result in alterations of the soil and plant root-associated bacterial species diversity and richness due to their potent inhibitory action on microbial growth, the creation of selective conditions for the proliferation of antibiotic-resistant bacteria, or a reduction in the ability to suppress soil pathogens. Previous studies have attempted to understand the relationship between aminoglycoside exposure and the plant-associated microbiota with varying results. Thus, this systematic review aims to survey all relevant published data to answer the question, “What is the impact of aminoglycoside exposure on the soil and plant root-associated microbiota?” MethodsWe searched 5 academic databases and 1 specialist organization database for scientific journal publications written in any language. Articles were included based on the criteria described in Coates et al., 2022. Included studies were subject to critical appraisal using the CEE Critical Appraisal Tool Version 0.2 (Prototype) to evaluate their susceptibility to confounding factors, misclassification bias, selection bias, attrition bias, reporting bias and analysis bias. Studies deemed to be high risk based on critical appraisal results were excluded from further analysis. Descriptive data analysis was performed for studies considered low or unclear for risk of bias. Meta-analyses were conducted for antibiotic resistance and microbial diversity. Review findingsOut of 8370 screened records, 50 articles fulfilled the search criteria, and from these, 13 studies were included in meta-analysis. Most studies investigated the impact of aminoglycoside exposure on soil microbiota (93%) in a laboratory setting (62%), primarily from the United States (32%), China (24%), France, Switzerland and Germany (8%). A limited number of studies investigated the impact of aminoglycoside exposure on disease suppression, so it was excluded from meta-analysis. Therefore, our synthesis primarily details the impact of aminoglycoside exposure on the microbial diversity and antibiotic resistance of the soil microbiota. Overall, exposure to aminoglycosides did not result in a significant change in the microbial diversity. However, soil use, pH, and type of aminoglycoside used could be potential modifiers. Additionally, we observed an average 7% of the microbial population exhibiting resistance to aminoglycosides, with the relationship between the exposure concentration and the selection concentration emerging as a potential modifier. ConclusionsCurrent research is limited by gaps in understanding the relationship between aminoglycoside exposure, microbial community dynamics, and disease suppression, as well as by insufficient data on less-studied aminoglycosides and key confounding factors. Current research also suggests a potential relationship between antibiotic concentrations used for exposure and selection of resistant bacteria. These findings emphasize the need for informed antibiotic management policies and rigorous, targeted research to better understand the relationship between soil factors and antibiotic concentrations used on the impact of aminoglycosides on soil microbiota. 
    more » « less
  4. Abstract MotivationMany variants identified by genome-wide association studies (GWAS) have been found to affect multiple traits, either directly or through shared pathways. There is currently a wealth of GWAS data collected in numerous phenotypes, and analyzing multiple traits at once can increase power to detect shared variant effects. However, traditional meta-analysis methods are not suitable for combining studies on different traits. When applied to dissimilar studies, these meta-analysis methods can be underpowered compared to univariate analysis. The degree to which traits share variant effects is often not known, and the vast majority of GWAS meta-analysis only consider one trait at a time. ResultsHere, we present a flexible method for finding associated variants from GWAS summary statistics for multiple traits. Our method estimates the degree of shared effects between traits from the data. Using simulations, we show that our method properly controls the false positive rate and increases power when an effect is present in a subset of traits. We then apply our method to the North Finland Birth Cohort and UK Biobank datasets using a variety of metabolic traits and discover novel loci. Availability and implementationOur source code is available at https://github.com/lgai/CONFIT. Supplementary informationSupplementary data are available at Bioinformatics online. 
    more » « less
  5. Abstract BackgroundAlthough genome-wide association studies (GWAS) have successfully located various genetic variants susceptible to Alzheimer’s Disease (AD), it is still unclear how specific variants interact with genes and tissues to elucidate pathologies associated with AD. Summary-data-based Mendelian Randomization (SMR) addresses this problem through an instrumental variable approach that integrates data from independent GWAS and expression quantitative trait locus (eQTL) studies in order to infer a causal effect of gene expression on a trait. ResultsOur study employed the SMR approach to integrate a set of meta-analytic cis-eQTL information from the Genotype-Tissue Expression (GTEx), CommonMind Consortium (CMC), and Religious Orders Study and Rush Memory and Aging Project (ROS/MAP) consortiums with three sets of meta-analysis AD GWAS results. ConclusionsOur analysis identified twelve total gene probes (associated with twelve distinct genes) with a significant association with AD. Four of these genes survived a test of pleiotropy from linkage (the HEIDI test).Three of these genes – RP11-385F7.1, PRSS36, and AC012146.7 – have not yet been reported differentially expressed in the brain in the context of AD, and thus are the novel findings warranting further investigation. 
    more » « less