skip to main content

Title: Integrative-omics for discovery of network-level disease biomarkers: a case study in Alzheimer’s disease
Abstract A large number of genetic variations have been identified to be associated with Alzheimer’s disease (AD) and related quantitative traits. However, majority of existing studies focused on single types of omics data, lacking the power of generating a community including multi-omic markers and their functional connections. Because of this, the immense value of multi-omics data on AD has attracted much attention. Leveraging genomic, transcriptomic and proteomic data, and their backbone network through functional relations, we proposed a modularity-constrained logistic regression model to mine the association between disease status and a group of functionally connected multi-omic features, i.e. single-nucleotide polymorphisms (SNPs), genes and proteins. This new model was applied to the real data collected from the frontal cortex tissue in the Religious Orders Study and Memory and Aging Project cohort. Compared with other state-of-art methods, it provided overall the best prediction performance during cross-validation. This new method helped identify a group of densely connected SNPs, genes and proteins predictive of AD status. These SNPs are mostly expression quantitative trait loci in the frontal region. Brain-wide gene expression profile of these genes and proteins were highly correlated with the brain activation map of ‘vision’, a brain function partly controlled by frontal cortex. These genes and proteins were also found to be associated with the amyloid deposition, cortical volume and average thickness of frontal regions. Taken together, these results suggested a potential pathway underlying the development of AD from SNPs to gene expression, protein expression and ultimately brain functional and structural changes.  more » « less
Award ID(s):
1942394 1755836
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Briefings in Bioinformatics
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Background Large-scale genome-wide association studies have successfully identified many genetic variants significantly associated with Alzheimer’s disease (AD), such as rs429358, rs11038106, rs723804, rs13591776, and more. The next key step is to understand the function of these SNPs and the downstream biology through which they exert the effect on the development of AD. However, this remains a challenging task due to the tissue-specific nature of transcriptomic and proteomic data and the limited availability of brain tissue.In this paper, instead of using coupled transcriptomic data, we performed an integrative analysis of existing GWAS findings and expression quantitative trait loci (eQTL) results from AD-related brain regions to estimate the transcriptomic alterations in AD brain. Results We used summary-based mendelian randomization method along with heterogeneity in dependent instruments method and were able to identify 32 genes with potential altered levels in temporal cortex region. Among these, 10 of them were further validated using real gene expression data collected from temporal cortex region, and 19 SNPs from NECTIN and TOMM40 genes were found associated with multiple temporal cortex imaging phenotype. Conclusion Significant pathways from enriched gene networks included neutrophil degranulation, Cell surface interactions at the vascular wall, and Regulation of TP53 activity which are still relatively under explored in Alzheimer’s Disease while also encouraging a necessity to bind further trans-eQTL effects into this integrative analysis. 
    more » « less
  2. null (Ed.)
    Selective serotonin reuptake inhibitors (SSRIs) are a standard of care for the pharmacotherapy of patients suffering from Major Depressive Disorder (MDD). However, only one-half to two-thirds of MDD patients respond to SSRI therapy. Recently, a “multiple omics” research strategy was applied to identify genetic differences between patients who did and did not respond to SSRI therapy. As a first step, plasma metabolites were assayed using samples from the 803 patients in the PGRN-AMPS SSRI MDD trial. The metabolomics data were then used to “inform” genomics by performing a genome-wide association study (GWAS) for plasma concentrations of the metabolite most highly associated with clinical response, serotonin (5-HT). Two genome-wide or near genome-wide significant single nucleotide polymorphism (SNP) signals were identified, one that mapped near the TSPAN5 gene and another across the ERICH3 gene, both genes that are highly expressed in the brain. Knocking down TSPAN5 and ERICH3 resulted in decreased 5-HT concentrations in neuroblastoma cell culture media and decreased expression of enzymes involved in 5-HT biosynthesis and metabolism. Functional genomic studies demonstrated that ERICH3 was involved in clathrin-mediated vesicle formation and TSPAN5 was an ethanol-responsive gene that may be a marker for response to acamprosate pharmacotherapy of alcohol use disorder (AUD), a neuropsychiatric disorder highly co-morbid with MDD. In parallel studies, kynurenine was the plasma metabolite most highly associated with MDD symptom severity and application of a metabolomics-informed pharmacogenomics approach identified DEFB1 and AHR as genes associated with variation in plasma kynurenine levels. Both genes also contributed to kynurenine-related inflammatory pathways. Finally, a multiply replicated predictive algorithm for SSRI clinical response with a balanced predictive accuracy of 76% (compared with 56% for clinical data alone) was developed by including the SNPs in TSPAN5 , ERICH3 , DEFB1 and AHR . In summary, application of a multiple omics research strategy that used metabolomics to inform genomics, followed by functional genomic studies, identified novel genes that influenced monoamine biology and made it possible to develop a predictive algorithm for SSRI clinical outcomes in MDD. A similar pharmaco-omic research strategy might be broadly applicable for the study of other neuropsychiatric diseases and their drug therapy. 
    more » « less
  3. Abstract Background

    Uncovering the functional relevance underlying verbal declarative memory (VDM) genome-wide association study (GWAS) results may facilitate the development of interventions to reduce age-related memory decline and dementia.


    We performed multi-omics and pathway enrichment analyses of paragraph (PAR-dr) and word list (WL-dr) delayed recall GWAS from 29,076 older non-demented individuals of European descent. We assessed the relationship between single-variant associations and expression quantitative trait loci (eQTLs) in 44 tissues and methylation quantitative trait loci (meQTLs) in the hippocampus. We determined the relationship between gene associations and transcript levels in 53 tissues, annotation as immune genes, and regulation by transcription factors (TFs) and microRNAs. To identify significant pathways, gene set enrichment was tested in each cohort and meta-analyzed across cohorts. Analyses of differential expression in brain tissues were conducted for pathway component genes.


    The single-variant associations of VDM showed significant linkage disequilibrium (LD) with eQTLs across all tissues and meQTLs within the hippocampus. Stronger WL-dr gene associations correlated with reduced expression in four brain tissues, including the hippocampus. More robust PAR-dr and/or WL-dr gene associations were intricately linked with immunity and were influenced by 31 TFs and 2 microRNAs. Six pathways, including type I diabetes, exhibited significant associations with both PAR-dr and WL-dr. These pathways included fifteen MHC genes intricately linked to VDM performance, showing diverse expression patterns based on cognitive status in brain tissues.


    VDM genetic associations influence expression regulation via eQTLs and meQTLs. The involvement of TFs, microRNAs, MHC genes, and immune-related pathways contributes to VDM performance in older individuals.

    more » « less
  4. null (Ed.)
    Recent evidence increasingly associates network disruption in brain organization with multiple neurodegenerative diseases, including amyotrophic lateral sclerosis (ALS), a rare terminal disease. However, the comparability of brain network characteristics across different studies remains a challenge for conventional graph theoretical methods. One suggested method to address this issue is minimum spanning tree (MST) analysis, which provides a less biased comparison. Here, we assessed the novel application of MST network analysis to hemodynamic responses recorded by functional near-infrared spectroscopy (fNIRS) neuroimaging modality, during an activity-based paradigm to investigate hypothetical disruptions in frontal functional brain network topology as a marker of the executive dysfunction, one of the most prevalent cognitive deficit reported across ALS studies. We analyzed data recorded from nine participants with ALS and ten age-matched healthy controls by first estimating functional connectivity, using phase-locking value (PLV) analysis, and then constructing the corresponding individual and group MSTs. Our results showed significant between-group differences in several MST topological properties, including leaf fraction, maximum degree, diameter, eccentricity, and degree divergence. We further observed a global shift toward more centralized frontal network organizations in the ALS group, interpreted as a more random or dysregulated network in this cohort. Moreover, the similarity analysis demonstrated marginally significantly increased overlap in the individual MSTs from the control group, implying a reference network with lower topological variation in the healthy cohort. Our nodal analysis characterized the main local hubs in healthy controls as distributed more evenly over the frontal cortex, with slightly higher occurrence in the left prefrontal cortex (PFC), while in the ALS group, the most frequent hubs were asymmetrical, observed primarily in the right prefrontal cortex. Furthermore, it was demonstrated that the global PLV (gPLV) synchronization metric is associated with disease progression, and a few topological properties, including leaf fraction and tree hierarchy, are linked to disease duration. These results suggest that dysregulation, centralization, and asymmetry of the hemodynamic-based frontal functional network during activity are potential neuro-topological markers of ALS pathogenesis. Our findings can possibly support new bedside assessments of the functional status of ALS’ brain network and could hypothetically extend to applications in other neurodegenerative diseases. 
    more » « less
  5. Latent Interacting Variable Effects (LIVE) modeling is a framework to integrate different types of microbiome multi-omics data by combining latent variables from single-omic models into a structured meta-model to determine discriminative, interacting multi-omics features driving disease status. We implemented and tested LIVE modeling in publicly available metagenomics and metabolomics datasets from Crohn’s Disease and Ulcerative Colitis patients. Here, LIVE modeling reduced the number of feature correlations from the original data set for CD and UC to tractable numbers and facilitated prioritization of biological associations between microbes, metabolites, enzymes and IBD status through the application of stringent thresholds on generated inferential statistics. We determined LIVE modeling confirmed previously reported IBD biomarkers and uncovered potentially novel disease mechanisms in IBD. LIVE modeling makes a distinct and complementary contribution to the current methods to integrate microbiome data to predict IBD status because of its flexibility to adapt to different types of microbiome multi-omics data, scalability for large and small cohort studies via reliance on latent variables and dimensionality reduction, and the intuitive interpretability of the linear meta-model integrating -omic data types. The results of LIVE modeling and the biological relationships can be represented in networks that connect local correlation structure of single omic data types with global community and omic structure in the latent variable VIP scores. This model arises as novel tool that allows researchers to be more selective about omic feature interaction without disrupting the structural correlation framework provided by sPLS-DA interaction effects modeling. It will lead to form testable hypothesis by identifying potential and unique interactions between metabolome and microbiome that must be considered for future studies. 
    more » « less