

Title: Identifying progressive imaging genetic patterns via multi-task sparse canonical correlation analysis: a longitudinal study of the ADNI cohort
Abstract Motivation

Identifying the genetic basis of brain structure, function and disorder by using imaging quantitative traits (QTs) as endophenotypes is an important task in brain science. Brain QTs often change over time as a disorder progresses, so understanding how genetic factors influence these progressive QT changes is of great importance. Most existing imaging genetics methods analyze only baseline neuroimaging data, and thus omit the longitudinal imaging data across multiple time points, which contain important information about disease progression.

Results

We propose a novel temporal imaging genetic model that performs multi-task sparse canonical correlation analysis (T-MTSCCA). Our model uses longitudinal neuroimaging data to uncover how single nucleotide polymorphisms (SNPs) affect brain QTs over time. By incorporating the relationships among the longitudinal imaging data and those among SNPs, T-MTSCCA can identify a trajectory of progressive imaging genetic patterns over time. We propose an efficient algorithm to solve the problem and show its convergence. We evaluate T-MTSCCA on 408 subjects from the Alzheimer’s Disease Neuroimaging Initiative database for whom longitudinal magnetic resonance imaging data and genetic data are available. The experimental results show that T-MTSCCA performs better than or on par with state-of-the-art methods. In particular, T-MTSCCA identifies higher canonical correlation coefficients and captures clearer canonical weight patterns. This suggests that T-MTSCCA identifies time-consistent and time-dependent SNPs and imaging QTs, which further helps in understanding the genetic basis of brain QT changes over time during disease progression.
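
To make the idea of sparse canonical correlation analysis concrete, the following is a minimal sketch of a generic single-time-point sparse CCA. It is not the T-MTSCCA algorithm described above: it assumes a simple alternating soft-thresholding scheme on the cross-correlation matrix, and the function names (`sparse_cca`, `soft_threshold`), the penalty parameters (`lam_u`, `lam_v`) and the synthetic data are all illustrative assumptions rather than anything taken from the paper or its software.

```python
import numpy as np


def soft_threshold(a, lam):
    """Shrink each entry of `a` toward zero by `lam` (the L1 proximal step)."""
    return np.sign(a) * np.maximum(np.abs(a) - lam, 0.0)


def unit(a):
    """Scale `a` to unit Euclidean norm; an all-zero vector is returned as-is."""
    norm = np.linalg.norm(a)
    return a / norm if norm > 0 else a


def sparse_cca(X, Y, lam_u=0.1, lam_v=0.1, n_iter=100, tol=1e-8):
    """Generic sparse CCA sketch (assumption: NOT the T-MTSCCA method).

    X: (n, p) array, e.g. SNP features; Y: (n, q) array, e.g. imaging QTs.
    Columns are assumed to be non-constant. Returns the sparse canonical
    weights u (p,), v (q,) and the correlation between X @ u and Y @ v.
    """
    # Standardize columns so that C holds Pearson cross-correlations.
    X = (X - X.mean(axis=0)) / X.std(axis=0)
    Y = (Y - Y.mean(axis=0)) / Y.std(axis=0)
    C = X.T @ Y / X.shape[0]

    # Start from the leading right singular vector of C.
    v = np.linalg.svd(C, full_matrices=False)[2][0]
    for _ in range(n_iter):
        u = unit(soft_threshold(C @ v, lam_u))
        v_new = unit(soft_threshold(C.T @ u, lam_v))
        converged = np.linalg.norm(v_new - v) < tol
        v = v_new
        if converged:
            break

    corr = np.corrcoef(X @ u, Y @ v)[0, 1]
    return u, v, corr


if __name__ == "__main__":
    # Synthetic toy data: one latent factor drives 3 "SNPs" and 2 "QTs".
    rng = np.random.default_rng(0)
    z = rng.standard_normal(200)
    X = 0.5 * rng.standard_normal((200, 20))
    X[:, :3] += z[:, None]
    Y = 0.5 * rng.standard_normal((200, 15))
    Y[:, :2] += z[:, None]
    u, v, corr = sparse_cca(X, Y, lam_u=0.3, lam_v=0.3)
    print("nonzero SNP weights:", np.flatnonzero(u))
    print("nonzero QT weights:", np.flatnonzero(v))
    print("canonical correlation:", corr)
```

A longitudinal multi-task extension would fit one pair of weight vectors per time point and couple them through shared penalties; that coupling is specific to the published method and is deliberately left out of this sketch.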

Availability and implementation

The software and simulation data are publicly available at https://github.com/dulei323/TMTSCCA.

Supplementary information

Supplementary data are available at Bioinformatics online.

 
NSF-PAR ID:
10425974
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Bioinformatics
Volume:
35
Issue:
14
ISSN:
1367-4803
Page Range / eLocation ID:
p. i474-i483
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Brain imaging genetics is an important research topic in brain science, which combines genetic variations and brain structures or functions to uncover the genetic basis of brain disorders. Imaging data collected by different technologies, measuring the same brain distinctly, might carry complementary but different information. Unfortunately, we do not know the extent to which phenotypic variance is shared among multiple imaging modalities, which might trace back to the complex genetic mechanism. In this study, we propose a novel dirty multi-task SCCA to analyze imaging genetics problems with multiple modalities of brain imaging quantitative traits (QTs) involved. The proposed method can not only identify the shared SNPs and QTs across multiple modalities, but also identify the modality-specific SNPs and QTs, showing a flexible capability of discovering the complex multi-SNP-multi-QT associations. Compared with the multi-view SCCA and multi-task SCCA, our method shows better canonical correlation coefficients and canonical weights on both synthetic and real neuroimaging genetic data. This demonstrates that the proposed dirty multi-task SCCA could be a meaningful and powerful alternative method in multi-modal brain imaging genetics. 
  2. Abstract Background

    In Alzheimer’s Disease (AD) research, multimodal imaging analysis can unveil complementary information from multiple imaging modalities and further our understanding of the disease. One application is to discover disease subtypes using unsupervised clustering. However, existing clustering methods are often applied to input features directly, and could suffer from the curse of dimensionality with high-dimensional multimodal data. The purpose of our study is to identify multimodal imaging-driven subtypes in Mild Cognitive Impairment (MCI) participants using a multiview learning framework based on Deep Generalized Canonical Correlation Analysis (DGCCA), which learns a shared low-dimensional latent representation from 3 neuroimaging modalities.

    Results

    DGCCA applies non-linear transformations to the input views using neural networks and is able to learn correlated low-dimensional embeddings that capture more variance than its linear counterpart, generalized CCA (GCCA). We designed experiments to compare DGCCA embeddings with single-modality features and GCCA embeddings by generating 2 subtypes from each feature set using unsupervised clustering. In our validation studies, we found that amyloid PET imaging has the most discriminative features compared with structural MRI and FDG PET, and that DGCCA learns from these features whereas GCCA does not. DGCCA subtypes show differential measures in 5 cognitive assessments, 6 brain volume measures, and patterns of conversion to AD. In addition, DGCCA MCI subtypes confirmed AD genetic markers with strong signals that the existing late MCI group did not identify.

    Conclusion

    Overall, DGCCA is able to learn effective low-dimensional embeddings from multimodal data by learning non-linear projections. MCI subtypes generated from DGCCA embeddings differ from the existing early and late MCI groups and are most similar to those identified by amyloid PET features. In our validation studies, DGCCA subtypes show distinct patterns in cognitive measures and brain volumes, and are able to identify AD genetic markers. These findings indicate the promise of the imaging-driven subtypes and their power in revealing disease structures beyond early and late stage MCI.

     
  3. Abstract Motivation

    Brain imaging genetics aims to reveal genetic effects on brain phenotypes, where most studies examine phenotypes defined on anatomical or functional regions of interest (ROIs) given their biologically meaningful interpretation and modest dimensionality compared with voxelwise approaches. Typical ROI-level measures used in these studies are summary statistics from voxelwise measures in the region, without making full use of individual voxel signals.

    Results

    In this article, we propose a flexible and powerful framework for mining regional imaging genetic associations via voxelwise enrichment analysis, which embraces the collective effect of weak voxel-level signals and integrates brain anatomical annotation information. Our proposed method achieves three goals at the same time: (i) increasing the statistical power by substantially reducing the burden of multiple comparison correction; (ii) employing brain annotation information to enable biologically meaningful interpretation and (iii) making full use of fine-grained voxelwise signals. We demonstrate our method on an imaging genetic analysis using data from the Alzheimer’s Disease Neuroimaging Initiative, where we assess the collective regional genetic effects of voxelwise FDG-positron emission tomography measures between 116 ROIs and 565 373 single-nucleotide polymorphisms. Compared with traditional ROI-wise and voxelwise approaches, our method identified 2946 novel imaging genetic associations in addition to 33 that overlap with the two benchmark methods. In particular, two newly reported variants were further supported by transcriptome evidence from region-specific expression analysis. This demonstrates the promise of the proposed method as a flexible and powerful framework for exploring imaging genetic effects on the brain.

    Availability and implementation

    The R code and sample data are freely available at https://github.com/lshen/RIGEA.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
  4. ABSTRACT BACKGROUND AND PURPOSE

    Numerous sex‐specific differences in multiple sclerosis (MS) susceptibility, disease manifestation, disability progression, inflammation, and neurodegeneration have been previously reported. Previous magnetic resonance imaging (MRI) studies have shown structural differences between female and male MS brain volumes. This study aimed to determine sex‐specific global and tissue‐specific brain volumes throughout the MS life span in a large real‐world MRI database.

    METHODS

    A total of 2,199 MS patients (female/male ratio of 1,651/548) underwent structural MRI on either a 1.5‐T or 3‐T scanner. Global and tissue‐specific volumes of whole brain (WBV), white matter, and gray matter (GMV) were determined using Structural Image Evaluation using Normalisation of Atrophy Cross‐sectional (SIENAX). Lateral ventricular volume (LVV) was determined with the Neurological Software Tool for REliable Atrophy Measurement (NeuroSTREAM). General linear models investigated sex and age interactions, and post hoc comparative sex analyses were performed.

    RESULTS

    Despite being age‐matched with female MS patients, a greater proportion of male MS patients were diagnosed with progressive MS and had lower normalized WBV (P < .001), lower GMV (P < .001), and greater LVV (P < .001). In addition to significant stand‐alone main effects, an interaction between sex and age had an additional effect on the LVV (F‐statistic = 4.53, P = .033) and GMV (F‐statistic = 4.59, P = .032). The sex and age interaction was retained in both models of LVV (F‐statistic = 3.31, P = .069) and GMV (F‐statistic = 6.1, P = .003) when disease subtype and disease‐modifying treatment (DMT) were also included. Although male MS patients presented with significantly greater LVV and lower GMV during the early and midlife period when compared to their female counterparts (P < .001 for LVV and P < .019 for GMV), these differences were nullified in patients aged 60 years or older. Similar findings were seen within a subanalysis of MS patients who were not on any DMT at the time of enrollment.

    CONCLUSION

    There are sex‐specific differences in the LVV and GMV over the MS life span.

     
  5. Abstract Background

    Alzheimer's disease (AD), the most prevalent form of dementia, affects 6.5 million Americans and over 50 million people globally. Clinical, genetic, and phenotypic studies of dementia provide some insights into the observed progressive neurodegenerative processes; however, the mechanisms underlying AD onset remain enigmatic.

    Aims

    This paper examines late‐onset dementia‐related cognitive impairment utilizing neuroimaging‐genetics biomarker associations.

    Materials and Methods

    The participants, aged 65–85, included 266 healthy controls (HC), 572 volunteers with mild cognitive impairment (MCI), and 188 Alzheimer's disease (AD) patients. Genotype dosage data for AD‐associated single nucleotide polymorphisms (SNPs) were extracted from the imputed ADNI genetics archive using sample‐major additive coding. Twenty‐nine such SNPs were selected, representing a subset of independent SNPs reported to be highly associated with AD in a recent AD meta‐GWAS study by Jansen and colleagues.

    Results

    We identified significant correlations between the 29 genomic markers (GMs) and the 200 neuroimaging markers (NIMs). The odds ratios and relative risks for AD and MCI (relative to HC) were predicted using multinomial linear models.
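
For readers less familiar with the two effect measures named above, the following sketch computes an odds ratio and a relative risk from a plain 2x2 table using their textbook definitions. It is only an illustration of the quantities: the study estimated them from multinomial models, which this sketch does not reproduce, and the function name and the counts in the usage example are hypothetical, not data from the study.

```python
def odds_ratio_and_relative_risk(exposed_cases, exposed_noncases,
                                 unexposed_cases, unexposed_noncases):
    """Textbook 2x2-table effect measures (illustrative sketch only).

    "Exposed" could mean, for example, carrying a given allele, and a
    "case" could mean an AD diagnosis, with HC as the non-cases.
    All four counts are assumed to be positive.
    """
    # Risk = proportion of cases within each exposure group.
    risk_exposed = exposed_cases / (exposed_cases + exposed_noncases)
    risk_unexposed = unexposed_cases / (unexposed_cases + unexposed_noncases)
    relative_risk = risk_exposed / risk_unexposed

    # Odds = cases / non-cases within each exposure group.
    odds_exposed = exposed_cases / exposed_noncases
    odds_unexposed = unexposed_cases / unexposed_noncases
    odds_ratio = odds_exposed / odds_unexposed

    return odds_ratio, relative_risk


if __name__ == "__main__":
    # Hypothetical counts, chosen only to exercise the function.
    or_, rr = odds_ratio_and_relative_risk(30, 70, 10, 90)
    print(f"odds ratio = {or_:.3f}, relative risk = {rr:.3f}")
```

The two measures coincide only when the outcome is rare; for common outcomes the odds ratio lies further from 1 than the relative risk, which is one reason both are often reported.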

    Discussion

    In the HC and MCI cohorts, mainly cortical thickness measures were associated with GMs, whereas the AD cohort exhibited different GM‐NIM relations. Network patterns within the HC and AD groups were distinct in cortical thickness, volume, and proportion of White to Gray Matter (pct), but not in the MCI cohort. Multinomial linear models of clinical diagnosis showed precisely the specific NIMs and GMs that were most impactful in discriminating between AD and HC, and between MCI and HC.

    Conclusion

    This study suggests that advanced analytics provide mechanisms for exploring the interrelations between morphometric indicators and GMs. The findings may facilitate further clinical investigations of phenotypic associations that support deep systematic understanding of AD pathogenesis.

     