skip to main content


Title: Learning Deeply Enriched Representations of Longitudinal Imaging-Genetic Data to Predict Alzheimer’s Disease Progression
Alzheimer’s Disease (AD) is a progressive memory disorder that causes irreversible cognitive declines, therefore early diagnosis is imperative to prevent the progression of AD. To this end, many biomarker analysis models have been presented for early AD detection. However, these models may not realize the full data potential due to their failure to integrate longitudinal (dynamic) phenotypic data with (static) genetic data. Sometimes, they may not fully utilize both labeled and unlabeled samples either. To overcome these limitations, we propose a semi-supervised enrichment learning method to learn a fixed-length vectorial representation for each participant, by which the static data record can be integrated with the dynamic data records. We have applied our new method on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort and achieved 75% accuracy on multiclass AD progression prediction by one year in advance.  more » « less
Award ID(s):
1652943 1849359
NSF-PAR ID:
10316104
Author(s) / Creator(s):
;
Date Published:
Journal Name:
2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Alzheimer’s Disease (AD) is a chronic neurodegenerative disease that causes severe problems in patients’ thinking, memory, and behavior. An early diagnosis is crucial to prevent AD progression; to this end, many algorithmic approaches have recently been proposed to predict cognitive decline. However, these predictive models often fail to integrate heterogeneous genetic and neuroimaging biomarkers and struggle to handle missing data. In this work we propose a novel objective function and an associated optimization algorithm to identify cognitive decline related to AD. Our approach is designed to incorporate dynamic neuroimaging data by way of a participant-specific augmentation combined with multimodal data integration aligned via a regression task. Our approach, in order to incorporate additional side-information, utilizes structured regularization techniques popularized in recent AD literature. Armed with the fixed-length vector representation learned from the multimodal dynamic and static modalities, conventional machine learning methods can be used to predict the clinical outcomes associated with AD. Our experimental results show that the proposed augmentation model improves the prediction performance on cognitive assessment scores for a collection of popular machine learning algorithms. The results of our approach are interpreted to validate existing genetic and neuroimaging biomarkers that have been shown to be predictive of cognitive decline. 
    more » « less
  2. Abstract

    Alzheimer’s disease (AD) manifested before age 65 is commonly referred to as early-onset AD (EOAD) (Reitz et al. Neurol Genet. 2020;6:e512). While the majority (> 90%) of EOAD cases are not caused by autosomal-dominant mutations inPSEN1,PSEN2, andAPP, they do have a higher heritability (92–100%) than sporadic late-onset AD (LOAD, 70%) (Wingo et al. Arch Neurol. 2012;69:59–64, Fulton-Howard et al. Neurobiol Aging. 2021;99:101.e1–101.e9). Although the endpoint clinicopathological changes, i.e., Aβ plaques, tau tangles, and cognitive decline, are common across EOAD and LOAD, the disease progression is highly heterogeneous (Neff et al. Sci Adv Am Assoc Adv Sci. 2021;7:eabb5398). This heterogeneity, leading to temporally distinct age at onset (AAO) and stages of cognitive decline, may be caused by myriad combinations of distinct disease-associated molecular mechanisms. We and others have used transcriptome profiling in AD patient-derived neuron models of autosomal-dominant EOAD and sporadic LOAD to identify disease endotypes (Caldwell et al. Sci Adv Am Assoc Adv Sci. 2020;6:eaba5933, Mertens et al. Cell Stem Cell. 2021;28:1533–1548.e6, Caldwell et al. Alzheimers Demen. 2022). Further, analyses of large postmortem brain cohorts demonstrate that only one-third of AD patients show hallmark disease endotypes like increased inflammation and decreased synaptic signaling (Neff et al. Sci Adv Am Assoc Adv Sci. 2021;7:eabb5398). Areas of the brain less affected by AD pathology at early disease stagessuch as the primary visual cortexexhibit similar transcriptomic dysregulation as those regions traditionally affected and, therefore, may offer a view into the molecular mechanisms of AD without the associated inflammatory changes and gliosis induced by pathology (Haroutunian et al. Neurobiol Aging. 2009;30:561–73). To this end, we analyzed AD patient samples from the primary visual cortex (19 EOAD, 20 LOAD) using transcriptomic signatures to identify patient clusters and disease endotypes. Interestingly, although the clusters showed distinct combinations and severity of endotypes, each patient cluster contained both EOAD and LOAD cases, suggesting that AAO may not directly correlate with the identity and severity of AD endotypes.

     
    more » « less
  3. Abstract

    Alzheimer’s Disease (AD) is a progressive neurodegenerative disease and the leading cause of dementia. Early diagnosis is critical for patients to benefit from potential intervention and treatment. The retina has emerged as a plausible diagnostic site for AD detection owing to its anatomical connection with the brain. However, existing AI models for this purpose have yet to provide a rational explanation behind their decisions and have not been able to infer the stage of the disease’s progression. Along this direction, we propose a novel model-agnostic explainable-AI framework, called Granu$$\underline{la}$$la̲r Neuron-le$$\underline{v}$$v̲el Expl$$\underline{a}$$a̲iner (LAVA), an interpretation prototype that probes into intermediate layers of the Convolutional Neural Network (CNN) models to directly assess the continuum of AD from the retinal imaging without the need for longitudinal or clinical evaluations. This innovative approach aims to validate retinal vasculature as a biomarker and diagnostic modality for evaluating Alzheimer’s Disease. Leveraged UK Biobank cognitive tests and vascular morphological features demonstrate significant promise and effectiveness of LAVA in identifying AD stages across the progression continuum.

     
    more » « less
  4. Thung, Kim Han (Ed.)
    Alzheimer’s disease (AD) is a neurodegenerative condition that progresses over decades. Early detection of individuals at high risk of future progression toward AD is likely to be of critical significance for the successful treatment and/or prevention of this devastating disease. In this paper, we present an empirical study to characterize how predictable an individual subjects’ future AD trajectory is, several years in advance, based on rich multi-modal data, and using modern deep learning methods. Crucially, the machine learning strategy we propose can handle different future time horizons and can be trained with heterogeneous data that exhibit missingness and non-uniform follow-up visit times. Our experiments demonstrate that our strategy yields predictions that are more accurate than a model trained on a single time horizon (e.g. 3 years), which is common practice in prior literature. We also provide a comparison between linear and nonlinear models, verifying the well-established insight that the latter can offer a boost in performance. Our results also confirm that predicting future decline for cognitively normal (CN) individuals is more challenging than for individuals with mild cognitive impairment (MCI). Intriguingly, however, we discover that prediction accuracy decreases with increasing time horizon for CN subjects, but the trend is in the opposite direction for MCI subjects. Additionally, we quantify the contribution of different data types in prediction, which yields novel insights into the utility of different biomarkers. We find that molecular biomarkers are not as helpful for CN individuals as they are for MCI individuals, whereas magnetic resonance imaging biomarkers (hippocampus volume, specifically) offer a significant boost in prediction accuracy for CN individuals. Finally, we show how our model’s prediction reveals the evolution of individual-level progression risk over a five-year time horizon. Our code is available at https://github.com/batuhankmkaraman/mlbasedad . 
    more » « less
  5. Early detection of Alzheimer’s disease (AD) during the Mild Cognitive Impairment (MCI) stage could enable effective intervention to slow down disease progression. Computer-aided diagnosis of AD relies on a sufficient amount of biomarker data. When this requirement is not fulfilled, transfer learning can be used to transfer knowledge from a source domain with more amount of labeled data than available in the desired target domain. In this study, an instance-based transfer learning framework is presented based on the gradient boosting machine (GBM). In GBM, a sequence of base learners is built, and each learner focuses on the errors (residuals) of the previous learner. In our transfer learning version of GBM (TrGB), a weighting mechanism based on the residuals of the base learners is defined for the source instances. Consequently, instances with different distribution than the target data will have a lower impact on the target learner. The proposed weighting scheme aims to transfer as much information as possible from the source domain while avoiding negative transfer. The target data in this study was obtained from the Mount Sinai dataset which is collected and processed in a collaborative 5-year project at the Mount Sinai Medical Center. The Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset was used as the source domain. The experimental results showed that the proposed TrGB algorithm could improve the classification accuracy by 1.5 and 4.5% for CN vs. MCI and multiclass classification, respectively, as compared to the conventional methods. Also, using the TrGB model and transferred knowledge from the CN vs. AD classification of the source domain, the average score of early MCI vs. late MCI classification improved by 5%. 
    more » « less