skip to main content


Title: Modular machine learning for Alzheimer's disease classification from retinal vasculature
Abstract Alzheimer's disease is the leading cause of dementia. The long progression period in Alzheimer's disease provides a possibility for patients to get early treatment by having routine screenings. However, current clinical diagnostic imaging tools do not meet the specific requirements for screening procedures due to high cost and limited availability. In this work, we took the initiative to evaluate the retina, especially the retinal vasculature, as an alternative for conducting screenings for dementia patients caused by Alzheimer's disease. Highly modular machine learning techniques were employed throughout the whole pipeline. Utilizing data from the UK Biobank, the pipeline achieved an average classification accuracy of 82.44%. Besides the high classification accuracy, we also added a saliency analysis to strengthen this pipeline's interpretability. The saliency analysis indicated that within retinal images, small vessels carry more information for diagnosing Alzheimer's diseases, which aligns with related studies.  more » « less
Award ID(s):
1908299
NSF-PAR ID:
10296297
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Scientific Reports
Volume:
11
Issue:
1
ISSN:
2045-2322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Introduction: Alzheimer’s disease (AD) causes progressive irreversible cognitive decline and is the leading cause of dementia. Therefore, a timely diagnosis is imperative to maximize neurological preservation. However, current treatments are either too costly or limited in availability. In this project, we explored using retinal vasculature as a potential biomarker for early AD diagnosis. This project focuses on stage 3 of a three-stage modular machine learning pipeline which consisted of image quality selection, vessel map generation, and classification [1]. The previous model only used support vector machine (SVM) to classify AD labels which limited its accuracy to 82%. In this project, random forest and gradient boosting were added and, along with SVM, combined into an ensemble classifier, raising the classification accuracy to 89%. Materials and Methods: Subjects classified as AD were those who were diagnosed with dementia in “Dementia Outcome: Alzheimer’s disease” from the UK Biobank Electronic Health Records. Five control groups were chosen with a 5:1 ratio of control to AD patients where the control patients had the same age, gender, and eye side image as the AD patient. In total, 122 vessel images from each group (AD and control) were used. The vessel maps were then segmented from fundus images through U-net. A t-test feature selection was first done on the training folds and the selected features was fed into the classifiers with a p-value threshold of 0.01. Next, 20 repetitions of 5-fold cross validation were performed where the hyperparameters were solely tuned on the training data. An ensemble classifier consisting of SVM, gradient boosting tree, and random forests was built and the final prediction was made through majority voting and evaluated on the test set. Results and Discussion: Through ensemble classification, accuracy increased by 4-12% relative to the individual classifiers, precision by 9-15%, sensitivity by 2-9%, specificity by at least 9-16%, and F1 score by 712%. Conclusions: Overall, a relatively high classification accuracy was achieved using machine learning ensemble classification with SVM, random forest, and gradient boosting. Although the results are very promising, a limitation of this study is that the requirement of needing images of sufficient quality decreased the amount of control parameters that can be implemented. However, through retinal vasculature analysis, this project shows machine learning’s high potential to be an efficient, more cost-effective alternative to diagnosing Alzheimer’s disease. Clinical Application: Using machine learning for AD diagnosis through retinal images will make screening available for a broader population by being more accessible and cost-efficient. Mobile device based screening can also be enabled at primary screening in resource-deprived regions. It can provide a pathway for future understanding of the association between biomarkers in the eye and brain. 
    more » « less
  2. Abstract

    Alzheimer's disease (AD) is the most common type of dementia in the elderly with no effective treatment currently. Recent studies of noninvasive neuroimaging, resting‐state functional magnetic resonance imaging (rs‐fMRI) with graph theoretical analysis have shown that patients with AD and mild cognitive impairment (MCI) exhibit disrupted topological organization in large‐scale brain networks. In previous work, it is a common practice to threshold such networks. However, it is not only difficult to make a principled choice of threshold values, but also worse is the discard of potential important information. To address this issue, we propose a threshold‐free feature by integrating a prior persistent homology‐based topological feature (the zeroth Betti number) and a newly defined connected component aggregation cost feature to model brain networks over all possible scales. We show that the induced topological feature (Integrated Persistent Feature) follows a monotonically decreasing convergence function and further propose to use its slope as a concise and persistent brain network topological measure. We apply this measure to study rs‐fMRI data from the Alzheimer's Disease Neuroimaging Initiative and compare our approach with five other widely used graph measures across five parcellation schemes ranging from 90 to 1,024 region‐of‐interests. The experimental results demonstrate that the proposed network measure shows more statistical power and stronger robustness in group difference studies in that the absolute values of the proposed measure of AD are lower than MCI and much lower than normal controls, providing empirical evidence for decreased functional integration in AD dementia and MCI.

     
    more » « less
  3. Purpose

    The aim of this study is to identify the potential cerebrospinal fluid (CSF) biomarkers for Alzheimer's disease and to evaluate these markers on independent CSF samples using parallel reaction monitoring (PRM) assays.

    Experimental Design

    High‐Resolution mass spectrometry and tandem mass tag (TMT) multiplexing technology are employed to identify potential biomarkers for Alzheimer's disease. Some of the identified potential biomarkers are validated using PRM assays.

    Results

    A total of 2327 proteins are identified in the CSF of which 139 are observed to be significantly altered in the CSF of AD patients. The proteins altered in AD includes a number of known AD marker such as MAPT, NPTX2, VGF, GFAP, and NCAM1 as well as novel biomarkers such as PKM and YWHAG. These findings are validated in a separate set of CSF specimens from AD dementia patients and controls. NPTX2, in combination with PKM or YWHAG, leads to the best results with AUCs of 0.935 and 0.933, respectively.

    Conclusions and Clinical Relevance

    The proteins that are found to be altered in the CSF of patients with AD could be used for monitoring disease progression and therapeutic response and perhaps also for early detection once they are validated in larger studies.

     
    more » « less
  4. Abstract Background

    Alzheimer's disease (AD), the most prevalent form of dementia, affects 6.5 million Americans and over 50 million people globally. Clinical, genetic, and phenotypic studies of dementia provide some insights of the observed progressive neurodegenerative processes, however, the mechanisms underlying AD onset remain enigmatic.

    Aims

    This paper examines late‐onset dementia‐related cognitive impairment utilizing neuroimaging‐genetics biomarker associations.

    Materials and Methods

    The participants, ages 65–85, included 266 healthy controls (HC), 572 volunteers with mild cognitive impairment (MCI), and 188 Alzheimer's disease (AD) patients. Genotype dosage data for AD‐associated single nucleotide polymorphisms (SNPs) were extracted from the imputed ADNI genetics archive using sample‐major additive coding. Such 29 SNPs were selected, representing a subset of independent SNPs reported to be highly associated with AD in a recent AD meta‐GWAS study by Jansen and colleagues.

    Results

    We identified the significant correlations between the 29 genomic markers (GMs) and the 200 neuroimaging markers (NIMs). The odds ratios and relative risks for AD and MCI (relative to HC) were predicted using multinomial linear models.

    Discussion

    In the HC and MCI cohorts, mainly cortical thickness measures were associated with GMs, whereas the AD cohort exhibited different GM‐NIM relations. Network patterns within the HC and AD groups were distinct in cortical thickness, volume, and proportion of White to Gray Matter (pct), but not in the MCI cohort. Multinomial linear models of clinical diagnosis showed precisely the specific NIMs and GMs that were most impactful in discriminating between AD and HC, and between MCI and HC.

    Conclusion

    This study suggests that advanced analytics provide mechanisms for exploring the interrelations between morphometric indicators and GMs. The findings may facilitate further clinical investigations of phenotypic associations that support deep systematic understanding of AD pathogenesis.

     
    more » « less
  5. Abstract

    Early diagnosis of Alzheimer’s disease plays a pivotal role in patient care and clinical trials. In this study, we have developed a new approach based on 3D deep convolutional neural networks to accurately differentiate mild Alzheimer’s disease dementia from mild cognitive impairment and cognitively normal individuals using structural MRIs. For comparison, we have built a reference model based on the volumes and thickness of previously reported brain regions that are known to be implicated in disease progression. We validate both models on an internal held-out cohort from The Alzheimer's Disease Neuroimaging Initiative (ADNI) and on an external independent cohort from The National Alzheimer's Coordinating Center (NACC). The deep-learning model is accurate, achieved an area-under-the-curve (AUC) of 85.12 when distinguishing between cognitive normal subjects and subjects with either MCI or mild Alzheimer’s dementia. In the more challenging task of detecting MCI, it achieves an AUC of 62.45. It is also significantly faster than the volume/thickness model in which the volumes and thickness need to be extracted beforehand. The model can also be used to forecast progression: subjects with mild cognitive impairment misclassified as having mild Alzheimer’s disease dementia by the model were faster to progress to dementia over time. An analysis of the features learned by the proposed model shows that it relies on a wide range of regions associated with Alzheimer's disease. These findings suggest that deep neural networks can automatically learn to identify imaging biomarkers that are predictive of Alzheimer's disease, and leverage them to achieve accurate early detection of the disease.

     
    more » « less