skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Improving clinical disease subtyping and future events prediction through a chest CT-based deep learning approach
Purpose: To develop and evaluate a deep learning (DL) approach to extract rich information from high-resolution computed tomography (HRCT) of patients with chronic obstructive pulmonary disease (COPD). Methods: We develop a DL-based model to learn a compact representation of a subject, which is predictive of COPD physiologic severity and other outcomes. Our DL model learned: (a) to extract informative regional image features from HRCT; (b) to adaptively weight these features and form an aggregate patient representation; and finally, (c) to predict several COPD outcomes. The adaptive weights correspond to the regional lung contribution to the disease. We evaluate the model on 10 300 participants from the COPDGene cohort. Results: Our model was strongly predictive of spirometric obstruction ( r2 = 0.67) and grouped 65.4% of subjects correctly and 89.1% within one stage of their GOLD severity stage. Our model achieved an accuracy of 41.7% and 52.8% in stratifying the population-based on centrilobular (5-grade) and paraseptal (3-grade) emphysema severity score, respectively. For predicting future exacerbation, combining subjects' representations from our model with their past exacerbation histories achieved an accuracy of 80.8% (area under the ROC curve of 0.73). For all-cause mortality, in Cox regression analysis, we outperformed the BODE index improving the concordance metric (ours: 0.61 vs BODE: 0.56). Conclusions: Our model independently predicted spirometric obstruction, emphysema severity, exacerbation risk, and mortality from CT imaging alone. This method has potential applicability in both research and clinical practice.  more » « less
Award ID(s):
1839332
PAR ID:
10299291
Author(s) / Creator(s):
Date Published:
Journal Name:
Medical physics
Issue:
3
ISSN:
0094-2405
Page Range / eLocation ID:
1168-1181
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. First- and second-hand exposure to smoke or air pollutants is the primary cause of chronic obstructive pulmonary disease (COPD) pathogenesis, where genetic and age-related factors predispose the subject to the initiation and progression of obstructive lung disease. Briefly, airway inflammation, specifically bronchitis, initiates the lung disease, leading to difficulty in breathing (dyspnea) and coughing as initial symptoms, followed by air trapping and inhibition of the flow of air into the lungs due to damage to the alveoli (emphysema). In addition, mucus obstruction and impaired lung clearance mechanisms lead to recurring acute exacerbations causing progressive decline in lung function, eventually requiring lung transplant and other lifesaving interventions to prevent mortality. It is noteworthy that COPD is much more common in the population than currently diagnosed, as only 16 million adult Americans were reported to be diagnosed with COPD as of 2018, although an additional 14 million American adults were estimated to be suffering from COPD but undiagnosed by the current standard of care (SOC) diagnostic, namely the spirometry-based pulmonary function test (PFT). Thus, the main issue driving the adverse disease outcome and significant mortality for COPD is lack of timely diagnosis in the early stages of the disease. The current treatment regime for COPD emphysema is most effective when implemented early, on COPD onset, where alleviating symptoms and exacerbations with timely intervention(s) can prevent steep lung function decline(s) and disease progression to severe emphysema. Therefore, the key to efficiently combatting COPD relies on early detection. Thus, it is important to detect early regional pulmonary function and structural changes to monitor modest disease progression for implementing timely interventions and effectively eliminating emphysema progression. Currently, COPD diagnosis involves using techniques such as COPD screening questionnaires, PFT, arterial blood gas analysis, and/or lung imaging, but these modalities are limited in their capability for early diagnosis and real-time disease monitoring of regional lung function changes. Hence, promising emerging techniques, such as X-ray phase contrast, photoacoustic tomography, ultrasound computed tomography, electrical impedance tomography, the forced oscillation technique, and the impulse oscillometry system powered by robust artificial intelligence and machine learning analysis capability are emerging as novel solutions for early detection and real time monitoring of COPD progression for timely intervention. We discuss here the scope, risks, and limitations of current SOC and emerging COPD diagnostics, with perspective on novel diagnostics providing real time regional lung function monitoring, and predicting exacerbation and/or disease onset for prognosis-based timely intervention(s) to limit COPD–emphysema progression. 
    more » « less
  2. null (Ed.)
    Probabilistic topic models, have been widely deployed for various applications such as learning disease or tissue subtypes. Yet, learning the parameters of such models is usually an ill-posed problem and may result in losing valuable information about disease severity. A common approach is to add a discriminative loss term to the generative model’s loss in order to learn a representation that is also predictive of disease severity. However, finding a balance between these two losses is not straightforward. We propose an alternative way in this paper. We develop a framework which allows for incorporating external covariates into the generative model’s approximate posterior. These covariates can have more discriminative power for disease severity compared to the representation that we extract from the posterior distribution. For instance, they can be features extracted from a neural network which predicts disease severity from CT images. Effectively, we enforce the generative model’s approximate posterior to reside in the subspace of these discriminative covariates. We illustrate our method’s application on a large-scale lung CT study of Chronic Obstructive Pulmonary Disease (COPD), a highly heterogeneous disease. We aim at identifying tissue subtypes by using a variant of topic model as a generative model. We quantitatively evaluate the predictive performance of the inferred subtypes and demonstrate that our method outperforms or performs on par with some reasonable baselines. We also show that some of the discovered subtypes are correlated with genetic measurements, suggesting that the identified subtypes may characterize the disease’s underlying etiology. 
    more » « less
  3. null (Ed.)
    Summary In this article, we develop a graphical modeling framework for the inference of networks across multiple sample groups and data types. In medical studies, this setting arises whenever a set of subjects, which may be heterogeneous due to differing disease stage or subtype, is profiled across multiple platforms, such as metabolomics, proteomics, or transcriptomics data. Our proposed Bayesian hierarchical model first links the network structures within each platform using a Markov random field prior to relate edge selection across sample groups, and then links the network similarity parameters across platforms. This enables joint estimation in a flexible manner, as we make no assumptions on the directionality of influence across the data types or the extent of network similarity across the sample groups and platforms. In addition, our model formulation allows the number of variables and number of subjects to differ across the data types, and only requires that we have data for the same set of groups. We illustrate the proposed approach through both simulation studies and an application to gene expression levels and metabolite abundances on subjects with varying severity levels of chronic obstructive pulmonary disease. Bayesian inference; Chronic obstructive pulmonary disease (COPD); Data integration; Gaussian graphical model; Markov random field prior; Spike and slab prior. 
    more » « less
  4. Matrix metalloproteinase-12 ( Mmp12 ) is upregulated by cigarette smoke (CS) and plays a critical role in extracellular matrix remodeling, a key mechanism involved in physiological repair processes, and in the pathogenesis of emphysema, asthma, and lung cancer. While cigarette smoking is associated with the development of chronic obstructive pulmonary diseases (COPD) and lung cancer, in utero exposures to CS and second-hand smoke (SHS) are associated with asthma development in the offspring. SHS is an indoor air pollutant that causes known adverse health effects; however, the mechanisms by which in utero SHS exposures predispose to adult lung diseases, including COPD, asthma, and lung cancer, are poorly understood. In this study, we tested the hypothesis that in utero SHS exposure aggravates adult-induced emphysema, asthma, and lung cancer. Methods: Pregnant BALB/c mice were exposed from gestational days 6–19 to either 3 or 10mg/m 3 of SHS or filtered air. At 10, 11, 16, or 17weeks of age, female offspring were treated with either saline for controls, elastase to induce emphysema, house-dust mite (HDM) to initiate asthma, or urethane to promote lung cancer. At sacrifice, specific disease-related lung responses including lung function, inflammation, gene, and protein expression were assessed. Results: In the elastase-induced emphysema model, in utero SHS-exposed mice had significantly enlarged airspaces and up-regulated expression of Mmp12 (10.3-fold compared to air-elastase controls). In the HDM-induced asthma model, in utero exposures to SHS produced eosinophilic lung inflammation and potentiated Mmp12 gene expression (5.7-fold compared to air-HDM controls). In the lung cancer model, in utero exposures to SHS significantly increased the number of intrapulmonary metastases at 58weeks of age and up-regulated Mmp12 (9.3-fold compared to air-urethane controls). In all lung disease models, Mmp12 upregulation was supported at the protein level. Conclusion: Our findings revealed that in utero SHS exposures exacerbate lung responses to adult-induced emphysema, asthma, and lung cancer. Our data show that MMP12 is up-regulated at the gene and protein levels in three distinct adult lung disease models following in utero SHS exposures, suggesting that MMP12 is central to in utero SHS-aggravated lung responses. 
    more » « less
  5. Abstract Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death worldwide. Current COPD diagnosis (i.e., spirometry) could be unreliable because the test depends on an adequate effort from the tester and testee. Moreover, the early diagnosis of COPD is challenging. The authors address COPD detection by constructing two novel physiological signals datasets (4432 records from 54 patients in the WestRo COPD dataset and 13824 medical records from 534 patients in the WestRo Porti COPD dataset). The authors demonstrate their complex coupled fractal dynamical characteristics and perform a fractional‐order dynamics deep learning analysis to diagnose COPD. The authors found that the fractional‐order dynamical modeling can extract distinguishing signatures from the physiological signals across patients with all COPD stages—from stage 0 (healthy) to stage 4 (very severe). They use the fractional signatures to develop and train a deep neural network that predicts COPD stages based on the input features (such as thorax breathing effort, respiratory rate, or oxygen saturation). The authors show that the fractional dynamic deep learning model (FDDLM) achieves a COPD prediction accuracy of 98.66% and can serve as a robust alternative to spirometry. The FDDLM also has high accuracy when validated on a dataset with different physiological signals. 
    more » « less