Title: Integrating landmark modeling framework and machine learning algorithms for dynamic prediction of tuberculosis treatment outcomes
Abstract

Objective

This study aims to establish an informative dynamic prediction model of treatment outcomes using follow-up records of tuberculosis (TB) patients, so that cases in which the current treatment plan may not be effective can be detected in a timely manner.

Materials and Methods

We used 122 267 follow-up records from 17 958 new cases of pulmonary TB in the Republic of Moldova. A dynamic prediction framework integrating landmark modeling and machine learning algorithms was designed to predict patient outcomes during the course of treatment. Sensitivity and positive predictive value (PPV) were calculated to evaluate the performance of the model at critical time points. New measures were defined to determine when follow-up laboratory tests should be conducted to obtain the most informative results.
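The landmark idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields (`month`, `smear_positive`), the summary features, and the function name `build_landmark_dataset` are all hypothetical. The sketch assumes that, at a landmark time, only patients whose outcome is not yet known are kept, and only their follow-up records up to that time are used to build features.

```python
from typing import Dict, List, NamedTuple, Optional


class FollowUp(NamedTuple):
    """One follow-up record (hypothetical fields, for illustration only)."""
    patient_id: str
    month: float           # months since treatment initiation
    smear_positive: bool   # result of a sputum smear test


def build_landmark_dataset(
    records: List[FollowUp],
    outcome_month: Dict[str, float],
    landmark: float,
) -> Dict[str, Dict[str, Optional[float]]]:
    """Summarize each at-risk patient's history up to the landmark time."""
    # Keep only information that would be available at the landmark.
    by_patient: Dict[str, List[FollowUp]] = {}
    for rec in records:
        if rec.month <= landmark:
            by_patient.setdefault(rec.patient_id, []).append(rec)

    dataset: Dict[str, Dict[str, Optional[float]]] = {}
    for pid, end in outcome_month.items():
        if end <= landmark:
            continue  # outcome already known, so nothing left to predict
        history = sorted(by_patient.get(pid, []), key=lambda r: r.month)
        n = len(history)
        dataset[pid] = {
            "n_tests": n,
            "last_positive": float(history[-1].smear_positive) if n else None,
            "frac_positive": sum(r.smear_positive for r in history) / n if n else None,
        }
    return dataset
```

One feature table per landmark could then be passed to any classifier. Keeping a summary of the whole history, rather than only the latest result, is a design choice made for this sketch.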

Results

The random-forest algorithm performed better than support vector machine and penalized multinomial logistic regression models for predicting TB treatment outcomes. For all 3 outcome classes (ie, cured, not cured, and died after 24 months following treatment initiation), sensitivity and PPV of prediction models improved as more follow-up information was collected. Specifically, sensitivity and PPV increased from 0.55 to 0.84 and from 0.32 to 0.88, respectively, for the not cured class.
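For readers less familiar with the two metrics reported here, the following sketch shows one common way to compute per-class sensitivity and PPV in a multiclass setting, treating one class as positive and all others as negative. The function name and the toy labels are assumptions for illustration and are not taken from the study.

```python
from typing import Optional, Sequence, Tuple


def sensitivity_and_ppv(
    y_true: Sequence[str],
    y_pred: Sequence[str],
    positive_class: str,
) -> Tuple[Optional[float], Optional[float]]:
    """One-vs-rest sensitivity and PPV for a single outcome class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive_class and p == positive_class for t, p in pairs)
    fn = sum(t == positive_class and p != positive_class for t, p in pairs)
    fp = sum(t != positive_class and p == positive_class for t, p in pairs)

    # Sensitivity: share of true members of the class that were predicted as such.
    sensitivity = tp / (tp + fn) if (tp + fn) else None
    # PPV: share of predictions of the class that were correct.
    ppv = tp / (tp + fp) if (tp + fp) else None
    return sensitivity, ppv
```

Either value is returned as `None` when its denominator is zero, for example when the class is never predicted.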

Conclusion

The dynamic prediction framework utilizes longitudinal laboratory test results to predict patient outcomes at various landmarks. Sputum culture and smear results are among the important variables for prediction; however, the most recent sputum result is not always the most informative one. This framework can potentially facilitate a more effective treatment monitoring program and provide insights for policymakers toward improved guidelines on follow-up tests.

 
Award ID(s):
1920920
NSF-PAR ID:
10474139
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Journal of the American Medical Informatics Association
Volume:
29
Issue:
5
ISSN:
1527-974X
Format(s):
Medium: X Size: p. 900-908
Sponsoring Org:
National Science Foundation
More Like this
  1. Treating disease according to precision health requires the individualization of therapeutic solutions as a cardinal step that is part of a process that typically depends on multiple factors. The starting point is the collection and assembly of data over time to assess the patient’s health status and monitor response to therapy. Radiomics is a very important component of this process. Its main goal is implementing a protocol to quantify the informative content of images by first mining and then extracting the most representative features. Further analysis aims to detect potential disease phenotypes through signs and marks of heterogeneity. As multimodal images hinge on various data sources, and these can be integrated with treatment plans and follow-up information, radiomics is naturally centered on dynamically monitoring disease progression and/or the health trajectory of patients. However, radiomics creates critical needs too. A concise list includes: (a) successful harmonization of intra/inter-modality radiomic measurements to facilitate the association with other data domains (genetic, clinical, lifestyle aspects, etc.); (b) ability of data science to revise model strategies and analytics tools to tackle multiple data types and structures (electronic medical records, personal histories, hospitalization data, genomic data from various specimens, imaging, etc.) and to offer data-agnostic solutions for patient outcomes prediction; and (c) model validation with independent datasets to ensure generalization of results, clinical value of new risk stratifications, and support to clinical decisions for highly individualized patient management.
  2. Key points

    Right heart catheterization data from clinical records of heart transplant patients are used to identify patient‐specific models of the cardiovascular system.

    These patient‐specific cardiovascular models represent a snapshot of cardiovascular function at a given post‐transplant recovery time point.

    This approach is used to describe cardiac function in 10 heart transplant patients, five of whom had multiple right heart catheterizations, allowing an assessment of cardiac function over time.

    These patient‐specific models are used to predict cardiovascular function in the form of right and left ventricular pressure‐volume loops and ventricular power, an important metric in the clinical assessment of cardiac function.

    Outcomes for the longitudinally tracked patients show that our approach was able to identify the one patient from the group of five that exhibited post‐transplant cardiovascular complications.

    Abstract

    Heart transplant patients are followed with periodic right heart catheterizations (RHCs) to identify post‐transplant complications and guide treatment. Post‐transplant positive outcomes are associated with a steady reduction of right ventricular and pulmonary arterial pressures, toward normal levels of right‐side pressure (about 20 mmHg) measured by RHC. This study shows that more information about patient progression is obtained by combining standard RHC measures with mechanistic computational cardiovascular system models. The purpose of this study is twofold: to understand how cardiovascular system models can be used to represent a patient's cardiovascular state, and to use these models to track post‐transplant recovery and outcome. To obtain reliable parameter estimates comparable within and across datasets, we use sensitivity analysis, parameter subset selection, and optimization to determine patient‐specific mechanistic parameters that can be reliably extracted from the RHC data. Patient‐specific models are identified for 10 patients from their first post‐transplant RHC, and longitudinal analysis is carried out for five patients. Results of the sensitivity analysis and subset selection show that we can reliably estimate seven non‐measurable quantities; namely, ventricular diastolic relaxation, systemic resistance, pulmonary venous elastance, pulmonary resistance, pulmonary arterial elastance, pulmonary valve resistance and systemic arterial elastance. Changes in parameters and predicted cardiovascular function post‐transplant are used to evaluate the cardiovascular state during recovery of five patients. Of these five patients, only one showed inconsistent trends during recovery in ventricular pressure–volume relationships and power output. At the four‐year post‐transplant time point this patient exhibited biventricular failure along with graft dysfunction while the remaining four exhibited no cardiovascular complications.

     
  3. Summary

    Predicting patient life expectancy is of great importance for clinicians in making treatment decisions. This prediction needs to be conducted in a dynamic manner, based on longitudinal biomarkers repeatedly measured during the patient's post-treatment follow-up period. The prediction is updated any time a new biomarker measurement is obtained. The heterogeneity across patients of biomarker trajectories over time requires flexible and powerful approaches to model noisy and irregularly measured longitudinal data. In this article, we use functional principal component analysis (FPCA) to extract the dominant features of the biomarker trajectory of each individual, and use these features as time-dependent predictors (covariates) in a transformed mean residual life (MRL) regression model to conduct dynamic prediction. Simulation studies demonstrate the improved performance of the transformed MRL model that includes longitudinal biomarker information in the prediction. We apply the proposed method to predict the remaining time expectancy until disease progression for patients with chronic myeloid leukemia, using the transcript levels of an oncogene, BCR-ABL.

     
  4. Abstract

    Developing prediction models for emerging infectious diseases from relatively small numbers of cases is a critical need for improving pandemic preparedness. Using COVID-19 as an exemplar, we propose a transfer learning methodology for developing predictive models from multi-modal electronic healthcare records by leveraging information from more prevalent diseases with shared clinical characteristics. Our novel hierarchical, multi-modal model (TransMED) integrates baseline risk factors from the natural language processing of clinical notes at admission, time-series measurements of biomarkers obtained from laboratory tests, and discrete diagnostic, procedure and drug codes. We demonstrate the alignment of TransMED’s predictions with well-established clinical knowledge about COVID-19 through univariate and multivariate risk factor driven sub-cohort analysis. TransMED’s superior performance over state-of-the-art methods shows that leveraging patient data across modalities and transferring prior knowledge from similar disorders is critical for accurate prediction of patient outcomes, and this approach may serve as an important tool in the early response to future pandemics.

     
  5. Abstract

    Inadequate at-home management and self-awareness of heart failure (HF) exacerbations are known to be leading causes of the greater than 1 million estimated HF-related hospitalizations in the USA alone. Most current at-home HF management protocols include paper guidelines or exploratory health applications that lack rigor and validation at the level of the individual patient. We report on a novel triage methodology that uses machine learning predictions for real-time detection and assessment of exacerbations. Medical specialist opinions on statistically and clinically comprehensive, simulated patient cases were used to train and validate prediction algorithms. Model performance was assessed by comparison to physician panel consensus in a representative, out-of-sample validation set of 100 vignettes. Algorithm prediction accuracy and safety indicators surpassed all individual specialists in identifying consensus opinion on existence/severity of exacerbations and appropriate treatment response. The algorithms also scored the highest sensitivity, specificity, and PPV when assessing the need for emergency care.

    Lay summary

    Here we develop a machine-learning approach for providing real-time decision support to adults diagnosed with congestive heart failure. The algorithm achieves higher exacerbation and triage classification performance than any individual physician when compared to physician consensus opinion.