skip to main content


Title: Early prediction of patient discharge disposition in acute neurological care using machine learning
Abstract Background

Acute neurological complications are some of the leading causes of death and disability in the U.S. The medical professionals that treat patients in this setting are tasked with deciding where (e.g., home or facility), how, and when to discharge these patients. It is important to be able to predict potential patient discharge outcomes as early as possible during the patient’s hospital stay and to know what factors influence the development of discharge planning. This study carried out two parallel experiments: A multi-class outcome (patient discharge targets of ‘home’, ‘nursing facility’, ‘rehab’, ‘death’) and binary class outcome (‘home’ vs. ‘non-home’). The goal of this study is to develop early predictive models for each experiment exploring which patient characteristics and clinical variables significantly influence discharge planning of patients based on the data that are available only within 24 h of their hospital admission. 

Method

Our methodology centers around building and training five different machine learning models followed by testing and tuning those models to find the best-suited predictor for each experiment with a dataset of 5,245 adult patients with neurological conditions taken from the eICU-CRD database.

Results

The results of this study show XGBoost to be the most effective model for predicting between four common discharge outcomes of ‘home’, ‘nursing facility’, ‘rehab’, and ‘death’, with 71% average c-statistic. The XGBoost model was also the best-performer in the binary outcome experiment with a c-statistic of 76%. This article also explores the accuracy, reliability, and interpretability of the best performing models in each experiment by identifying and analyzing the features that are most impactful to the predictions.

Conclusions

The acceptable accuracy and interpretability of the predictive models based on early admission data suggests that the models can be used in a suggestive context to help guide healthcare providers in efforts of planning effective and equitable discharge recommendations.

 
more » « less
NSF-PAR ID:
10376880
Author(s) / Creator(s):
;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
BMC Health Services Research
Volume:
22
Issue:
1
ISSN:
1472-6963
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Objective

    To develop predictive models of coronavirus disease 2019 (COVID-19) outcomes, elucidate the influence of socioeconomic factors, and assess algorithmic racial fairness using a racially diverse patient population with high social needs.

    Materials and Methods

    Data included 7,102 patients with positive (RT-PCR) severe acute respiratory syndrome coronavirus 2 test at a safety-net system in Massachusetts. Linear and nonlinear classification methods were applied. A score based on a recurrent neural network and a transformer architecture was developed to capture the dynamic evolution of vital signs. Combined with patient characteristics, clinical variables, and hospital occupancy measures, this dynamic vital score was used to train predictive models.

    Results

    Hospitalizations can be predicted with an area under the receiver-operating characteristic curve (AUC) of 92% using symptoms, hospital occupancy, and patient characteristics, including social determinants of health. Parsimonious models to predict intensive care, mechanical ventilation, and mortality that used the most recent labs and vitals exhibited AUCs of 92.7%, 91.2%, and 94%, respectively. Early predictive models, using labs and vital signs closer to admission had AUCs of 81.1%, 84.9%, and 92%, respectively.

    Discussion

    The most accurate models exhibit racial bias, being more likely to falsely predict that Black patients will be hospitalized. Models that are only based on the dynamic vital score exhibited accuracies close to the best parsimonious models, although the latter also used laboratories.

    Conclusions

    This large study demonstrates that COVID-19 severity may accurately be predicted using a score that accounts for the dynamic evolution of vital signs. Further, race, social determinants of health, and hospital occupancy play an important role.

     
    more » « less
  2. Abstract Background

    Hospitals are incentivized to reduce rehospitalization rates, creating an emphasis on skilled nursing facilities (SNFs) for post-hospital discharge. How rehospitalization rates vary depending on patient and SNF characteristics is not well understood, in part because these characteristics are high-dimensional. We sought to estimate rehospitalization and mortality risks by patient and skilled nursing facility (SNF) leveraging high-dimensional characteristics.

    Methods

    Using 1,060,337 discharges from 13,708 SNFs of Medicare patients residing or visiting a provider in Wisconsin, Iowa, and Illinois, factor analysis was performed to reduce the number of patient and SNF characteristics. K-means clustering was applied to SNF factors to categorize SNFs into groups. Rehospitalization and mortality risks within 60 days of discharge was estimated by SNF group for various values of patient factors.

    Results

    Patient and SNF characteristics (616 in total) were reduced to 12 patient factors and 4 SNF groups. Patient factors reflected broad conditions. SNF groups differed in beds and staff capacity, off-site services, and physical and occupational therapy capacity; and in mortality and rehospitalization rates for some patients. Patients with cardiac, orthopedic, and neuropsychiatric conditions are associated with better outcomes when assigned to SNFs with greater on-site capacity (i.e. beds, staff, physical and occupational therapy), whereas patients with conditions related to cancer or chronic renal failure are associated with better outcomes when assigned to SNFs with less on-site capacity.

    Conclusions

    Risks of rehospitalization and mortality appear to vary significantly by patient and SNF, with certain SNFs being better suited for some patient conditions over others.

     
    more » « less
  3. null (Ed.)
    Objective: The objective of the study is to build models for early prediction of risk for developing multiple organ dysfunction (MOD) in pediatric intensive care unit (PICU) patients. Design: The design of the study is a retrospective observational cohort study. Setting: The setting of the study is at a single academic PICU at the Johns Hopkins Hospital, Baltimore, MD. Patients: The patients included in the study were <18 years of age admitted to the PICU between July 2014 and October 2015. Measurements and main results: Organ dysfunction labels were generated every minute from preceding 24-h time windows using the International Pediatric Sepsis Consensus Conference (IPSCC) and Proulx et al. MOD criteria. Early MOD prediction models were built using four machine learning methods: random forest, XGBoost, GLMBoost, and Lasso-GLM. An optimal threshold learned from training data was used to detect high-risk alert events (HRAs). The early prediction models from all methods achieved an area under the receiver operating characteristics curve ≥0.91 for both IPSCC and Proulx criteria. The best performance in terms of maximum F1-score was achieved with random forest (sensitivity: 0.72, positive predictive value: 0.70, F1-score: 0.71) and XGBoost (sensitivity: 0.8, positive predictive value: 0.81, F1-score: 0.81) for IPSCC and Proulx criteria, respectively. The median early warning time was 22.7 h for random forest and 37 h for XGBoost models for IPSCC and Proulx criteria, respectively. Applying spectral clustering on risk-score trajectories over 24 h following early warning provided a high-risk group with ≥0.93 positive predictive value. Conclusions: Early predictions from risk-based patient monitoring could provide more than 22 h of lead time for MOD onset, with ≥0.93 positive predictive value for a high-risk group identified pre-MOD. 
    more » « less
  4. The COVID-19 pandemic has changed the lives of many people around the world. Based on the available data and published reports, most people diagnosed with COVID-19 exhibit no or mild symptoms and could be discharged home for self-isolation. Considering that a substantial portion of them will progress to a severe disease requiring hospitalization and medical management, including respiratory and circulatory support in the form of supplemental oxygen therapy, mechanical ventilation, vasopressors, etc. The continuous monitoring of patient conditions at home for patients with COVID-19 will allow early determination of disease severity and medical intervention to reduce morbidity and mortality. In addition, this will allow early and safe hospital discharge and free hospital beds for patients who are in need of admission. In this review, we focus on the recent developments in next-generation wearable sensors capable of continuous monitoring of disease symptoms, particularly those associated with COVID-19. These include wearable non/minimally invasive biophysical (temperature, respiratory rate, oxygen saturation, heart rate, and heart rate variability) and biochemical (cytokines, cortisol, and electrolytes) sensors, sensor data analytics, and machine learning-enabled early detection and medical intervention techniques. Together, we aim to inspire the future development of wearable sensors integrated with data analytics, which serve as a foundation for disease diagnostics, health monitoring and predictions, and medical interventions. 
    more » « less
  5. Abstract

    Unrecognized deterioration of COVID-19 patients can lead to high morbidity and mortality. Most existing deterioration prediction models require a large number of clinical information, typically collected in hospital settings, such as medical images or comprehensive laboratory tests. This is infeasible for telehealth solutions and highlights a gap in deterioration prediction models based on minimal data, which can be recorded at a large scale in any clinic, nursing home, or even at the patient’s home. In this study, we develop and compare two prognostic models that predict if a patient will experience deterioration in the forthcoming 3 to 24 h. The models sequentially process routine triadic vital signs: (a) oxygen saturation, (b) heart rate, and (c) temperature. These models are also provided with basic patient information, including sex, age, vaccination status, vaccination date, and status of obesity, hypertension, or diabetes. The difference between the two models is the way that the temporal dynamics of the vital signs are processed. Model #1 utilizes a temporally-dilated version of the Long-Short Term Memory model (LSTM) for temporal processes, and Model #2 utilizes a residual temporal convolutional network (TCN) for this purpose. We train and evaluate the models using data collected from 37,006 COVID-19 patients at NYU Langone Health in New York, USA. The convolution-based model outperforms the LSTM based model, achieving a high AUROC of 0.8844–0.9336 for 3 to 24 h deterioration prediction on a held-out test set. We also conduct occlusion experiments to evaluate the importance of each input feature, which reveals the significance of continuously monitoring the variation of the vital signs. Our results show the prospect for accurate deterioration forecast using a minimum feature set that can be relatively easily obtained using wearable devices and self-reported patient information.

     
    more » « less