skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on November 12, 2025

Title: Abstract 4141280: Machine Learning Predicts 24-Hour Change in Decongestion Biomarkers in Hospitalized Heart Failure Patients
Background:Despite recent advances, patients with heart failure (HF) often experience repeat hospitalizations and worsening clinical trajectories from inadequate decongestion. Evidence-based approaches for optimizing interventions in the acute hospital setting for patients with decompensated HF are needed. We evaluated whether machine learning (ML) models can accurately predict next-day levels for decongestion surrogates in hospitalized HF patients. Hypothesis:ML can accurately predict body weight, hematocrit, creatinine, and potassium values in the next 24 hours in hospitalized HF patients. Methods:We utilized national Veterans Affairs (VA) databases to study all patients admitted with HF from January 2014 to July 2022. Records including at least one value for at least one biomarker of interest (body weight, hematocrit, creatinine, and potassium) were included. Patients were randomly split into training (80%), validation (10%), and test (10%) datasets. We trained a recurrent neural network to predict each biomarker’s value on admission day n+1 using data until day n, simulating a scenario where a clinician monitors response to treatment (e.g., diuresis) over a 24-hour cycle. The model that performed best on the validation set was evaluated on the test set. The R2, mean absolute error (MAE), and feature importance were determined. Results:We identified 589,114 admissions involving 124,163 unique patients. The mean (SD) age on admission was 72 (10) years; 98% were male, 69% were white, and 25% were Black. The performance (R2, MAE) for each biomarker model was as follows: body weight (0.94, 6.15 lb.), creatinine (0.92, 0.21 mg/dL), hematocrit (0.86, 1.7%), and potassium (0.53, 0.27 mmol/L). The top predictive features across all models were intravenous or oral diuretic use, patient age, and diastolic blood pressure. The predicted 24-hour change in each biomarker based on total daily diuretic dose for five representative patients is demonstrated in the Figure. Conclusions:ML can accurately predict the 24-hour body weight, hematocrit, creatinine, and potassium values in hospitalized HF patients, suggesting the potential for AI to guide acute in-hospital management.  more » « less
Award ID(s):
2304358
PAR ID:
10575750
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Lippincott Williams & Wilkins
Date Published:
Journal Name:
Circulation
Volume:
150
Issue:
Suppl_1
ISSN:
0009-7322
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Background The novel coronavirus SARS-CoV-2 and its associated disease, COVID-19, have caused worldwide disruption, leading countries to take drastic measures to address the progression of the disease. As SARS-CoV-2 continues to spread, hospitals are struggling to allocate resources to patients who are most at risk. In this context, it has become important to develop models that can accurately predict the severity of infection of hospitalized patients to help guide triage, planning, and resource allocation. Objective The aim of this study was to develop accurate models to predict the mortality of hospitalized patients with COVID-19 using basic demographics and easily obtainable laboratory data. Methods We performed a retrospective study of 375 hospitalized patients with COVID-19 in Wuhan, China. The patients were randomly split into derivation and validation cohorts. Regularized logistic regression and support vector machine classifiers were trained on the derivation cohort, and accuracy metrics (F1 scores) were computed on the validation cohort. Two types of models were developed: the first type used laboratory findings from the entire length of the patient’s hospital stay, and the second type used laboratory findings that were obtained no later than 12 hours after admission. The models were further validated on a multicenter external cohort of 542 patients. Results Of the 375 patients with COVID-19, 174 (46.4%) died of the infection. The study cohort was composed of 224/375 men (59.7%) and 151/375 women (40.3%), with a mean age of 58.83 years (SD 16.46). The models developed using data from throughout the patients’ length of stay demonstrated accuracies as high as 97%, whereas the models with admission laboratory variables possessed accuracies of up to 93%. The latter models predicted patient outcomes an average of 11.5 days in advance. Key variables such as lactate dehydrogenase, high-sensitivity C-reactive protein, and percentage of lymphocytes in the blood were indicated by the models. In line with previous studies, age was also found to be an important variable in predicting mortality. In particular, the mean age of patients who survived COVID-19 infection (50.23 years, SD 15.02) was significantly lower than the mean age of patients who died of the infection (68.75 years, SD 11.83; P<.001). Conclusions Machine learning models can be successfully employed to accurately predict outcomes of patients with COVID-19. Our models achieved high accuracies and could predict outcomes more than one week in advance; this promising result suggests that these models can be highly useful for resource allocation in hospitals. 
    more » « less
  2. Keim-Malpass, Jessica (Ed.)
    During the early stages of hospital admission, clinicians use limited information to make decisions as patient acuity evolves. We hypothesized that clustering analysis of vital signs measured within six hours of hospital admission would reveal distinct patient phenotypes with unique pathophysiological signatures and clinical outcomes. We created a longitudinal electronic health record dataset for 75,762 adult patient admissions to a tertiary care center in 2014–2016 lasting six hours or longer. Physiotypes were derived via unsupervised machine learning in a training cohort of 41,502 patients applying consensus k -means clustering to six vital signs measured within six hours of admission. Reproducibility and correlation with clinical biomarkers and outcomes were assessed in validation cohort of 17,415 patients and testing cohort of 16,845 patients. Training, validation, and testing cohorts had similar age (54–55 years) and sex (55% female), distributions. There were four distinct clusters. Physiotype A had physiologic signals consistent with early vasoplegia, hypothermia, and low-grade inflammation and favorable short-and long-term clinical outcomes despite early, severe illness. Physiotype B exhibited early tachycardia, tachypnea, and hypoxemia followed by the highest incidence of prolonged respiratory insufficiency, sepsis, acute kidney injury, and short- and long-term mortality. Physiotype C had minimal early physiological derangement and favorable clinical outcomes. Physiotype D had the greatest prevalence of chronic cardiovascular and kidney disease, presented with severely elevated blood pressure, and had good short-term outcomes but suffered increased 3-year mortality. Comparing sequential organ failure assessment (SOFA) scores across physiotypes demonstrated that clustering did not simply recapitulate previously established acuity assessments. In a heterogeneous cohort of hospitalized patients, unsupervised machine learning techniques applied to routine, early vital sign data identified physiotypes with unique disease categories and distinct clinical outcomes. This approach has the potential to augment understanding of pathophysiology by distilling thousands of disease states into a few physiological signatures. 
    more » « less
  3. Abstract ObjectiveTo develop predictive models of coronavirus disease 2019 (COVID-19) outcomes, elucidate the influence of socioeconomic factors, and assess algorithmic racial fairness using a racially diverse patient population with high social needs. Materials and MethodsData included 7,102 patients with positive (RT-PCR) severe acute respiratory syndrome coronavirus 2 test at a safety-net system in Massachusetts. Linear and nonlinear classification methods were applied. A score based on a recurrent neural network and a transformer architecture was developed to capture the dynamic evolution of vital signs. Combined with patient characteristics, clinical variables, and hospital occupancy measures, this dynamic vital score was used to train predictive models. ResultsHospitalizations can be predicted with an area under the receiver-operating characteristic curve (AUC) of 92% using symptoms, hospital occupancy, and patient characteristics, including social determinants of health. Parsimonious models to predict intensive care, mechanical ventilation, and mortality that used the most recent labs and vitals exhibited AUCs of 92.7%, 91.2%, and 94%, respectively. Early predictive models, using labs and vital signs closer to admission had AUCs of 81.1%, 84.9%, and 92%, respectively. DiscussionThe most accurate models exhibit racial bias, being more likely to falsely predict that Black patients will be hospitalized. Models that are only based on the dynamic vital score exhibited accuracies close to the best parsimonious models, although the latter also used laboratories. ConclusionsThis large study demonstrates that COVID-19 severity may accurately be predicted using a score that accounts for the dynamic evolution of vital signs. Further, race, social determinants of health, and hospital occupancy play an important role. 
    more » « less
  4. Background Early diagnosis is essential for effective stroke therapy. Strokes in hospitalized patients are associated with worse outcomes compared with strokes in the community. We derived and validated an algorithm to identify strokes by monitoring upper limb movements in hospitalized patients. Methods and Results A prospective case–control study in hospitalized patients evaluated bilateral arm accelerometry from patients with acute stroke with lateralized weakness and controls without stroke. We derived a stroke classifier algorithm from 123 controls and 77 acute stroke cases and then validated the performance in a separate cohort of 167 controls and 33 acute strokes, measuring false alarm rates in nonstroke controls and time to detection in stroke cases. Faster detection time was associated with more false alarms. With a median false alarm rate among nonstroke controls of 3.6 (interquartile range [IQR], 2.1–5.0) alarms per patient per day, the median time to detection was 15.0 (IQR, 8.0–73.5) minutes. A median false alarm rate of 1.1 (IQR. 0–2.2) per patient per day was associated with a median time to stroke detection of 29.0 (IQR, 11.0–58.0) minutes. There were no differences in algorithm performance for subgroups dichotomized by age, sex, race, handedness, nondominant hemisphere involvement, intensive care unit versus ward, or daytime versus nighttime. Conclusions Arm movement data can be used to detect asymmetry indicative of stroke in hospitalized patients with a low false alarm rate. Additional studies are needed to demonstrate clinical usefulness. 
    more » « less
  5. Prescribed fires often have ecological benefits, but their environmental health risks have been infrequently studied. We investigated associations between residing near a prescribed fire, wildfire smoke exposure, and heart failure (HF) patients’ hospital utilization. Methods: We used electronic health records from January 2014 to December 2016 in a North Carolina hospital-based cohort to determine HF diagnoses, primary residence, and hospital utilization. Using a cross-sectional study design, we associated the prescribed fire occurrences within 1, 2, and 5 km of the patients’ primary residence with the number of hospital visits and 7- and 30-day readmissions. To compare prescribed fire associations with those observed for wildfire smoke, we also associated zip code-level smoke density data designed to capture wildfire smoke emissions with hospital utilization amongst HF patients. Quasi-Poisson regression models were used for the number of hospital visits, while zero-inflated Poisson regression models were used for readmissions. All models were adjusted for age, sex, race, and neighborhood socioeconomic status and included an offset for follow-up time. The results are the percent change and the 95% confidence interval (CI). Results: Associations between prescribed fire occurrences and hospital visits were generally null, with the few associations observed being with prescribed fires within 5 and 2 km of the primary residence in the negative direction but not the more restrictive 1 km radius. However, exposure to medium or heavy smoke (primarily from wildfires) at the zip code level was associated with both 7-day (8.5% increase; 95% CI = 1.5%, 16.0%) and 30-day readmissions (5.4%; 95% CI = 2.3%, 8.5%), and to a lesser degree, hospital visits (1.5%; 95% CI: 0.0%, 3.0%) matching previous studies. Conclusions: Area-level smoke exposure driven by wildfires is positively associated with hospital utilization but not proximity to prescribed fires. 
    more » « less