Quantifying input data drift in medical machine learning models by detecting change-points in time-series data

Prathapan, Smriti; Samala, Ravi K; Hadjiyski, Nathan; D’Haese, Pierre-François; Maldonado, Fabien; Nguyen, Phuong; Yesha, Yelena; Sahiner, Berkman

doi:10.1117/12.3008771

Citation Details

Quantifying input data drift in medical machine learning models by detecting change-points in time-series data

Devices enabled by artificial intelligence (AI) and machine learning (ML) are being introduced for clinical use at an accelerating pace. In a dynamic clinical environment, these devices may encounter conditions different from those they were developed for. The statistical data mismatch between training/initial testing and production is often referred to as data drift. Detecting and quantifying data drift is significant for ensuring that AI model performs as expected in clinical environments. A drift detector signals when a corrective action is needed if the performance changes. In this study, we investigate how a change in the performance of an AI model due to data drift can be detected and quantified using a cumulative sum (CUSUM) control chart. To study the properties of CUSUM, we first simulate different scenarios that change the performance of an AI model. We simulate a sudden change in the mean of the performance metric at a change-point (change day) in time. The task is to quickly detect the change while providing few false-alarms before the change-point, which may be caused by the statistical variation of the performance metric over time. Subsequently, we simulate data drift by denoising the Emory Breast Imaging Dataset (EMBED) after a pre-defined change-point. We detect the change-point by studying the pre- and post-change specificity of a mammographic CAD algorithm. Our results indicate that with the appropriate choice of parameters, CUSUM is able to quickly detect relatively small drifts with a small number of false-positive alarms. more »

Award ID(s):: 2326034

PAR ID:: 10539737

Author(s) / Creator(s):: Prathapan, Smriti; Samala, Ravi K; Hadjiyski, Nathan; D’Haese, Pierre-François; Maldonado, Fabien; Nguyen, Phuong; Yesha, Yelena; Sahiner, Berkman

Editor(s):: Astley, Susan M; Chen, Weijie

Publisher / Repository:: SPIE

Date Published:: 2024-04-03

ISBN:: 9781510671584

Page Range / eLocation ID:: 11

Format(s):: Medium: X

Location:: San Diego, United States

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1117/12.3008771

More Like this