Machine learning identifies abnormal Ca2+ transients in human induced pluripotent stem cell-derived cardiomyocytes

Hwang, Hyun; Liu, Rui; Maxwell, Joshua T.; Yang, Jingjing; Xu, Chunhui

doi:10.1038/s41598-020-73801-x

Abstract Human-induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) provide an excellent platform for potential clinical and research applications. Identifying abnormal Ca²⁺transients is crucial for evaluating cardiomyocyte function that requires labor-intensive manual effort. Therefore, we develop an analytical pipeline for automatic assessment of Ca²⁺transient abnormality, by employing advanced machine learning methods together with an Analytical Algorithm. First, we adapt an existing Analytical Algorithm to identify Ca²⁺transient peaks and determine peak abnormality based on quantified peak characteristics. Second, we train a peak-level Support Vector Machine (SVM) classifier by using human-expert assessment of peak abnormality as outcome and profiled peak variables as predictive features. Third, we train another cell-level SVM classifier by using human-expert assessment of cell abnormality as outcome and quantified cell-level variables as predictive features. This cell-level SVM classifier can be used to assess additional Ca²⁺transient signals. By applying this pipeline to our Ca²⁺transient data, we trained a cell-level SVM classifier using 200 cells as training data, then tested its accuracy in an independent dataset of 54 cells. As a result, we obtained 88% training accuracy and 87% test accuracy. Further, we provide a free R package to implement our pipeline for high-throughput CM Ca²⁺analysis.

More Like this