skip to main content

Title: ECG-based Human Authentication using High-level Spectro-temporal Signal Features
Electrocardiography (ECG) is the process of recording the electrical activity of the human heart over time using electrodes that are placed over the skin. While the primary usage of electrocardiograms, the recorded signals, has been focused on the check of signs of heart-related diseases, recent studies have moved also toward their usage for human authentication. Thus, an ECG signal can be unique enough to be used independently as a biometric modality. In addition to its inherent liveness detection, it is easy to collect and can be easily captured either via sensors attached to the human body (fingertips, chest, wrist) or even passively using wireless sensors. In this paper, we propose a novel approach that exploits the spectro-temporal dynamic characteristics of the ECG signal to establish personal recognition system using both short-time Fourier transform (STFT) and generalized Morse wavelets (CWT). This process results in enriching the information extracted from the original ECG signal that is inserted in a 2D convolutional neural network (CNN) which extracts higher level and subject-specific ECG-based features for each individual. To validate our proposed CNN model, we performed nested cross-validation using eight different ECG databases. These databases are considered challenging since they include both normal and abnormal more » heartbeats as well as a dynamic number of subjects. Our proposed algorithms yield superior performance when compared to other state-ofart approaches discussed in the literature, i.e. the STFT-based one achieves an average identification rate, equal error rate (EER), and area under curve (AUC) of 97.86%, 0.0268, and 0.9933 respectively, whereas the CWT achieves comparable to STFT results in 97.5%, 0.0386, and 0.9882 respectively. « less
Award ID(s):
Publication Date:
Journal Name:
IEEE International Conference on Big Data (Big Data)
Page Range or eLocation-ID:
4984 to 4993
Sponsoring Org:
National Science Foundation
More Like this
  1. Arrhythmia is an abnormal heart rhythm that occurs due to the improper operation of the electrical impulses that coordinate the heartbeats. It is one of the most well-known heart conditions (including coronary artery disease, heart failure etc.) that is experienced by millions of people around the world. While there are several types of arrhythmias, not all of them are dangerous or harmful. However, there are arrhythmias that can often lead to death in minutes (e.g, ventricular fibrillation and ventricular tachycardia) even in young people. Thus, the detection of arrhythmia is critical for stopping and reversing its progression and for increasing longevity and life quality. While a doctor can perform different heart-monitoring tests specific to arrhythmias, the electrocardiogram (ECG) is one of the most common ones used either independently or in combination with other tests (to only detect, e.g. echocardiogram, or trigger arrhythmia and, then, detect, e.g. stress test). We propose a machine learning approach that augments the traditional arrhythmia detection approaches via our automatic arrhythmia classification system. It utilizes the texture of the ECG signal in both the temporal and spectro-temporal domains to detect and classify four types of heartbeats. The original ECG signal is first preprocessed, and then, themore »R-peaks associated with heartbeat estimation are identified. Next, 1D local binary patterns (LBP) in the temporal domain are utilized, while 2D LBPs and texture-based features extracted by a grayscale co-occurrence matrix (GLCM) are utilized in the spectro-temporal domain using the short-time Fourier transform (STFT) and Morse wavelets. Finally, different classifiers, as well as different ECG lead configurations are examined before we determine our proposed time-frequency SVM model, which obtains a maximum accuracy of 99.81%, sensitivity of 98.17%, and specificity of 99.98% when using a 10 cross-validation on the MIT-BIH database.« less
  2. Vital signs (e.g., heart and respiratory rate) are indicative for health status assessment. Efforts have been made to extract vital signs using radio frequency (RF) techniques (e.g., Wi-Fi, FMCW, UWB), which offer a non-touch solution for continuous and ubiquitous monitoring without users’ cooperative efforts. While RF-based vital signs monitoring is user-friendly, its robustness faces two challenges. On the one hand, the RF signal is modulated by the periodic chest wall displacement due to heartbeat and breathing in a nonlinear manner. It is inherently hard to identify the fundamental heart and respiratory rates (HR and RR) in the presence of higher order harmonics of them and intermodulation between HR and RR, especially when they have overlapping frequency bands. On the other hand, the inadvertent body movements may disturb and distort the RF signal, overwhelming the vital signals, thus inhibiting the parameter estimation of the physiological movement (i.e., heartbeat and breathing). In this paper, we propose DeepVS, a deep learning approach that addresses the aforementioned challenges from the non-linearity and inadvertent movements for robust RF-based vital signs sensing in a unified manner. DeepVS combines 1D CNN and attention models to exploit local features and temporal correlations. Moreover, it leverages a two-stream schememore »to integrate features from both time and frequency domains. Additionally, DeepVS unifies the estimation of HR and RR with a multi-head structure, which only adds limited extra overhead (<1%) to the existing model, compared to doubling the overhead using two separate models for HR and RR respectively. Our experiments demonstrate that DeepVS achieves 80-percentile HR/RR errors of 7.4/4.9 beat/breaths per minute (bpm) on a challenging dataset, as compared to 11.8/7.3 bpm of a non-learning solution. Besides, an ablation study has been conducted to quantify the effectiveness of DeepVS.« less
  3. Background: Monitoring glucose excursions is important in diabetes management. This can be achieved using continuous glucose monitors (CGMs). However, CGMs are expensive and invasive. Thus, alternative low-cost noninvasive wearable sensors capable of predicting glycemic excursions could be a game changer to manage diabetes. Methods: In this article, we explore two noninvasive sensor modalities, electrocardiograms (ECGs) and accelerometers, collected on five healthy participants over two weeks, to predict both hypoglycemic and hyperglycemic excursions. We extract 29 features encompassing heart rate variability features from the ECG, and time- and frequency-domain features from the accelerometer. We evaluated two machine-learning approaches to predict glycemic excursions: a classification model and a regression model. Results: The best model for both hypoglycemia and hyperglycemia detection was the regression model based on ECG and accelerometer data, yielding 76% sensitivity and specificity for hypoglycemia and 79% sensitivity and specificity for hyperglycemia. This had an improvement of 5% in sensitivity and specificity for both hypoglycemia and hyperglycemia when compared with using ECG data alone. Conclusions: Electrocardiogram is a promising alternative not only to detect hypoglycemia but also to predict hyperglycemia. Supplementing ECG data with contextual information from accelerometer data can improve glucose prediction.
  4. Obeid, I. ; Selesnick, I. (Ed.)
    The Neural Engineering Data Consortium at Temple University has been providing key data resources to support the development of deep learning technology for electroencephalography (EEG) applications [1-4] since 2012. We currently have over 1,700 subscribers to our resources and have been providing data, software and documentation from our web site [5] since 2012. In this poster, we introduce additions to our resources that have been developed within the past year to facilitate software development and big data machine learning research. Major resources released in 2019 include: ● Data: The most current release of our open source EEG data is v1.2.0 of TUH EEG and includes the addition of 3,874 sessions and 1,960 patients from mid-2015 through 2016. ● Software: We have recently released a package, PyStream, that demonstrates how to correctly read an EDF file and access samples of the signal. This software demonstrates how to properly decode channels based on their labels and how to implement montages. Most existing open source packages to read EDF files do not directly address the problem of channel labels [6]. ● Documentation: We have released two documents that describe our file formats and data representations: (1) electrodes and channels [6]: describes how tomore »map channel labels to physical locations of the electrodes, and includes a description of every channel label appearing in the corpus; (2) annotation standards [7]: describes our annotation file format and how to decode the data structures used to represent the annotations. Additional significant updates to our resources include: ● NEDC TUH EEG Seizure (v1.6.0): This release includes the expansion of the training dataset from 4,597 files to 4,702. Calibration sequences have been manually annotated and added to our existing documentation. Numerous corrections were made to existing annotations based on user feedback. ● IBM TUSZ Pre-Processed Data (v1.0.0): A preprocessed version of the TUH Seizure Detection Corpus using two methods [8], both of which use an FFT sliding window approach (STFT). In the first method, FFT log magnitudes are used. In the second method, the FFT values are normalized across frequency buckets and correlation coefficients are calculated. The eigenvalues are calculated from this correlation matrix. The eigenvalues and correlation matrix's upper triangle are used to generate feature. ● NEDC TUH EEG Artifact Corpus (v1.0.0): This corpus was developed to support modeling of non-seizure signals for problems such as seizure detection. We have been using the data to build better background models. Five artifact events have been labeled: (1) eye movements (EYEM), (2) chewing (CHEW), (3) shivering (SHIV), (4) electrode pop, electrostatic artifacts, and lead artifacts (ELPP), and (5) muscle artifacts (MUSC). The data is cross-referenced to TUH EEG v1.1.0 so you can match patient numbers, sessions, etc. ● NEDC Eval EEG (v1.3.0): In this release of our standardized scoring software, the False Positive Rate (FPR) definition of the Time-Aligned Event Scoring (TAES) metric has been updated [9]. The standard definition is the number of false positives divided by the number of false positives plus the number of true negatives: #FP / (#FP + #TN). We also recently introduced the ability to download our data from an anonymous rsync server. The rsync command [10] effectively synchronizes both a remote directory and a local directory and copies the selected folder from the server to the desktop. It is available as part of most, if not all, Linux and Mac distributions (unfortunately, there is not an acceptable port of this command for Windows). To use the rsync command to download the content from our website, both a username and password are needed. An automated registration process on our website grants both. An example of a typical rsync command to access our data on our website is: rsync -auxv Rsync is a more robust option for downloading data. We have also experimented with Google Drive and Dropbox, but these types of technology are not suitable for such large amounts of data. All of the resources described in this poster are open source and freely available at We will demonstrate how to access and utilize these resources during the poster presentation and collect community feedback on the most needed additions to enable significant advances in machine learning performance.« less
  5. Paroxysmal atrial fibrillation (Paro. AF) is challenging to identify at the right moment. This disease is often undiagnosed using currently existing methods. Nonlinear analysis is gaining importance due to its capability to provide more insight into complex heart dynamics. The aim of this study is to use several recently developed nonlinear techniques to discriminate persistent AF (Pers. AF) from normal sinus rhythm (NSR), and more importantly, Paro. AF from NSR, using short-term single-lead electrocardiogram (ECG) signals. Specifically, we adapted and modified the time-delayed embedding method to minimize incorrect embedding parameter selection and further support to reconstruct proper phase plots of NSR and AF heart dynamics, from MIT-BIH databases. We also examine information-based methods, such as multiscale entropy (MSE) and kurtosis (Kt) for the same purposes. Our results demonstrate that embedding parameter time delay ( τ ), as well as MSE and Kt values can be successfully used to discriminate between Pers. AF and NSR. Moreover, we demonstrate that τ and Kt can successfully discriminate Paro. AF from NSR. Our results suggest that nonlinear time-delayed embedding method and information-based methods provide robust discriminating features to distinguish both Pers. AF and Paro. AF from NSR, thus offering effective treatment before suffering chaoticmore »Pers. AF.« less