Title: Distributed Kriging-Bootstrapped DNN Model for Fast, Accurate Seizure Detection from EEG Signals
Modeling the brain as a three-dimensional spatial object, similar to a geographical landscape, has paved the way for the successful application of Kriging methods to the seizure detection problem, with good performance but cubic computational time complexity. The Deep Neural Network (DNN) has been widely used for seizure detection due to its effectiveness in classification tasks, although at the cost of a protracted training time. While Kriging exploits the spatial correlation between data locations, the DNN relies on its capacity to learn intrinsic representations within the dataset from its most basic units. This paper presents a Distributed Kriging-Bootstrapped Deep Neural Network (DNN) model as a twofold solution for fast and accurate seizure detection using brain signals collected with the electroencephalogram (EEG) from healthy subjects and patients with epilepsy. The proposed model parallelizes the Kriging computation across different cores of a machine and then produces a strongly correlated, unified quasi-output data set which serves as the input to the Deep Neural Network. Experimental results validate the proposed model as superior to conventional Kriging methods and DNN: it trains in 91% less time than the basic DNN and about three times as fast as the ordinary Kriging-Bootstrapped DNN model, while maintaining good performance in terms of sensitivity, specificity and testing accuracy compared to other models and existing works.
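The abstract does not say how the Kriging computation is split or how the partial results are merged, so the following is only a minimal sketch under stated assumptions: the samples are cut into contiguous chunks, each chunk is smoothed by a simple-kriging-style predictor (exponential covariance, chunk mean used as the assumed known mean, a small nugget), and the per-chunk outputs are concatenated into one array that could be handed to a DNN. The names `exp_cov`, `krige_chunk` and `distributed_kriging`, and all parameter values, are hypothetical. Solving one n×n system costs on the order of n³ operations, whereas P chunks of size n/P cost about n³/P² in total, which is the motivation for partitioning.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def exp_cov(a, b, sill=1.0, length=1.0):
    """Exponential covariance between two point sets (assumed model)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sill * np.exp(-d / length)


def krige_chunk(coords, values, nugget=1e-2):
    """Simple-kriging-style smoothing of one chunk at its own locations.

    The chunk mean stands in for the known mean that simple kriging
    assumes; this is an illustrative choice, not the paper's method.
    """
    mean = values.mean()
    K = exp_cov(coords, coords)
    # Solve (K + nugget * I) w = z - mean; the prediction is K @ w + mean.
    w = np.linalg.solve(K + nugget * np.eye(len(coords)), values - mean)
    return K @ w + mean


def distributed_kriging(coords, values, n_chunks=4,
                        executor_cls=ThreadPoolExecutor):
    """Krige each chunk in a separate worker and concatenate the results."""
    idx = np.array_split(np.arange(len(values)), n_chunks)
    with executor_cls(max_workers=n_chunks) as pool:
        parts = pool.map(krige_chunk,
                         [coords[i] for i in idx],
                         [values[i] for i in idx])
        return np.concatenate(list(parts))


# Hypothetical data: 120 sample locations in a unit cube.
rng = np.random.default_rng(0)
coords = rng.uniform(size=(120, 3))
values = np.sin(coords.sum(axis=1))
quasi_output = distributed_kriging(coords, values, n_chunks=4)
```

A thread pool is used here only to keep the sketch portable; passing `executor_cls=concurrent.futures.ProcessPoolExecutor` (with the call placed under an `if __name__ == "__main__":` guard where the platform requires it) would dispatch each chunk to a separate process instead. Contiguous chunking also ignores spatial locality, which a real partitioning scheme would likely want to respect.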
Award ID(s):
1924112
PAR ID:
10158121
Journal Name:
Proceedings of the 19th IEEE Computer Society Annual Symposium on VLSI (ISVLSI)
Sponsoring Org:
National Science Foundation
More Like this
  1. The Deep Neural Network (DNN) model is known for its high accuracy in classification tasks due to its intrinsic ability to learn the underlying patterns in a set of data. Hence it has gained momentum in seizure detection research, as in many other fields. However, its high performance comes at the expense of an extensive training time. This is not appropriate for a real-time application such as seizure detection, in which a swift reaction is required to save the life of the patient. This paper presents a novel Kriging-Bootstrapped Deep Neural Network hierarchical model for early seizure detection in which Kriging is first used to generate a well-correlated intermediate data set from the original input. The correlated data is then fed into the DNN for the final training. Experiments were carried out using electroencephalogram (EEG) data from both normal subjects and epileptic patients. Results show that, with the same architecture and data size, the cumulative training time of the Kriging-Bootstrapped DNN is about 75% lower than that of the ordinary DNN without compromising performance; the proposed hybrid model shows slightly better accuracy than the baseline DNN model.
  2. Epileptic seizures are dangerous. They render patients unconscious and can lead to death within seconds of onset. There is, therefore, a need for a very fast and accurate seizure detection mechanism. Kriging methods have been used extensively in geostatistics for spatial prediction and are known for very high accuracy. By modeling the brain as a spatial map, we demonstrate the effectiveness of Kriging methods for efficient seizure detection in an edge computing paradigm. We explore three different types of Kriging: Simple Kriging, Ordinary Kriging and Universal Kriging. Results from various experiments with electroencephalogram (EEG) signals of both healthy and diseased patients show that all three Kriging methods have good performance in terms of accuracy, sensitivity and detection latency. However, Simple Kriging emerged as the slight favorite for seizure detection with a mean detection latency of 0.81 sec, an accuracy of 97.50%, a sensitivity of 94.74% and a perfect specificity. Simple Kriging is at least 5% better than Ordinary Kriging and Universal Kriging when evaluated at a 68.2% confidence interval. The results obtained in this paper compare favorably with other seizure detection models in the literature.
  3. Existing deep learning systems commonly parallelize deep neural network (DNN) training using data or model parallelism, but these strategies often result in suboptimal parallelization performance. We introduce SOAP, a more comprehensive search space of parallelization strategies for DNNs that includes strategies to parallelize a DNN in the Sample, Operator, Attribute, and Parameter dimensions. We present FlexFlow, a deep learning engine that uses guided randomized search of the SOAP space to find a fast parallelization strategy for a specific parallel machine. To accelerate this search, FlexFlow introduces a novel execution simulator that can accurately predict a parallelization strategy’s performance and is three orders of magnitude faster than prior approaches that execute each strategy. We evaluate FlexFlow with six real-world DNN benchmarks on two GPU clusters and show that FlexFlow increases training throughput by up to 3.3× over state-of-the-art approaches, even when including its search time, and also improves scalability. 
  4. In the past decade, Deep Neural Networks (DNNs), e.g., Convolutional Neural Networks, achieved human-level performance in vision tasks such as object classification and detection. However, DNNs are known to be computationally expensive and thus hard to deploy in real-time and edge applications. Many previous works have focused on DNN model compression to obtain smaller parameter sizes and, consequently, less computational cost. Such methods, however, often introduce noticeable accuracy degradation. In this work, we optimize a state-of-the-art DNN-based video detection framework, Deep Feature Flow (DFF), from the cloud end using three proposed ideas. First, we propose Asynchronous DFF (ADFF) to asynchronously execute the neural networks. Second, we propose a Video-based Dynamic Scheduling (VDS) method that decides the detection frequency based on the magnitude of movement between video frames. Last, we propose Spatial Sparsity Inference (SSI), which only performs the inference on part of the video frame and thus reduces the computation cost. According to our experimental results, ADFF can reduce the bottleneck latency from 89 to 19 ms. VDS increases the detection accuracy by 0.6% mAP without increasing computation cost. SSI further saves 0.2 ms with a 0.6% mAP degradation of detection accuracy.
  5. Recent advancements in Deep Neural Networks (DNNs) have enabled widespread deployment in multiple security-sensitive domains. The need for resource-intensive training and the use of valuable domain-specific training data have made these models the top intellectual property (IP) for model owners. One of the major threats to DNN privacy is model extraction attacks where adversaries attempt to steal sensitive information in DNN models. In this work, we propose an advanced model extraction framework DeepSteal that steals DNN weights remotely for the first time with the aid of a memory side-channel attack. Our proposed DeepSteal comprises two key stages. Firstly, we develop a new weight bit information extraction method, called HammerLeak, through adopting the rowhammer-based fault technique as the information leakage vector. HammerLeak leverages several novel system-level techniques tailored for DNN applications to enable fast and efficient weight stealing. Secondly, we propose a novel substitute model training algorithm with Mean Clustering weight penalty, which leverages the partial leaked bit information effectively and generates a substitute prototype of the target victim model. We evaluate the proposed model extraction framework on three popular image datasets (e.g., CIFAR-10/100/GTSRB) and four DNN architectures (e.g., ResNet-18/34/Wide-ResNet/VGG-11). The extracted substitute model has successfully achieved more than 90% test accuracy on deep residual networks for the CIFAR-10 dataset. Moreover, our extracted substitute model could also generate effective adversarial input samples to fool the victim model. Notably, it achieves similar performance (i.e., ~1-2% test accuracy under attack) as white-box adversarial input attack (e.g., PGD/Trades).
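The sensitivity, specificity and accuracy figures quoted in the seizure detection abstracts above follow the standard confusion-matrix definitions, sketched below. The function name `detection_metrics` is hypothetical, and the counts in the usage example are illustrative only: none of the abstracts gives a confusion matrix, and these numbers were picked simply because they yield percentages of the same form as those reported for Simple Kriging.

```python
def detection_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp: seizure segments flagged as seizure    fn: seizure segments missed
    tn: normal segments left unflagged         fp: normal segments flagged
    """
    sensitivity = tp / (tp + fn)   # share of true seizures detected
    specificity = tn / (tn + fp)   # share of normal segments kept clean
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Illustrative counts only (not taken from any of the papers).
sens, spec, acc = detection_metrics(tp=18, fn=1, tn=21, fp=0)
print(f"sensitivity={sens:.2%} specificity={spec:.2%} accuracy={acc:.2%}")
# prints: sensitivity=94.74% specificity=100.00% accuracy=97.50%
```

Reporting all three together matters for seizure detection because a detector can reach high accuracy on imbalanced data while still missing seizures; sensitivity exposes that failure mode directly.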