NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Effective Posture Classification using Statistically Significant Data from Flexible Pressure Sensors

https://doi.org/10.1109/JFLEX.2024.3400151

Yoon, Jungeun; Moon, Aekyeung; Son, Seung Woo (January 2024, IEEE Journal on Flexible Electronics)

Advancements in flexible and printable sensor technologies to overcome the limitations of conventional rigid counterparts offer an excellent opportunity to design various healthcare applications for humans, and their potential flexibility can be used in real-time health monitoring and personalized physical conditions with minimal or no inconvenience. However, managing a large volume of obtained sensor datasets and ensuring accurate predictions can take time and effort. While statistical analysis and the Pearson correlation coefficient can reduce data volume, whether this would lead to losing important information and affect downstream application performance is still being determined. In this paper, we use posture classification as an exemplar of timely services in digital healthcare, especially for bedsores or decubitus ulcers. Our sensors, placed under hospital beds, have a thickness of just 0.4mm and collect pressure data from 28 sensors (7 by 4) at an 8 Hz cycle, categorizing postures into 4 types from 5 patients. We then collected sensor data to explore the possibility of using a small number of pressure sensors for patient posture classification. Next, we apply a statistical analysis to the datasets obtained to select the featured sensor data cells and evaluate the performance of posture classification models on various groups of sensors. Our evaluation involves the analysis of reduced datasets through statistical methods and the Pearson correlation coefficient. The classification performance using datasets comprising 5 featured and 28 sensors are 0.93 and 0.99, respectively. These results suggest comparable performance and the viability of useful classifiers for both cases. Consequently, comparable posture classification performance can be achieved using only 17.9% of the entire dataset.
more » « less
Full Text Available
Anomaly Detection in Scientific Datasets using Sparse Representation

https://doi.org/10.1145/3588982.3603610

Moon, Aekyeung; Kim, Minjun; Chen, Jiaxi; Son, Seung Woo (August 2023, Proceedings of the First Workshop on AI for Systems)

As the size and complexity of high-performance computing (HPC) systems keep growing, scientists' ability to trust the data produced is paramount due to potential data corruption for various reasons, which may stay undetected. While employing machine learning-based anomaly detection techniques could relieve scientists of such concern, it is practically infeasible due to the need for labels for volumes of scientific datasets and the unwanted extra overhead associated. In this paper, we exploit spatial sparsity profiles exhibited in scientific datasets and propose an approach to detect anomalies effectively. Our method first extracts block-level sparse representations of original datasets in the transformed domain. Then it learns from the extracted sparse representations and builds the boundary threshold between normal and abnormal without relying on labeled data. Experiments using real-world scientific datasets show that the proposed approach requires 13% on average (less than 10% in most cases and as low as 0.3%) of the entire dataset to achieve competitive detection accuracy (70.74%-100.0%) as compared to two state-of-the-art unsupervised techniques.
more » « less
Full Text Available
Exploration Of Lossy Posture Classification Model Using In-Bed Flexible Pressure Sensors

https://doi.org/10.1109/FLEPS57599.2023.10220219

Moon, Aekyeung; Son, Seung Woo; Kim, Minjun; Chang, Seyun; Park, Hyeji (July 2023, IEEE International Conference on Flexible and Printable Sensors and Systems (FLEPS))

Advances in flexible and printable sensor technologies have made it possible to use posture classification for providing timely services in digital healthcare, especially for bedsores or decubitus ulcers. However, managing a large amount of sensor data and ensuring accurate predictions can be challenging. While lossy compressors can reduce data volume, it is still unclear whether this would lead to losing important information and affect downstream application performance. In this paper, we propose LCDNN (Lossy Compression using Deep Neural Network) to reduce the size of sensor data and evaluate the performance of posture classification models. Our sensors, placed under hospital beds, have a thickness of just 0.4mm and collect pressure data from 28 sensors (7 by 4) at an 8 Hz cycle, categorizing postures into 4 types from 5 patients. Our evaluation, which includes reduced datasets by LCDNN, demonstrates that the results are promising.
more » « less
Full Text Available
Lossy Predictive Models for Accurate Classification Algorithms

https://doi.org/10.1109/BigData55660.2022.10020381

Moon, Aekyeung; Son, Seung Woo; Kim, Hyson; Kim, Minjun (December 2022, IEEE International Conference on Big Data (Big Data))

Recent years have witnessed an upsurge of interest in lossy compression due to its potential to significantly reduce data volume with adequate exploitation of the spatiotemporal properties of IoT datasets. However, striking a balance between compression ratios and data fidelity is challenging, particularly when losing data fidelity impacts downstream data analytics noticeably. In this paper, we propose a lossy prediction model dealing with binary classification analytics tasks to minimize the impact of the error introduced due to lossy compression. We specifically focus on five classification algorithms for frost prediction in agricultural fields allowing preparation by the predictive advisories to provide helpful information for timely services. While our experimental evaluations reaffirm the nature of lossy compressions where allowing higher errors offers higher compression ratios, we also observe that the classification performance in terms of accuracy and F-1 score differs among all the algorithms we evaluated. Specifically, random forest is the best lossy prediction model for classifying frost. Lastly, we show the robustness of the lossy prediction model based on the data fidelity in prediction performance.
more » « less
Full Text Available
Towards Guaranteeing Error Bound in DCT-based Lossy Compression

https://doi.org/10.1109/BigData55660.2022.10020345

Chen, Jiaxi; Moon, Aekyeung; Son, Seung Woo (December 2022, IEEE International Conference on Big Data (Big Data))

High-performance computing (HPC) systems that run scientific simulations of significance produce a large amount of data during runtime. Transferring or storing such big datasets causes a severe I/O bottleneck and a considerable storage burden. Applying compression techniques, particularly lossy compressors, can reduce the size of the data and mitigate such overheads. Unlike lossless compression algorithms, error-controlled lossy compressors could significantly reduce the data size while respecting the user-defined error bound. DCTZ is one of the transform-based lossy compressors with a highly efficient encoding and purpose-built error control mechanism that accomplishes high compression ratios with high data fidelity. However, since DCTZ quantizes the DCT coefficients in the frequency domain, it may only partially control the relative error bound defined by the user. In this paper, we aim to improve the compression quality of DCTZ. Specifically, we propose a preconditioning method based on level offsetting and scaling to control the magnitude of input of the DCTZ framework, thereby enforcing stricter error bounds. We evaluate the performance of our method in terms of compression ratio and rate distortion with real-world HPC datasets. Our experimental result shows that our method can achieve a higher compression ratio than other state-of-the-art lossy compressors with a tighter error bound while precisely guaranteeing the user-defined error bound.
more » « less
Full Text Available
Characterization of Transform-Based Lossy Compression for HPC Datasets

https://doi.org/10.1109/DRBSD56682.2022.00013

Moon, Aekyeung; Chen, Jiaxi; Son, Seung Woo; Kim, Minjun (November 2022, IEEE/ACM 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD))

As the scale and complexity of high-performance computing (HPC) systems keep growing, data compression techniques are often adopted to reduce the data volume and processing time. While lossy compression becomes preferable to a lossless one because of the potential benefit of generating a high compression ratio, it would lose its worth the effort without finding an optimal balance between volume reduction and information loss. Among many lossy compression techniques, transform-based lossy algorithms utilize spatial redundancy better. However, the transform-based lossy compressor has received relatively less attention because there is a lack of understanding of its compression performance on scientific data sets. The insight of this paper is that, in transform-based lossy compressors, quantifying dominant coefficients at the block level reveals the right balance, potentially impacting overall compression ratios. Motivated by this, we characterize three transformation-based lossy compression mechanisms with different information compaction methods using the statistical features that capture data characteristics. And then, we build several prediction models using the statistical features and the characteristics of dominant coefficients and evaluate the effectiveness of each model using six HPC datasets from three production-level simulations at scale. Our results demonstrate that the random forest classifier captures the behavior of dominant coefficients precisely, achieving nearly 99% of prediction accuracy.
more » « less
Full Text Available
Cascaded Dimension Reduction for Effective Anomaly Detection

https://doi.org/10.1109/BigData52589.2021.9671364

Zhuo, Xiaoyan; Moon, Aekyeung; Zhang, Jialing; Son, Seung Woo (December 2021, IEEE International Conference on Big Data (Big Data))

Full Text Available
DPZ: Improving Lossy Compression Ratio with Information Retrieval on Scientific Data

https://doi.org/10.1109/Cluster48925.2021.00056

Zhang, Jialing; Chen, Jiaxi; Zhuo, Xiaoyan; Moon, Aekyeung; Son, Seung Woo (September 2021, IEEE International Conference on Cluster Computing (CLUSTER))
null (Ed.)
Full Text Available
Anomaly Detection in Edge Nodes using Sparsity Profile

https://doi.org/10.1109/BigData50022.2020.9377757

Moon, Aekyeung; Zhuo, Xiaoyan; Zhang, Jialing; Son, Seung Woo; Jeong Song, Yun (December 2020, 2020 IEEE International Conference on Big Data (Big Data))
null (Ed.)
Edge devices with attentive sensors enable various intelligent services by exploring streams of sensor data. However, anomalies, which are inevitable due to faults or failures in the sensor and network, can result in incorrect or unwanted operational decisions. While promptly ensuring the accuracy of IoT data is critical, lack of labels for live sensor data and limited storage resources necessitates efficient and reliable detection of anomalies at edge nodes. Motivated by the existence of unique sparsity profiles that express original signals as a combination of a few coefficients between normal and abnormal sensing periods, we propose a novel anomaly detection approach, called ADSP (Anomaly Detection with Sparsity Profile). The key idea is to apply a transformation on the raw data, identify top-K dominant components that represent normal data behaviors, and detect data anomalies based on the disparity from K values approximating the periods of normal data in an unsupervised manner. Our evaluation using a set of synthetic datasets demonstrates that ADSP can achieve 92%–100% of detection accuracy. To validate our anomaly detection approach on real-world cases, we label potential anomalies using a range of error boundary conditions using sensors exhibiting a straight line in Q-Q plot and strong Pearson correlation and conduct a controlled comparison of the detection accuracy. Our experimental evaluation using real-world datasets demonstrates that ADSP can detect 83%– 92% of anomalies using only 1.7% of the original data, which is comparable to the accuracy achieved by using the entire datasets.
more » « less
Full Text Available
Understanding Bit-Error Trade-off of Transform-based Lossy Compression on Electrocardiogram Signals

https://doi.org/10.1109/BigData50022.2020.9378343

Moon, Aekyeung; Woo Son, Seung; Jung, Jiuk; Jeong Song, Yun (December 2020, 2020 IEEE International Conference on Big Data (Big Data))
null (Ed.)
The growing demand for recording longer ECG signals to improve the effectiveness of IoT-enabled remote clinical healthcare is contributing large amounts of ECG data. While lossy compression techniques have shown potential in significantly lowering the amount of data, investigation on how to trade-off between data reduction and data fidelity on ECG data received relatively less attention. This paper gives insight into the power of lossy compression to ECG signals by balancing between data quality and compression ratio. We evaluate the performance of transformed-based lossy compressions on the ECG datasets collected from the Biosemi ActiveTwo devices. Our experimental results indicate that ECG data exhibit high energy compaction property through transformations like DCT and DWT, thus could improve compression ratios significantly without hurting data fidelity much. More importantly, we evaluate the effect of lossy compression on ECG signals by validating the R-peak in the QRS complex. Our method can obtain low error rates measured in PRD (as low as 0.3) and PSNR (up to 67) using only 5% of the transform coefficients. Therefore, R-peaks in the reconstructed ECG signals are almost identical to ones in the original signals, thus facilitating extended ECG monitoring.
more » « less
Full Text Available

« Prev Next »

Search for: All records