CIM: A Novel Clustering-based Energy-Efficient Data Imputation Method for Human Activity Recognition

Hussein, Dina; Bhat, Ganapati

doi:10.1145/3609111

Citation Details

This content will become publicly available on October 31, 2024

CIM: A Novel Clustering-based Energy-Efficient Data Imputation Method for Human Activity Recognition

Human activity recognition (HAR) is an important component in a number of health applications, including rehabilitation, Parkinson’s disease, daily activity monitoring, and fitness monitoring. State-of-the-art HAR approaches use multiple sensors on the body to accurately identify activities at runtime. These approaches typically assume that data from all sensors are available for runtime activity recognition. However, data from one or more sensors may be unavailable due to malfunction, energy constraints, or communication challenges between the sensors. Missing data can lead to significant degradation in the accuracy, thus affecting quality of service to users. A common approach for handling missing data is to train classifiers or sensor data recovery algorithms for each combination of missing sensors. However, this results in significant memory and energy overhead on resource-constrained wearable devices. In strong contrast to prior approaches, this paper presents a clustering-based approach (CIM) to impute missing data at runtime. We first define a set of possible clusters and representative data patterns for each sensor in HAR. Then, we create and store a mapping between clusters across sensors. At runtime, when data from a sensor are missing, we utilize the stored mapping table to obtain most likely cluster for the missing sensor. The representative window for the identified cluster is then used as imputation to perform activity classification. We also provide a method to obtain imputation-aware activity prediction sets to handle uncertainty in data when using imputation. Experiments on three HAR datasets show that CIM achieves accuracy within 10% of a baseline without missing data for one missing sensor when providing single activity labels. The accuracy gap drops to less than 1% with imputation-aware classification. Measurements on a low-power processor show that CIM achieves close to 100% energy savings compared to state-of-the-art generative approaches.

Award ID(s):: 2238257

NSF-PAR ID:: 10500183

Author(s) / Creator(s):: Hussein, Dina; Bhat, Ganapati

Publisher / Repository:: Association for Computing Machinery

Date Published:: 2023-10-31

Journal Name:: ACM Transactions on Embedded Computing Systems

Volume:: 22

Issue:: 5s

ISSN:: 1539-9087

Page Range / eLocation ID:: 1 to 26

Subject(s) / Keyword(s):: ["Human activity recognition","wearable electronics","missing data detection","data imputation","clustering","health monitoring"]

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on October 31, 2024
Journal Article:
https://doi.org/10.1145/3609111

More Like this