Title: Comparison of raw accelerometry data from ActiGraph, Apple Watch, Garmin, and Fitbit using a mechanical shaker table
The purpose of this study was to evaluate the reliability and validity of the raw accelerometry output from research-grade and consumer wearable devices compared to accelerations produced by a mechanical shaker table. Raw accelerometry data from a total of 40 devices (n = 10 ActiGraph wGT3X-BT, n = 10 Apple Watch Series 7, n = 10 Garmin Vivoactive 4S, and n = 10 Fitbit Sense) were compared to reference accelerations produced by an orbital shaker table at speeds ranging from 0.6 Hz (4.4 milligravity, mg) to 3.2 Hz (124.7 mg). Two-way random-effects absolute-agreement intraclass correlation coefficients (ICC) tested inter-device reliability. Pearson product-moment correlation, Lin’s concordance correlation coefficient (CCC), absolute error, mean bias, and equivalence testing were calculated to assess the validity of the raw estimates from the devices against the reference metric. Estimates from Apple, ActiGraph, Garmin, and Fitbit were reliable, with ICCs = 0.99, 0.97, 0.88, and 0.88, respectively. Estimates from ActiGraph, Apple, and Fitbit devices exhibited excellent concordance with the reference (CCCs = 0.88, 0.83, and 0.85, respectively), while estimates from Garmin exhibited moderate concordance (CCC = 0.59) based on the mean aggregation method. ActiGraph, Apple, and Fitbit produced similar absolute errors (16.9 mg, 21.6 mg, and 22.0 mg, respectively), while Garmin produced a higher absolute error (32.5 mg) compared to the reference. ActiGraph produced the lowest mean bias, 0.0 mg (95% CI = -40.0, 41.0). Equivalence testing revealed that raw accelerometry data from all devices were not statistically significantly within the equivalence bounds of the shaker speed. Findings from this study provide evidence that raw accelerometry data from Apple, Garmin, and Fitbit devices can be used to reliably estimate movement; however, no estimates were statistically significantly equivalent to the reference.
Future studies could explore device-agnostic and harmonization methods for estimating physical activity using the raw accelerometry signals from the consumer wearables studied herein.
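As a minimal sketch of three of the agreement metrics named in the abstract (Lin's CCC, mean bias, and absolute error), the Python below computes them for paired device and reference values. The function names and sample values are hypothetical and are not the study's code or data; only the 4.4 mg and 124.7 mg endpoints are taken from the abstract, and the CCC here uses population (1/n) moments, which is an assumption about the estimator.

```python
from statistics import fmean


def lins_ccc(x, y):
    """Lin's concordance correlation coefficient with population (1/n) moments."""
    n = len(x)
    mean_x, mean_y = fmean(x), fmean(y)
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Penalizes both scatter (covariance) and location shift (mean difference).
    return 2 * cov_xy / (var_x + var_y + (mean_x - mean_y) ** 2)


def mean_bias(device, reference):
    """Average signed difference, device minus reference."""
    return fmean(d - r for d, r in zip(device, reference))


def mean_absolute_error(device, reference):
    """Average unsigned difference between device and reference."""
    return fmean(abs(d - r) for d, r in zip(device, reference))


# Hypothetical paired accelerations in mg; the middle values are made up.
reference = [4.4, 20.0, 50.0, 90.0, 124.7]
device = [6.0, 18.5, 55.0, 84.0, 130.0]

print("CCC:", round(lins_ccc(device, reference), 3))
print("Mean bias (mg):", round(mean_bias(device, reference), 2))
print("Absolute error (mg):", round(mean_absolute_error(device, reference), 2))
```

A mean bias near zero can coexist with a sizeable absolute error when over- and under-estimates cancel, which is why the two are worth reporting together.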
Award ID(s):
2246582
PAR ID:
10539634
Editor(s):
Yamada, Yosuke
Publisher / Repository:
PLOS ONE
Date Published:
Journal Name:
PLOS ONE
Volume:
19
Issue:
3
ISSN:
1932-6203
Page Range / eLocation ID:
e0286898
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract. Study Objectives: Evaluate wrist-placed accelerometry-predicted heart rate compared to electrocardiogram (ECG) heart rate in children during sleep. Methods: Children (n = 82, 61% male, 43.9% black) wore a wrist-placed Apple Watch Series 7 (AWS7) and ActiGraph GT9X during a polysomnogram. Three-axis accelerometry data were extracted from the AWS7 and the GT9X. Accelerometry heart rate estimates were derived from jerk (the rate of acceleration change), computed using the peak magnitude frequency in short-time Fourier transforms of Hilbert-transformed jerk computed from acceleration magnitude. Heart rates from ECG traces were estimated from R-R intervals using R-pulse detection. Lin’s concordance correlation coefficient (CCC), mean absolute error (MAE), and mean absolute percent error (MAPE) assessed agreement with ECG-estimated heart rate. Secondary analyses explored agreement by polysomnography sleep stage and a signal quality metric. Results: The developed scripts are available on GitHub. For the GT9X, CCC was poor at −0.11 and MAE and MAPE were high at 16.8 (SD = 14.2) beats/minute and 20.4% (SD = 18.5%). For AWS7, CCC was moderate at 0.61 while MAE and MAPE were lower at 6.4 (SD = 9.9) beats/minute and 7.3% (SD = 10.3%). Accelerometry-estimated heart rate for AWS7 was more closely related to ECG heart rate during N2, N3, and REM sleep than during lights on, wake, and N1, and when signal quality was high. These patterns were not evident for the GT9X. Conclusions: Raw accelerometry data extracted from AWS7, but not the GT9X, can be used to estimate heart rate in children while they sleep. Future work is needed to explore the sources (i.e., hardware, software, etc.) of the GT9X’s poor performance.
  2. Abstract. Introduction: Current wearables that collect heart rate and acceleration were not designed for children and/or do not allow access to raw signals, making them fundamentally unverifiable. This study describes the creation and calibration of an open-source multichannel platform (PATCH) designed to measure heart rate and acceleration in children ages 3–8 yr. Methods: Children (N = 63; mean age, 6.3 yr) participated in a 45-min protocol ranging in intensity from sedentary to vigorous activity. Actiheart-5 was used as a comparison measure. We calculated mean bias, mean absolute error (MAE), mean absolute percent error (MA%E), Pearson correlations, and Lin’s concordance correlation coefficient (CCC). Results: Mean bias between PATCH and Actiheart heart rate was 2.26 bpm, MAE was 6.67 bpm, and MA%E was 5.99%. The correlation between PATCH and Actiheart heart rate was 0.89, and CCC was 0.88. For acceleration, mean bias was 1.16 mg and MAE was 12.24 mg. The correlation between PATCH and Actiheart was 0.96, and CCC was 0.95. Conclusions: The PATCH demonstrated clinically acceptable accuracies to measure heart rate and acceleration compared with a research-grade device.
  3. Purpose: Our study evaluated the agreement of mean daily step counts, peak 1-min cadence, and peak 30-min cadence between the hip-worn ActiGraph GT3X+ accelerometer, using the normal filter (AG_N) and the low frequency extension (AG_LFE), and the thigh-worn activPAL3 micro (AP) accelerometer among older adults. Methods: Nine hundred and fifty-three older adults (≥65 years) were recruited to wear the ActiGraph device concurrently with the AP for 4–7 days beginning in 2016. Using the AP as the reference measure, device agreement for each step-based metric was assessed using mean differences (AG_N − AP and AG_LFE − AP), mean absolute percentage error (MAPE), and Pearson and concordance correlation coefficients. Results: For AG_N − AP, the mean differences and MAPE were: daily steps −1,851 steps/day and 27.2%, peak 1-min cadence −16.2 steps/min and 16.3%, and peak 30-min cadence −17.7 steps/min and 24.0%. Pearson coefficients were .94, .85, and .91 and concordance coefficients were .81, .65, and .73, respectively. For AG_LFE − AP, the mean differences and MAPE were: daily steps 4,968 steps/day and 72.7%, peak 1-min cadence −1.4 steps/min and 4.7%, and peak 30-min cadence 1.4 steps/min and 7.0%. Pearson coefficients were .91, .91, and .95 and concordance coefficients were .49, .91, and .94, respectively. Conclusions: Compared with estimates from the AP, the AG_N underestimated daily step counts by approximately 1,800 steps/day, while the AG_LFE overestimated by approximately 5,000 steps/day. However, peak step cadence estimates generated from the AG_LFE and AP had high agreement (MAPE ≤ 7.0%). Additional convergent validation studies of step-based metrics from concurrently worn accelerometers are needed for improved understanding of between-device agreement.
  4. Abstract. Background: Hip-worn accelerometer cut-points have poor validity for assessing children’s sedentary time, which may partly explain the equivocal health associations shown in prior research. Improved processing/classification methods for these monitors would enrich the evidence base and inform the development of more effective public health guidelines. The present study aimed to develop and evaluate a novel computational method (CHAP-child) for classifying sedentary time from hip-worn accelerometer data. Methods: Participants were 278 children aged 8–11 years recruited from nine primary schools in Melbourne, Australia with differing socioeconomic status. Participants concurrently wore a thigh-worn activPAL (ground truth) and hip-worn ActiGraph (test measure) during up to 4 seasonal assessment periods, each lasting up to 8 days. activPAL data were used to train and evaluate the CHAP-child deep learning model to classify each 10-s epoch of raw ActiGraph acceleration data as sitting or non-sitting, creating comparable information from the two monitors. CHAP-child was evaluated alongside the current practice 100 counts per minute (cpm) method for hip-worn ActiGraph monitors. Performance was tested for each 10-s epoch and for participant-season level sedentary time and bout variables (e.g., mean bout duration). Results: Across participant-seasons, CHAP-child correctly classified each epoch as sitting or non-sitting relative to activPAL, with mean balanced accuracy of 87.6% (SD = 5.3%). Sit-to-stand transitions were correctly classified with mean sensitivity of 76.3% (SD = 8.3). For most participant-season level variables, CHAP-child estimates were within ± 11% (mean absolute percent error [MAPE]) of activPAL, and correlations between CHAP-child and activPAL were generally very large (> 0.80). For the current practice 100 cpm method, most MAPEs were greater than ± 30% and most correlations were small or moderate (≤ 0.60) relative to activPAL. Conclusions: There was strong support for the concurrent validity of the CHAP-child classification method, which allows researchers to derive activPAL-equivalent measures of sedentary time, sit-to-stand transitions, and sedentary bout patterns from hip-worn triaxial ActiGraph data. Applying CHAP-child to existing datasets may provide greater insights into the potential impacts and influences of sedentary time in children.
  5. This dataset supports the study "A method for intelligent allocation of diagnostic testing by leveraging data from commercial wearable devices: a case study on COVID-19", which developed an Intelligent Testing Allocation (ITA) method. The study demonstrated the efficacy of using continuous digital biomarkers such as resting heart rate and steps to enhance COVID-19 diagnostic testing positivity rates. The findings suggest significant potential for large-scale, symptom-independent surveillance testing to alleviate diagnostic test shortages. The provided data are from the CovIdentify study launched by Duke's BIG IDEAs Lab in the Biomedical Engineering Department. From April 2nd, 2020 to May 25th, 2021, 2,887 participants connected their smartwatches to the CovIdentify platform, including 1,689 Garmin, 1,091 Fitbit, and 107 Apple smartwatches.
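Item 4 above reports epoch-level balanced accuracy for sitting versus non-sitting classification. As a rough illustration only, the Python sketch below computes balanced accuracy as the mean of sensitivity and specificity for a pair of label sequences. The function name, label strings, and sample epochs are hypothetical and are not taken from the CHAP-child study.

```python
def balanced_accuracy(truth, predicted, positive="sitting"):
    """Mean of sensitivity and specificity for a two-class labelling."""
    tp = fn = tn = fp = 0
    for t, p in zip(truth, predicted):
        if t == positive:
            if p == positive:
                tp += 1  # sitting epoch correctly labelled sitting
            else:
                fn += 1  # sitting epoch missed
        else:
            if p == positive:
                fp += 1  # non-sitting epoch wrongly labelled sitting
            else:
                tn += 1  # non-sitting epoch correctly labelled
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must appear in the reference labels")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical 10-s epochs: reference labels and model output (made-up data).
truth_epochs = ["sitting"] * 4 + ["non-sitting"] * 2
predicted_epochs = ["sitting", "sitting", "sitting", "non-sitting",
                    "non-sitting", "sitting"]

print(balanced_accuracy(truth_epochs, predicted_epochs))
```

Averaging the two rates keeps the score from being dominated by whichever posture makes up most of the wear time, which plain accuracy would not do.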