skip to main content


The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 5:00 PM ET until 11:00 PM ET on Friday, June 21 due to maintenance. We apologize for the inconvenience.

Title: Knee Acoustic Emissions as a Digital Biomarker of Disease Status in Juvenile Idiopathic Arthritis
In this paper, we quantify the joint acoustic emissions (JAEs) from the knees of children with juvenile idiopathic arthritis (JIA) and support their use as a novel biomarker of the disease. JIA is the most common rheumatic disease of childhood; it has a highly variable presentation, and few reliable biomarkers which makes diagnosis and personalization of care difficult. The knee is the most commonly affected joint with hallmark synovitis and inflammation that can extend to damage the underlying cartilage and bone. During movement of the knee, internal friction creates JAEs that can be non-invasively measured. We hypothesize that these JAEs contain clinically relevant information that could be used for the diagnosis and personalization of treatment of JIA. In this study, we record and compare the JAEs from 25 patients with JIA−10 of whom were recorded a second time 3–6 months later—and 18 healthy age- and sex-matched controls. We compute signal features from each of those record cycles of flexion/extension and train a logistic regression classification model. The model classified each cycle as having JIA or being healthy with 84.4% accuracy using leave-one-subject-out cross validation (LOSO-CV). When assessing the full JAE recording of a subject (which contained at least 8 cycles of flexion/extension), a majority vote of the cycle labels accurately classified the subjects as having JIA or being healthy 100% of the time. Using the output probabilities of a JIA class as a basis for a joint health score and test it on the follow-up patient recordings. In all 10 of our 6-week follow-up recordings, the score accurately tracked with successful treatment of the condition. Our proposed JAE-based classification model of JIA presents a compelling case for incorporating this novel joint health assessment technique into the clinical work-up and monitoring of JIA.  more » « less
Award ID(s):
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Digital Health
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Joint acoustic emission (JAE) sensing has recently proven to be a viable technique for non-invasive quantification indicating knee joint health. In this work, we adapt the acoustic emission sensing method to measure the JAEs of the wrist—another joint commonly affected by injury and degenerative disease. JAEs of seven healthy volunteers were recorded during wrist flexion-extension and rotation with sensitive uniaxial accelerometers placed at eight locations around the wrist. The acoustic data were bandpass filtered (150 Hz–20 kHz). The signal-to-noise ratio (SNR) was used to quantify the strength of the JAE signals in each recording. Then, nine audio features were extracted, and the intraclass correlation coefficient (ICC) (model 3,k), coefficients of variability (CVs), and Jensen–Shannon (JS) divergence were calculated to evaluate the interrater repeatability of the signals. We found that SNR ranged from 4.1 to 9.8 dB, intrasession and intersession ICC values ranged from 0.629 to 0.886, CVs ranged from 0.099 to 0.241, and JS divergence ranged from 0.18 to 0.20, demonstrating high JAE repeatability and signal strength at three locations. The volunteer sample size is not large enough to represent JAE analysis of a larger population, but this work will lay a foundation for future work in using wrist JAEs to aid in diagnosis and treatment tracking of musculoskeletal pathologies and injury in wearable systems. 
    more » « less
  2. Abstract Background

    Joint acoustic emissions from knees have been evaluated as a convenient, non-invasive digital biomarker of inflammatory knee involvement in a small cohort of children with Juvenile Idiopathic Arthritis (JIA). The objective of the present study was to validate this in a larger cohort.


    A total of 116 subjects (86 JIA and 30 healthy controls) participated in this study. Of the 86 subjects with JIA, 43 subjects had active knee involvement at the time of study. Joint acoustic emissions were bilaterally recorded, and corresponding signal features were used to train a machine learning algorithm (XGBoost) to classify JIA and healthy knees. All active JIA knees and 80% of the controls were used as training data set, while the remaining knees were used as testing data set. Leave-one-leg-out cross-validation was used for validation on the training data set. Validation on the training and testing set of the classifier resulted in an accuracy of 81.1% and 87.7% respectively. Sensitivity / specificity for the training and testing validation was 88.6% / 72.3% and 88.1% / 83.3%, respectively. The area under the curve of the receiver operating characteristic curve was 0.81 for the developed classifier. The distributions of the joint scores of the active and inactive knees were significantly different.


    Joint acoustic emissions can serve as an inexpensive and easy-to-use digital biomarker to distinguish JIA from healthy controls. Utilizing serial joint acoustic emission recordings can potentially help monitor disease activity in JIA affected joints to enable timely changes in therapy.

    more » « less
  3. Abstract STUDY QUESTION

    Can we derive adequate models to predict the probability of conception among couples actively trying to conceive?


    Leveraging data collected from female participants in a North American preconception cohort study, we developed models to predict pregnancy with performance of ∼70% in the area under the receiver operating characteristic curve (AUC).


    Earlier work has focused primarily on identifying individual risk factors for infertility. Several predictive models have been developed in subfertile populations, with relatively low discrimination (AUC: 59–64%).


    Study participants were female, aged 21–45 years, residents of the USA or Canada, not using fertility treatment, and actively trying to conceive at enrollment (2013–2019). Participants completed a baseline questionnaire at enrollment and follow-up questionnaires every 2 months for up to 12 months or until conception. We used data from 4133 participants with no more than one menstrual cycle of pregnancy attempt at study entry.


    On the baseline questionnaire, participants reported data on sociodemographic factors, lifestyle and behavioral factors, diet quality, medical history and selected male partner characteristics. A total of 163 predictors were considered in this study. We implemented regularized logistic regression, support vector machines, neural networks and gradient boosted decision trees to derive models predicting the probability of pregnancy: (i) within fewer than 12 menstrual cycles of pregnancy attempt time (Model I), and (ii) within 6 menstrual cycles of pregnancy attempt time (Model II). Cox models were used to predict the probability of pregnancy within each menstrual cycle for up to 12 cycles of follow-up (Model III). We assessed model performance using the AUC and the weighted-F1 score for Models I and II, and the concordance index for Model III.


    Model I and II AUCs were 70% and 66%, respectively, in parsimonious models, and the concordance index for Model III was 63%. The predictors that were positively associated with pregnancy in all models were: having previously breastfed an infant and using multivitamins or folic acid supplements. The predictors that were inversely associated with pregnancy in all models were: female age, female BMI and history of infertility. Among nulligravid women with no history of infertility, the most important predictors were: female age, female BMI, male BMI, use of a fertility app, attempt time at study entry and perceived stress.


    Reliance on self-reported predictor data could have introduced misclassification, which would likely be non-differential with respect to the pregnancy outcome given the prospective design. In addition, we cannot be certain that all relevant predictor variables were considered. Finally, though we validated the models using split-sample replication techniques, we did not conduct an external validation study.


    Given a wide range of predictor data, machine learning algorithms can be leveraged to analyze epidemiologic data and predict the probability of conception with discrimination that exceeds earlier work.


    The research was partially supported by the U.S. National Science Foundation (under grants DMS-1664644, CNS-1645681 and IIS-1914792) and the National Institutes for Health (under grants R01 GM135930 and UL54 TR004130). In the last 3 years, L.A.W. has received in-kind donations for primary data collection in PRESTO from,, Sandstone Diagnostics and Swiss Precision Diagnostics. L.A.W. also serves as a fibroid consultant to AbbVie, Inc. The other authors declare no competing interests.



    more » « less
  4. Abstract This paper presents a new two-step design procedure and preliminary kinematic evaluation of a novel, passive, six-bar knee-ankle-foot orthosis (KAFO). The kinematic design and preliminary kinematic gait analysis of the KAFO are based on motion capture data from a single healthy male subject. Preliminary kinematic evaluation shows that the designed passive KAFO is capable of supporting flexion and extension of the knee joint during stance and swing phases of walking. The two-step design procedure for the KAFO consists of (1) computational synthesis based on user's motion data and (2) performance optimization. In the computational synthesis step, first the lower leg (knee-ankle-foot) of the subject is approximated as a 2R kinematic chain and its target trajectories are specified from motion capture data. Six-bar linkages are synthesized to coordinate the angular movements of knee and ankle joints of the 2R chain at 11 accuracy points. The first step of the design procedure yields 332 six-bar KAFO design candidates. This is followed by a performance optimization step in which the KAFO design candidates are optimally modified to satisfy specified constraints on end-effector trajectory and shape. This two-step process yields an optimally designed passive six-bar KAFO that shows promising kinematic results at the knee joint of the user during walking. The preliminary prototype manufactured is cost effective, easy to operate, and suitably demonstrates the feasibility of the proposed concept. 
    more » « less
  5. null (Ed.)
    Introduction: Alzheimer’s disease (AD) causes progressive irreversible cognitive decline and is the leading cause of dementia. Therefore, a timely diagnosis is imperative to maximize neurological preservation. However, current treatments are either too costly or limited in availability. In this project, we explored using retinal vasculature as a potential biomarker for early AD diagnosis. This project focuses on stage 3 of a three-stage modular machine learning pipeline which consisted of image quality selection, vessel map generation, and classification [1]. The previous model only used support vector machine (SVM) to classify AD labels which limited its accuracy to 82%. In this project, random forest and gradient boosting were added and, along with SVM, combined into an ensemble classifier, raising the classification accuracy to 89%. Materials and Methods: Subjects classified as AD were those who were diagnosed with dementia in “Dementia Outcome: Alzheimer’s disease” from the UK Biobank Electronic Health Records. Five control groups were chosen with a 5:1 ratio of control to AD patients where the control patients had the same age, gender, and eye side image as the AD patient. In total, 122 vessel images from each group (AD and control) were used. The vessel maps were then segmented from fundus images through U-net. A t-test feature selection was first done on the training folds and the selected features was fed into the classifiers with a p-value threshold of 0.01. Next, 20 repetitions of 5-fold cross validation were performed where the hyperparameters were solely tuned on the training data. An ensemble classifier consisting of SVM, gradient boosting tree, and random forests was built and the final prediction was made through majority voting and evaluated on the test set. Results and Discussion: Through ensemble classification, accuracy increased by 4-12% relative to the individual classifiers, precision by 9-15%, sensitivity by 2-9%, specificity by at least 9-16%, and F1 score by 712%. Conclusions: Overall, a relatively high classification accuracy was achieved using machine learning ensemble classification with SVM, random forest, and gradient boosting. Although the results are very promising, a limitation of this study is that the requirement of needing images of sufficient quality decreased the amount of control parameters that can be implemented. However, through retinal vasculature analysis, this project shows machine learning’s high potential to be an efficient, more cost-effective alternative to diagnosing Alzheimer’s disease. Clinical Application: Using machine learning for AD diagnosis through retinal images will make screening available for a broader population by being more accessible and cost-efficient. Mobile device based screening can also be enabled at primary screening in resource-deprived regions. It can provide a pathway for future understanding of the association between biomarkers in the eye and brain. 
    more » « less