skip to main content


Title: Automatic Estimation of Laryngeal Vestibule Closure Duration Using High-Resolution Cervical Auscultation Signals
Purpose Safe swallowing requires adequate protection of the airway to prevent swallowed materials from entering the trachea or lungs (i.e., aspiration). Laryngeal vestibule closure (LVC) is the first line of defense against swallowed materials entering the airway. Absent LVC or mistimed/shortened closure duration can lead to aspiration, adverse medical consequences, and even death. LVC mechanisms can be judged commonly through the videofluoroscopic swallowing study; however, this type of instrumentation exposes patients to radiation and is not available or acceptable to all patients. There is growing interest in noninvasive methods to assess/monitor swallow physiology. In this study, we hypothesized that our noninvasive sensor-based system, which has been shown to accurately track hyoid displacement and upper esophageal sphincter opening duration during swallowing, could predict laryngeal vestibule status, including the onset of LVC and the onset of laryngeal vestibule reopening, in real time and estimate the closure duration with a comparable degree of accuracy as trained human raters. Method The sensor-based system used in this study is high-resolution cervical auscultation (HRCA). Advanced machine learning techniques enable HRCA signal analysis through feature extraction and complex algorithms. A deep learning model was developed with a data set of 588 swallows from 120 patients with suspected dysphagia and further tested on 45 swallows from 16 healthy participants. Results The new technique achieved an overall mean accuracy of 74.90% and 75.48% for the two data sets, respectively, in distinguishing LVC status. Closure duration ratios between automated and gold-standard human judgment of LVC duration were 1.13 for the patient data set and 0.93 for the healthy participant data set. Conclusions This study found that HRCA signal analysis using advanced machine learning techniques can effectively predict laryngeal vestibule status (closure or opening) and further estimate LVC duration. HRCA is potentially a noninvasive tool to estimate LVC duration for diagnostic and biofeedback purposes without X-ray imaging.  more » « less
Award ID(s):
1652203
NSF-PAR ID:
10222451
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Perspectives of the ASHA Special Interest Groups
Volume:
5
Issue:
6
ISSN:
2381-4764
Page Range / eLocation ID:
1647 to 1656
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract High resolution cervical auscultation is a very promising noninvasive method for dysphagia screening and aspiration detection, as it does not involve the use of harmful ionizing radiation approaches. Automatic extraction of swallowing events in cervical auscultation is a key step for swallowing analysis to be clinically effective. Using time-varying spectral estimation of swallowing signals and deep feed forward neural networks, we propose an automatic segmentation algorithm for swallowing accelerometry and sounds that works directly on the raw swallowing signals in an online fashion. The algorithm was validated qualitatively and quantitatively using the swallowing data collected from 248 patients, yielding over 3000 swallows manually labeled by experienced speech language pathologists. With a detection accuracy that exceeded 95%, the algorithm has shown superior performance in comparison to the existing algorithms and demonstrated its generalizability when tested over 76 completely unseen swallows from a different population. The proposed method is not only of great importance to any subsequent swallowing signal analysis steps, but also provides an evidence that such signals can capture the physiological signature of the swallowing process. 
    more » « less
  2. Li-Jessen, Nicole Yee-Key (Ed.)
    The Earable device is a behind-the-ear wearable originally developed to measure cognitive function. Since Earable measures electroencephalography (EEG), electromyography (EMG), and electrooculography (EOG), it may also have the potential to objectively quantify facial muscle and eye movement activities relevant in the assessment of neuromuscular disorders. As an initial step to developing a digital assessment in neuromuscular disorders, a pilot study was conducted to determine whether the Earable device could be utilized to objectively measure facial muscle and eye movements intended to be representative of Performance Outcome Assessments, (PerfOs) with tasks designed to model clinical PerfOs, referred to as mock-PerfO activities. The specific aims of this study were: To determine whether the Earable raw EMG, EOG, and EEG signals could be processed to extract features describing these waveforms; To determine Earable feature data quality, test re-test reliability, and statistical properties; To determine whether features derived from Earable could be used to determine the difference between various facial muscle and eye movement activities; and, To determine what features and feature types are important for mock-PerfO activity level classification. A total of N = 10 healthy volunteers participated in the study. Each study participant performed 16 mock-PerfOs activities, including talking, chewing, swallowing, eye closure, gazing in different directions, puffing cheeks, chewing an apple, and making various facial expressions. Each activity was repeated four times in the morning and four times at night. A total of 161 summary features were extracted from the EEG, EMG, and EOG bio-sensor data. Feature vectors were used as input to machine learning models to classify the mock-PerfO activities, and model performance was evaluated on a held-out test set. Additionally, a convolutional neural network (CNN) was used to classify low-level representations of the raw bio-sensor data for each task, and model performance was correspondingly evaluated and compared directly to feature classification performance. The model’s prediction accuracy on the Earable device’s classification ability was quantitatively assessed. Study results indicate that Earable can potentially quantify different aspects of facial and eye movements and may be used to differentiate mock-PerfO activities. Specially, Earable was found to differentiate talking, chewing, and swallowing tasks from other tasks with observed F1 scores >0.9. While EMG features contribute to classification accuracy for all tasks, EOG features are important for classifying gaze tasks. Finally, we found that analysis with summary features outperformed a CNN for activity classification. We believe Earable may be used to measure cranial muscle activity relevant for neuromuscular disorder assessment. Classification performance of mock-PerfO activities with summary features enables a strategy for detecting disease-specific signals relative to controls, as well as the monitoring of intra-subject treatment responses. Further testing is needed to evaluate the Earable device in clinical populations and clinical development settings. 
    more » « less
  3. Backgroud The nasal route of targeted drug administration facilitates medical management of chronic and acute onsets of various respiratory conditions such as rhinitis and sinusitis and during the initial onset phase of severe acute respiratory syndrome coronavirus 2, when the infection is still contained within the upper airway. Nevertheless, patient comfort issues that are often associated with intranasal devise usage can lead to low compliance, thereby compromising treatment efficacy. Hence, there is an urgent need to detect reproducible and user-friendly intranasal drug delivery modalities that may promote adoption compliance and yet be effective at targeted transport of drugs to the infective airway regions. Methods In this pilot study, we have collected evaluation feedback from a cohort of 13 healthy volunteers, who used an open-angle swirling effect atomizer to assess two different nasal spray administration techniques (with 0.9% saline solution), namely the vertical placement protocol (or, VP), wherein the nozzle is held vertically upright at a shallow insertion depth of 0.5 cm inside the nasal vestibule; and the shallow angle protocol (or, SA), wherein the spray axis is angled at 45° to the vertical, with a vestibular insertion depth of 1.5 cm. The VP protocol is based on current usage instructions, while the SA protocol is derived from published findings on alternate spray orientations that have been shown to enhance targeted drug delivery at posterior infection sites, e.g., the ostiomeatal complex and the nasopharynx. Results All study participants reported that the SA protocol offered a more gentle and soothing delivery experience, with less impact pressure. Additionally, 60% of participants reported that the VP technique caused painful irritation. We also numerically tracked the drug transport processes for the two spray techniques in a computed tomography-based nasal cavity reconstruction; the SA protocol registered a distinct improvement in airway penetration when compared to the VP protocol. Conclusion The participant-reported unequivocally favorable experience with the new SA protocol justifies a full-scale clinical study aimed at testing the related medication compliance parameters and the corresponding therapeutic efficacies. 
    more » « less
  4. null (Ed.)
    High-resolution cervical auscultation (HRCA) is an evolving clinical method for noninvasive screening of dysphagia that relies on data science, machine learning, and wearable sensors to investigate the characteristics of disordered swallowing function in people with dysphagia. HRCA has shown promising results in categorizing normal and disordered swallowing (i.e., screening) independent of human input, identifying a variety of swallowing physiological events as accurately as trained human judges. The system has been developed through a collaboration of data scientists, computer–electrical engineers, and speech-language pathologists. Its potential to automate dysphagia screening and contribute to evaluation lies in its noninvasive nature (wearable electronic sensors) and its growing ability to accurately replicate human judgments of swallowing data typically formed on the basis of videofluoroscopic imaging data. Potential contributions of HRCA when videofluoroscopic swallowing study may be unavailable, undesired, or not feasible for many patients in various settings are discussed, along with the development and capabilities of HRCA. The use of technological advances and wearable devices can extend the dysphagia clinician's reach and reinforce top-of-license practice for patients with swallowing disorders. 
    more » « less
  5. Aspiration is the most serious complication of dysphagia, which may lead to pneumonia. Detection of aspiration is limited by the presence of its signs like coughing and choking, which may be absent in many cases. High resolution cervical auscultations (HRCA) represent a promising non-invasive method intended for the detection of swallowing disorders. In this study, we investigate the potential of HRCA in detection of penetration-aspiration in patients suspected of dysphagia. A variety of features were extracted from HRCA in both time and frequency domains and they were tested for association with the presence of penetration-aspiration. Multiple classifiers were implemented also for aspiration detection using the extracted signal features. The results showed the presence of strong association between some HRCA signal features and penetration-aspiration, furthermore, they direct towards future directions to enhance prediction capability of aspiration using HRCA signals. 
    more » « less