Title: MELDER: The Design and Evaluation of a Real-time Silent Speech Recognizer for Mobile Devices
Silent speech is unaffected by ambient noise, increases accessibility, and enhances privacy and security. Yet current silent speech recognizers operate in a phrase-in/phrase-out manner and are thus slow, error-prone, and impractical for mobile devices. We present MELDER, a Mobile Lip Reader that operates in real time by splitting the input video into smaller temporal segments and processing them individually. An experiment revealed that this substantially improves computation time, making it suitable for mobile devices. We further optimize the model for everyday use by exploiting knowledge from a high-resource vocabulary through a transfer learning model. We then compare MELDER in both stationary and mobile settings with two state-of-the-art silent speech recognizers, where MELDER demonstrates superior overall performance. Finally, we compare two visual feedback methods of MELDER with the visual feedback method of Google Assistant. The outcomes shed light on how these proposed feedback methods influence users' perceptions of the model's performance.
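A minimal sketch of the segment-wise processing idea described in the abstract, assuming a hypothetical per-segment recognizer and illustrative window/stride values (these are not MELDER's actual parameters or model):

```python
# Sketch: run a recognizer over short, overlapping temporal segments of the
# frame stream instead of a whole phrase, so partial results arrive in real
# time. recognize_segment, segment_len, and stride are illustrative placeholders.
from typing import Callable, List, Sequence

def recognize_stream(frames: Sequence,
                     recognize_segment: Callable[[Sequence], str],
                     segment_len: int = 30,   # e.g. ~1 s of video at 30 fps (assumed)
                     stride: int = 15         # 50% overlap between segments (assumed)
                     ) -> List[str]:
    """Return partial transcripts, one per temporal segment."""
    partials = []
    for start in range(0, max(len(frames) - segment_len, 0) + 1, stride):
        segment = frames[start:start + segment_len]
        partials.append(recognize_segment(segment))
    return partials
```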
Award ID(s):
2239633
PAR ID:
10579860
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
ISBN:
9798400703300
Page Range / eLocation ID:
1 to 23
Subject(s) / Keyword(s):
Silent Speech; Digital Lip Reading; Image Processing; Deep Learning; Transfer Learning; Language Modeling; Visual Feedback; Text Input
Format(s):
Medium: X
Location:
Honolulu, HI, USA
Sponsoring Org:
National Science Foundation
More Like this
  1. Most mobile health apps employ data visualization to help people view their health and activity data, but these apps provide limited support for visual data exploration. Furthermore, despite its huge potential benefits, mobile visualization research in the personal data context is sparse. This work aims to empower people to easily navigate and compare their personal health data on smartphones by enabling flexible time manipulation with speech. We designed and developed Data@Hand, a mobile app that leverages the synergy of two complementary modalities: speech and touch. Through an exploratory study with 13 long-term Fitbit users, we examined how multimodal interaction helps participants explore their own health data. Participants successfully adopted multimodal interaction (i.e., speech and touch) for convenient and fluid data exploration. Based on the quantitative and qualitative findings, we discuss design implications and opportunities with multimodal interaction for better supporting visual data exploration on mobile devices. 
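As a toy illustration of speech-driven time manipulation (not Data@Hand's actual grammar or implementation), a spoken phrase might be resolved to a date range that the chart then displays; the phrases handled below are purely illustrative:

```python
# Toy resolver mapping a spoken time expression to a (start, end) date range
# for chart navigation; the phrase set and its semantics are illustrative assumptions.
from datetime import date, timedelta

def resolve_time_phrase(phrase: str, today: date) -> tuple[date, date]:
    phrase = phrase.lower().strip()
    if phrase == "last week":                       # Monday-Sunday of the previous week
        this_monday = today - timedelta(days=today.weekday())
        return this_monday - timedelta(days=7), this_monday - timedelta(days=1)
    if phrase == "past 30 days":
        return today - timedelta(days=29), today
    raise ValueError(f"unsupported time phrase: {phrase!r}")

# Example: resolve_time_phrase("past 30 days", date(2021, 5, 1))
```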
  2. For people with visual impairments, photography is essential in identifying objects through remote sighted help and image recognition apps. This is especially the case for teachable object recognizers, where recognition models are trained on users' photos. Here, we propose real-time feedback for communicating the location of an object of interest in the camera frame. Our audio-haptic feedback is powered by a deep learning model that estimates the object center location based on its proximity to the user's hand. To evaluate our approach, we conducted a user study in the lab, where participants with visual impairments (N=9) used our feedback to train and test their object recognizer in vanilla and cluttered environments. We found that very few photos did not include the object (2% in the vanilla and 8% in the cluttered environment), and recognition performance was promising even for participants with no prior camera experience. Participants tended to trust the feedback even though they knew it could be wrong. Our cluster analysis indicates that better feedback is associated with photos that include the entire object. Our results provide insights into factors that can degrade feedback and recognition performance in teachable interfaces.
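A hedged sketch of how a predicted object center could be turned into directional aiming guidance; the paper's actual model estimates the center from hand proximity, and the margin and hint wording below are assumptions:

```python
# Map an estimated object center (in pixels) to simple re-aiming hints so the
# object ends up near the middle of the frame. The threshold and messages are
# illustrative, not the study's actual feedback design.
def aiming_hints(center_x: float, center_y: float,
                 frame_w: int, frame_h: int, margin: float = 0.15) -> list[str]:
    dx = center_x / frame_w - 0.5    # normalized horizontal offset from frame center
    dy = center_y / frame_h - 0.5    # normalized vertical offset (y grows downward)
    hints = []
    if dx < -margin:
        hints.append("pan camera left")     # object sits left of center
    elif dx > margin:
        hints.append("pan camera right")
    if dy < -margin:
        hints.append("tilt camera up")      # object sits above center
    elif dy > margin:
        hints.append("tilt camera down")
    return hints or ["object centered"]
```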
  3. Previous work has benchmarked multiple speech recognition systems in terms of Word Error Rate (WER) for speech intended for artificial agents. This metric allows us to compare recognizers in terms of the frequency of errors; however, errors are not equally meaningful in terms of their impact on understanding the utterance and generating a coherent response. We investigate how the actual recognition results of 10 different speech recognizers and models affect response appropriateness for a virtual human (Sergeant Blackwell), who was part of a museum exhibit, fielding questions "in the wild" from museum visitors. Results show a general correlation between WER and response quality, but this pattern does not hold for all recognizers.
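For concreteness, WER is the word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words; a standard dynamic-programming implementation follows, independent of any recognizer discussed above:

```python
# Standard word error rate (WER) via word-level edit distance.
def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = min edits turning the first i reference words into the first j hypothesis words
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / len(ref)   # (S + D + I) / N

print(wer("where is the exhibit", "where is exhibit"))  # 0.25
```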
  4. We introduce a deep learning model for speech denoising, a long-standing challenge in audio analysis arising in numerous applications. Our approach is based on a key observation about human speech: there is often a short pause between each sentence or word. In a recorded speech signal, those pauses introduce a series of time periods during which only noise is present. We leverage these incidental silent intervals to learn a model for automatic speech denoising given only mono-channel audio. Detected silent intervals over time expose not just pure noise but its time-varying features, allowing the model to learn noise dynamics and suppress it from the speech signal. Experiments on multiple datasets confirm the pivotal role of silent interval detection for speech denoising, and our method outperforms several state-of-the-art denoising methods, including those that accept only audio input (like ours) and those that denoise based on audiovisual input (and hence require more information). We also show that our method enjoys excellent generalization properties, such as denoising spoken languages not seen during training. 
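The paper detects silent intervals with a learned model; as a rough illustration of the underlying idea only, a short-time energy threshold can mark low-energy spans in mono audio (the frame size and threshold below are assumptions):

```python
# Mark low-energy spans in mono audio as candidate silent intervals.
import numpy as np

def silent_intervals(audio: np.ndarray, sr: int,
                     frame_ms: float = 20.0, threshold: float = 1e-3):
    """Return (start_sec, end_sec) spans whose frame RMS falls below threshold."""
    frame = int(sr * frame_ms / 1000)
    spans, start = [], None
    for i in range(0, len(audio) - frame + 1, frame):
        rms = np.sqrt(np.mean(audio[i:i + frame] ** 2))
        if rms < threshold and start is None:
            start = i                          # entering a silent span
        elif rms >= threshold and start is not None:
            spans.append((start / sr, i / sr)) # leaving a silent span
            start = None
    if start is not None:
        spans.append((start / sr, len(audio) / sr))
    return spans
```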
  5. This paper describes experiments in training HMM-based text-to-speech (TTS) voices on data collected for Automatic Speech Recognition (ASR) training. We compare a number of filtering techniques designed to identify the best utterances from a noisy, multi-speaker corpus for training voices, to exclude speech containing noise, and to include speech close in nature to more traditionally collected TTS corpora. We also evaluate the use of automatic speech recognizers for intelligibility assessment in comparison with crowdsourcing methods. While the goal of this work is to develop natural-sounding and intelligible TTS voices in Low Resource Languages (LRLs) rapidly and easily, without the expense of recording data specifically for this purpose, we focus on English initially to identify the best filtering techniques and evaluation methods. We find that, when a large amount of data is available, selecting from the corpus based on criteria such as standard deviation of f0, fast speaking rate, and hypo-articulation produces the most intelligible voices.
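An illustrative sketch of that kind of corpus filtering, assuming per-utterance statistics have already been computed; the field names, ranking, and keep fraction are assumptions, not the paper's actual criteria or thresholds:

```python
# Rank utterances by per-utterance statistics and keep the top fraction for
# TTS training; statistics and cutoffs here are illustrative placeholders.
from dataclasses import dataclass
from typing import List

@dataclass
class Utterance:
    path: str
    f0_std: float          # standard deviation of fundamental frequency (Hz)
    words_per_sec: float   # rough speaking-rate estimate

def select_for_tts(utts: List[Utterance], keep_frac: float = 0.3) -> List[Utterance]:
    """Prefer utterances with more expressive pitch and faster speech, per the
    intuition that they yield more intelligible voices."""
    ranked = sorted(utts, key=lambda u: (u.f0_std, u.words_per_sec), reverse=True)
    return ranked[: max(1, int(len(ranked) * keep_frac))]
```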