Universal Sign Language Recognition System Using Gesture Description Generation and Large Language Model

Podder, Kanchon Kanti; Zhang, Jian; Wang, Lingyan

doi:10.1007/978-3-031-71470-2_23

Citation Details

Universal Sign Language Recognition System Using Gesture Description Generation and Large Language Model

Sign language is a priceless means of communication for deaf and hard-of-hearing people to fully enable them to participate in society and interact with others. This study introduces a novel universal sign language system that uses the Gesture-script to generate a detailed description of gestures in videos, which involve continuous movement of hands, arms, heads, and body language. Subsequently, we input this description into a Large Language Model (LLM) to interpret sign language. We deployed a few-shot prompting technique for LLM, enabling it to precisely transfer the sign videos into corresponding sentences in natural language. Furthermore, the Few-shot prompting technique enables our system to interpret multiple types of sign language without pre-training or fine-tuning. more »

Award ID(s):: 2245607

PAR ID:: 10598017

Author(s) / Creator(s):: Podder, Kanchon Kanti; Zhang, Jian; Wang, Lingyan

Publisher / Repository:: Springer Nature Switzerland

Date Published:: 2024-11-13

ISBN:: 978-3-031-71469-6

Page Range / eLocation ID:: 279 to 289

Subject(s) / Keyword(s):: Sign language interpretation, Large Language Model(LLM), Masked Auto-encoder(MAE), Few-shot prompting · Gesture

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Book Chapter:
https://doi.org/10.1007/978-3-031-71470-2_23

More Like this