The role of a sign interpreting agent is to bridge the communication gap between the hearing and Deaf or Hard of Hearing communities by translating both from sign language to text and from text to sign language. Until now, much of the AI work in automated sign language processing has focused primarily on sign-language-to-text translation, which places the advantage mainly on the side of hearing individuals. In this work, we describe advances in sign language processing based on transformer networks. Specifically, we introduce SignNet II, a sign language processing architecture and a promising step toward facilitating two-way sign language communication. It comprises sign-to-text and text-to-sign networks jointly trained using a dual learning mechanism. Furthermore, by exploiting the notion of sign similarity, a metric embedding learning process is introduced to enhance text-to-sign translation performance. Using a bank of multi-feature transformers, we analyzed several input feature representations and found that keypoint-based pose features consistently performed well, irrespective of the quality of the input videos. We demonstrated that the two jointly trained networks outperformed their singly trained counterparts, showing noteworthy improvements in BLEU-1 through BLEU-4 scores when tested on the largest available German Sign Language (GSL) benchmark dataset.
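As a rough illustration of the dual learning setup described above, the sketch below jointly trains a text-to-sign (T2S) and a sign-to-text (S2T) transformer and adds a cycle term in which the poses generated by T2S are decoded back to text by S2T. The module names, dimensions, loss weighting, and simplified teacher forcing are illustrative assumptions, not the published SignNet II implementation.

```python
# Hedged sketch of dual learning over S2T/T2S networks (hypothetical sizes and modules).
import torch
import torch.nn as nn
import torch.nn.functional as F

POSE_DIM, VOCAB, D = 150, 2000, 256          # assumed: keypoint vector size, vocab size, model width

class Seq2Seq(nn.Module):
    """Generic transformer used for both translation directions."""
    def __init__(self, src_dim, tgt_dim, out_dim):
        super().__init__()
        self.src_in = nn.Linear(src_dim, D)
        self.tgt_in = nn.Linear(tgt_dim, D)
        self.core = nn.Transformer(d_model=D, batch_first=True)
        self.out = nn.Linear(D, out_dim)

    def forward(self, src, tgt):
        return self.out(self.core(self.src_in(src), self.tgt_in(tgt)))

embed = nn.Embedding(VOCAB, D)                                  # shared text embedding
t2s = Seq2Seq(src_dim=D, tgt_dim=POSE_DIM, out_dim=POSE_DIM)    # text -> pose sequence
s2t = Seq2Seq(src_dim=POSE_DIM, tgt_dim=D, out_dim=VOCAB)       # pose sequence -> token logits

def dual_step(tokens, poses):
    """One joint update: supervised losses in both directions plus a cycle loss.
    (Target shifting and attention masks are omitted for brevity.)"""
    txt = embed(tokens)                          # (B, T_text, D)
    pred_pose = t2s(txt, poses)                  # T2S: generate pose sequence
    pred_text = s2t(poses, txt)                  # S2T: generate token logits
    cycle_text = s2t(pred_pose, txt)             # dual signal: text recovered from generated poses
    return (F.mse_loss(pred_pose, poses)
            + F.cross_entropy(pred_text.transpose(1, 2), tokens)
            + F.cross_entropy(cycle_text.transpose(1, 2), tokens))
```

In a training loop, `dual_step(tokens, poses)` would be called on each batch and the combined loss backpropagated through both networks, which is what allows the two translation directions to regularize each other.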
Effects of Feature Scaling and Fusion on Sign Language Translation
Sign language translation without transcription has only recently started to gain attention. In our work, we focus on improving state-of-the-art translation by introducing a multi-feature fusion architecture with enhanced input features. Because sign language is challenging to segment, we obtain the input features by extracting overlapping, scaled segments across the video and computing their 3D CNN representations. We exploit the attention mechanism in the fusion architecture by first learning dependencies between different frames of the same video and then fusing them to learn the relations between different features of the same video. In addition to 3D CNN features, we also analyze pose-based features. Our methodology outperforms the state-of-the-art sign language translation model by achieving higher BLEU-3 and BLEU-4 scores, and it outperforms the state-of-the-art sequence attention models by achieving a 43.54% increase in BLEU-4 score. We conclude that the combined effects of feature scaling and feature fusion make our model more robust in predicting longer n-grams, which are crucial in continuous sign language translation.
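The sketch below illustrates the two mechanisms this abstract refers to: cutting overlapping (and, by varying the window length, scaled) segments from the frame sequence for a 3D CNN, and fusing two feature streams with attention. The toy 3D CNN, the window/stride values, and the pose-feature placeholder are assumptions for illustration only.

```python
# Hedged sketch of overlapping segment extraction and attention-based fusion.
import torch
import torch.nn as nn

def overlapping_segments(frames, window, stride):
    """frames: (T, C, H, W) -> (N, C, window, H, W) overlapping clips;
    different `window` values give the different segment scales."""
    clips = [frames[s:s + window] for s in range(0, frames.shape[0] - window + 1, stride)]
    return torch.stack(clips).transpose(1, 2)    # channels before time for a 3D CNN

cnn3d = nn.Sequential(                           # stand-in for a pretrained 3D CNN backbone
    nn.Conv3d(3, 64, kernel_size=3, padding=1),
    nn.AdaptiveAvgPool3d(1),
    nn.Flatten(),
)
self_attn = nn.MultiheadAttention(embed_dim=64, num_heads=4, batch_first=True)   # shared across streams to keep the sketch short
cross_attn = nn.MultiheadAttention(embed_dim=64, num_heads=4, batch_first=True)

def fuse(video_feats, pose_feats):
    """Self-attention within each stream, then cross-attention across streams."""
    v, _ = self_attn(video_feats, video_feats, video_feats)
    p, _ = self_attn(pose_feats, pose_feats, pose_feats)
    fused, _ = cross_attn(v, p, p)               # video queries attend to pose keys/values
    return fused

frames = torch.randn(32, 3, 64, 64)              # 32 RGB frames at a toy resolution
clips = overlapping_segments(frames, window=16, stride=8)
video_feats = cnn3d(clips).unsqueeze(0)          # (1, num_clips, 64)
pose_feats = torch.randn(1, clips.shape[0], 64)  # placeholder for a pose-based stream
print(fuse(video_feats, pose_feats).shape)       # (1, num_clips, 64) fused representation
```

The fused sequence would then feed the translation decoder; in the actual model the segment scales, backbone, and fusion order may differ.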
- Award ID(s): 1846076
- PAR ID: 10321200
- Date Published:
- Journal Name: Proceedings of Interspeech 2021
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- While a significant amount of work has been done on the commonly used, tightly constrained, weather-based German Sign Language (GSL) dataset, little has been done for continuous sign language translation (SLT) in more realistic settings, including American Sign Language (ASL) translation. Also, while CNN-based features have consistently been shown to work well on the GSL dataset, it is not clear whether such features will work as well in more realistic settings with more heterogeneous signers in non-uniform backgrounds. To this end, in this work we introduce a new, realistic phrase-level ASL dataset (ASLing) and explore the role of different types of visual features (CNN embeddings, human body keypoints, and optical flow vectors) in translating it to spoken American English. We propose a novel Transformer-based visual feature learning method for ASL translation. We demonstrate the explainability of our proposed learning methods by visualizing activation weights under various input conditions and discover that the body keypoints are consistently the most reliable set of input features. Using our model, we successfully transfer-learn from the larger GSL dataset to ASLing, resulting in significant BLEU score improvements. In summary, this work goes a long way toward bringing together the AI resources required for automated ASL translation in unconstrained environments.
- Many sign languages are bona fide natural languages with grammatical rules and lexicons, and hence can benefit from machine translation methods. Similarly, since sign language is a visual-spatial language, it can also benefit from computer vision methods for encoding it. With the advent of deep learning methods in recent years, significant advances have been made in natural language processing (specifically, neural machine translation) and in computer vision (specifically, image and video captioning). Researchers have therefore begun extending these learning methods to sign language understanding. Sign language interpretation is especially challenging because it involves a continuous visual-spatial modality where meaning is often derived from context. The focus of this article, therefore, is to examine various deep learning-based methods for encoding sign language as inputs, and to analyze the efficacy of several machine translation methods over three different sign language datasets. The goal is to determine which combinations are sufficiently robust for sign language translation without any gloss-based information. To understand the role of the different input features, we perform ablation studies over the model architectures (input features + neural translation models) for improved continuous sign language translation. These input features include body and finger joints, facial points, and vector representations/embeddings from convolutional neural networks. The machine translation models explored include several baseline sequence-to-sequence approaches; more complex networks using attention and reinforcement learning; and the transformer model. We implement the translation methods over multiple sign languages: German (GSL), American (ASL), and Chinese (CSL). From our analysis, the transformer model combined with input embeddings from ResNet50 or pose-based landmark features outperformed all the other sequence-to-sequence models, achieving higher BLEU-2 through BLEU-4 scores when applied to the controlled and constrained GSL benchmark dataset. These combinations also showed significant promise on the less controlled ASL and CSL datasets.
- A true interpreting agent not only understands sign language and translates it to text, but also understands text and translates it to signs. Much of the AI work in sign language translation to date has focused mainly on translating from signs to text. Toward the latter goal, we propose a text-to-sign translation model, SignNet, which exploits the notion of similarity (and dissimilarity) of visual signs in translating. The module presented here is only one part of a dual-learning, two-task process involving text-to-sign (T2S) as well as sign-to-text (S2T). We currently implement SignNet as a single-channel architecture so that the output of the T2S task can be fed into S2T in a continuous dual learning framework. By single channel, we refer to a single modality, the body pose joints. In this work, we present SignNet, a T2S model that uses a novel metric embedding learning process to preserve the distances between sign embeddings relative to their dissimilarity; we also describe how to choose positive and negative examples of signs for similarity testing (a sketch of this kind of triplet-style embedding objective appears after this list). From our analysis, we observe that the metric embedding learning-based model performs significantly better than the other models with traditional losses when evaluated using BLEU scores. In the task of gloss to pose, SignNet performed as well as its state-of-the-art (SoTA) counterparts, and it outperformed them in the task of text to pose, showing noteworthy improvements in BLEU-1 through BLEU-4 scores (BLEU-1: 31 → 39, ≈26% improvement; BLEU-4: 10.43 → 11.84, ≈14% improvement) when tested on the popular RWTH-PHOENIX-Weather-2014T benchmark dataset.
- We propose a system that uses a convolutional neural network (CNN) to estimate depth from a stereo pair, followed by volumetric fusion of the predicted depth maps to produce a 3D reconstruction of a scene. Our proposed depth refinement architecture predicts view-consistent disparity and occlusion maps that help the fusion system produce geometrically consistent reconstructions. We utilize 3D dilated convolutions in our proposed cost-filtering network, which yields better filtering while almost halving the computational cost in comparison to state-of-the-art cost-filtering architectures. For feature extraction we use the Vortex Pooling architecture [24]. The proposed method achieves state-of-the-art results on the KITTI 2012, KITTI 2015, and ETH3D stereo benchmarks. Finally, we demonstrate that our system is able to produce high-fidelity 3D scene reconstructions that outperform the state-of-the-art stereo system.
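Referring back to the SignNet entry above, the following is a hedged sketch of a metric embedding objective of the kind it describes: pose sequences for signs are embedded so that visually similar signs stay close while dissimilar signs are pushed apart. The encoder, the margin, and the way anchor/positive/negative signs are drawn are illustrative assumptions rather than the published method.

```python
# Hedged sketch of a triplet-style metric embedding for sign similarity.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SignEncoder(nn.Module):
    """Encodes a pose-keypoint sequence into a fixed-size, unit-norm sign embedding."""
    def __init__(self, pose_dim=150, d_model=256):
        super().__init__()
        self.proj = nn.Linear(pose_dim, d_model)
        self.gru = nn.GRU(d_model, d_model, batch_first=True)

    def forward(self, poses):                    # poses: (B, T, pose_dim)
        _, h = self.gru(self.proj(poses))
        return F.normalize(h[-1], dim=-1)        # last hidden state as the sign embedding

encoder = SignEncoder()
triplet = nn.TripletMarginLoss(margin=0.2)

# anchor/positive: renditions of visually similar signs; negative: a dissimilar sign
anchor, positive, negative = (torch.randn(8, 40, 150) for _ in range(3))
loss = triplet(encoder(anchor), encoder(positive), encoder(negative))
loss.backward()
```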