Cross-lingual articulatory feature information transfer for speech recognition using recurrent progressive neural networks

Morshed, Mahir; Hasegawa-Johnson, Mark

doi:10.21437/Interspeech.2022-11202

Citation Details

Cross-lingual articulatory feature information transfer for speech recognition using recurrent progressive neural networks

A system for the lateral transfer of information from end-to-end neural networks recognizing articulatory feature classes to similarly structured networks recognizing phone tokens is here proposed. The system connects recurrent layers of feature detectors pre-trained on a base language to recurrent layers of a phone recognizer for a different target language, this inspired primarily by the progressive neural network scheme. Initial experiments used detectors trained on Bengali speech for four articulatory feature classes—consonant place, consonant manner, vowel height, and vowel backness—attached to phone recognizers for four other Asian languages (Javanese, Nepali, Sinhalese, and Sundanese). While these do not currently suggest consistent performance improvements across different low-resource settings for target languages, irrespective of their genealogic or phonological relatedness to Bengali, they do suggest the need for further trials with different language sets, altered data sources and data configurations, and slightly altered network setups. more »

Award ID(s):: 1910319

PAR ID:: 10437892

Author(s) / Creator(s):: Morshed, Mahir; Hasegawa-Johnson, Mark

Date Published:: 2022-09-18

Journal Name:: Proceedings of Interspeech 2022

Page Range / eLocation ID:: 2298 to 2302

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.21437/Interspeech.2022-11202

More Like this