Title: Phonetic Anchor-Based Transfer Learning to Facilitate Unsupervised Cross-Lingual Speech Emotion Recognition
Modeling cross-lingual speech emotion recognition (SER) has become more prevalent because of its diverse applications. Existing studies have mostly focused on technical approaches that adapt the feature, domain, or label across languages, without considering in detail the similarities between the languages. This study focuses on domain adaptation in cross-lingual scenarios using phonetic constraints. This work is framed in a twofold manner. First, we analyze emotion-specific phonetic commonality across languages by identifying common vowels that are useful for SER modeling. Second, we leverage these common vowels as an anchoring mechanism to facilitate cross-lingual SER. We consider American English and Taiwanese Mandarin as a case study to demonstrate the potential of our approach. This work uses two in-the-wild natural emotional speech corpora: MSP-Podcast (American English) and BIIC-Podcast (Taiwanese Mandarin). The proposed unsupervised cross-lingual SER model using these phonetic anchors outperforms the baselines with an unweighted average recall (UAR) of 58.64%.
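A note on the reported metric: unweighted average recall (UAR) is recall averaged over the emotion classes, so every class contributes equally regardless of how often it occurs. The short sketch below is not the authors' code; the labels are invented purely to illustrate that UAR equals macro-averaged recall (here computed with scikit-learn).

# Minimal illustration of unweighted average recall (UAR).
# Hypothetical labels; not data from MSP-Podcast or BIIC-Podcast.
from sklearn.metrics import recall_score

y_true = ["neutral", "happy", "sad", "angry", "neutral", "happy", "sad", "angry"]
y_pred = ["neutral", "happy", "neutral", "angry", "neutral", "sad", "sad", "angry"]

uar = recall_score(y_true, y_pred, average="macro")  # macro-averaged recall == UAR
print(f"UAR = {uar:.2%}")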
Award ID(s):
2016719
PAR ID:
10441289
Journal Name:
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
Page Range / eLocation ID:
1 to 5
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    The field of speech emotion recognition (SER) aims to create scientifically rigorous systems that can reliably characterize emotional behaviors expressed in speech. A key aspect for building SER systems is to obtain emotional data that is both reliable and reproducible for practitioners. However, academic researchers encounter difficulties in accessing or collecting naturalistic, large-scale, reliable emotional recordings. Also, the best practices for data collection are not necessarily described or shared when presenting emotional corpora. To address this issue, the paper proposes the creation of an affective naturalistic database consortium (AndC) that can encourage multidisciplinary cooperation among researchers and practitioners in the field of affective computing. This paper’s contribution is twofold. First, it proposes the design of the AndC with a customizable-standard framework for intelligently-controlled emotional data collection. The focus is on leveraging naturalistic spontaneous recordings available on audio-sharing websites. Second, it presents as a case study the development of a naturalistic large-scale Taiwanese Mandarin podcast corpus using the customizable-standard intelligently-controlled framework. The AndC will enable research groups to effectively collect data using the provided pipeline and to contribute with alternative algorithms or data collection protocols.
  2. Abstract

    Multilingual speakers can find speech recognition in everyday environments like restaurants and open-plan offices particularly challenging. In a world where speaking multiple languages is increasingly common, effective clinical and educational interventions will require a better understanding of how factors like multilingual contexts and listeners’ language proficiency interact with adverse listening environments. For example, word and phrase recognition is facilitated when competing voices speak different languages. Is this due to a “release from masking” from lower-level acoustic differences between languages and talkers, or higher-level cognitive and linguistic factors? To address this question, we created a “one-man bilingual cocktail party” selective attention task using English and Mandarin speech from one bilingual talker to reduce low-level acoustic cues. In Experiment 1, 58 listeners more accurately recognized English targets when distracting speech was Mandarin compared to English. Bilingual Mandarin–English listeners experienced significantly more interference and intrusions from the Mandarin distractor than did English listeners, exacerbated by challenging target-to-masker ratios. In Experiment 2, 29 Mandarin–English bilingual listeners exhibited linguistic release from masking in both languages. Bilinguals experienced greater release from masking when attending to English, confirming an influence of linguistic knowledge on the “cocktail party” paradigm that is separate from primarily energetic masking effects. Effects of higher-order language processing and expertise emerge only in the most demanding target-to-masker contexts. The “one-man bilingual cocktail party” establishes a useful tool for future investigations and characterization of communication challenges in the large and growing worldwide community of Mandarin–English bilinguals.
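    As an illustration of the target-to-masker ratio (TMR) manipulation described above, the sketch below (not the study's stimulus-generation code; the waveforms are synthetic placeholders) scales a masker signal so that the target-to-masker RMS ratio equals a chosen value in dB before mixing.

    # Illustrative only: mix a target and a masker at a chosen TMR in dB.
    # Synthetic signals stand in for the bilingual talker's recordings.
    import numpy as np

    def mix_at_tmr(target, masker, tmr_db):
        """Scale the masker so the target-to-masker RMS ratio equals tmr_db."""
        n = min(len(target), len(masker))
        target, masker = target[:n], masker[:n]
        rms_t = np.sqrt(np.mean(target ** 2))
        rms_m = np.sqrt(np.mean(masker ** 2))
        masker_gain = rms_t / (rms_m * 10 ** (tmr_db / 20.0))
        return target + masker_gain * masker

    sr = 16000
    t = np.arange(sr) / sr
    target = 0.1 * np.sin(2 * np.pi * 220 * t)                     # placeholder "target speech"
    masker = 0.05 * np.random.default_rng(0).standard_normal(sr)   # placeholder "masker speech"
    mixture = mix_at_tmr(target, masker, tmr_db=-4.0)              # a demanding TMR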

     
  3. Southern American English is spoken in a large geographic region in the United States. Its characteristics include back-vowel fronting (e.g., in goose, foot, and goat), which has been ongoing since the mid-nineteenth century; meanwhile, the low back vowels (in lot and thought) have recently merged in some areas. We investigate these five vowels in the Digital Archive of Southern Speech, a legacy corpus of linguistic interviews with sixty-four speakers born 1886-1956. We extracted 89,367 vowel tokens and used generalized additive mixed-effects models to test for socially-driven changes to both their relative phonetic placements and the shapes of their formant trajectories. Our results reinforce previous descriptions of Southern vowels while contributing additional phonetic detail about their trajectories. Goose-fronting is a change in progress, with greatest fronting after coronal consonants. Goat is quite dynamic; it lowers and fronts in apparent time. Generally, women have more fronted realizations than men. Foot is largely monophthongal, and stable across time. Lot and thought are distinct and unmerged, occupying different regions of the vowel space. While their relative positions change across generations, all five vowels show a remarkable consistency in formant trajectory shapes across time. This study’s results reveal social and phonetic details about the back vowels of Southerners born in the late nineteenth and early twentieth centuries: goose-fronting was well underway, goat-fronting was beginning, but foot remained backed, and the low back vowels were unmerged.
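    To give a rough sense of the modeling approach (generalized additive models fit to formant trajectories), the hypothetical sketch below uses the pygam package with synthetic F2 values; it omits the per-speaker random effects that the study's generalized additive mixed-effects models would include, and none of the numbers come from the corpus.

    # Hypothetical sketch: smooth terms for position-in-vowel and birth year
    # predicting F2 of GOOSE tokens. Synthetic data only; random effects omitted.
    import numpy as np
    from pygam import LinearGAM, s

    rng = np.random.default_rng(1)
    n = 1000
    time = rng.uniform(0.0, 1.0, n)            # normalized time within the vowel
    birth_year = rng.uniform(1886, 1956, n)    # speaker year of birth
    f2 = 1200 + 250 * time + 2.5 * (birth_year - 1886) + rng.normal(0, 60, n)

    X = np.column_stack([time, birth_year])
    gam = LinearGAM(s(0) + s(1)).fit(X, f2)    # smooths over time and birth year
    gam.summary()                              # inspect the fitted smooth terms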

     
  4. Africa has over 2000 indigenous languages, but they are under-represented in NLP research due to a lack of datasets. In recent years, there has been progress in developing labelled corpora for African languages. However, these corpora are often available in a single domain and may not generalize to other domains. In this paper, we focus on the task of sentiment classification for cross-domain adaptation. We create a new dataset, NollySenti, based on Nollywood movie reviews for five languages widely spoken in Nigeria (English, Hausa, Igbo, Nigerian-Pidgin, and Yorùbá). We provide an extensive empirical evaluation using classical machine learning methods and pre-trained language models. Leveraging transfer learning, we compare the performance of cross-domain adaptation from the Twitter domain and cross-lingual adaptation from the English language. Our evaluation shows that transfer from English in the same target domain leads to more than 5% improvement in accuracy compared to transfer from Twitter in the same language. To further mitigate the domain difference, we leverage machine translation (MT) from English to other Nigerian languages, which leads to a further improvement of 7% over cross-lingual evaluation. While MT to low-resource languages is often of low quality, we show through human evaluation that most of the translated sentences preserve the sentiment of the original English reviews.
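    To make the "classical machine learning methods" mentioned above concrete, here is a minimal, hypothetical baseline (not the NollySenti code; the review snippets are invented): TF-IDF features feeding a logistic-regression sentiment classifier.

    # Illustrative TF-IDF + logistic regression sentiment baseline.
    # Toy review snippets; not drawn from the NollySenti dataset.
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    train_texts = ["a wonderful film", "boring and far too long",
                   "great acting and story", "terrible plot"]
    train_labels = ["positive", "negative", "positive", "negative"]

    clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                        LogisticRegression(max_iter=1000))
    clf.fit(train_texts, train_labels)
    print(clf.predict(["a great and wonderful story"]))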
  5. Abstract

    This study examines the putative benefits of explicit phonetic instruction, high variability phonetic training, and their effects on adult nonnative speakers’ Mandarin tone productions. Monolingual first language (L1) English speakers (n = 80), intermediate second language (L2) Mandarin learners (n = 40), and L1 Mandarin speakers (n = 40) took part in a multiday Mandarin-like artificial language learning task. Participants were asked to repeat a syllable–tone combination immediately after hearing it. Half of all participants were exposed to speech from 1 talker (low variability) while the other half heard speech from 4 talkers (high variability). Half of the L1 English participants were given daily explicit instruction on Mandarin tone contours, while the other half were not. Tone accuracy was measured by L1 Mandarin raters (n = 104) who classified productions according to their perceived tonal category. Explicit instruction of tone contours facilitated L1 English participants’ production of rising and falling tone contours. High variability input alone had no main effect on participants’ productions but interacted with explicit instruction to improve participants’ productions of high-level tone contours. These results motivate an L2 tone production training approach that consists of explicit tone instruction followed by gradual exposure to more variable speech.

     