- PAR ID: 10341980
- Date Published:
- Journal Name: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Page Range / eLocation ID: 4678 to 4682
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Fu, Qian-Jie (Ed.) Face masks are an important tool for preventing the spread of COVID-19. However, it is unclear how different types of masks affect speech recognition in different levels of background noise. To address this, we investigated the effects of four masks (a surgical mask, N95 respirator, and two cloth masks) on recognition of spoken sentences in multi-talker babble. In low levels of background noise, masks had little to no effect, with no more than a 5.5% decrease in mean accuracy compared to a no-mask condition. In high levels of noise, mean accuracy was 2.8-18.2% lower than the no-mask condition, but the surgical mask continued to show no significant difference. The results demonstrate that different types of masks generally yield similar accuracy in low levels of background noise, but differences between masks become more apparent in high levels of noise.
- Over the past two years, face masks have been a critical tool for preventing the spread of COVID-19. While previous studies have examined the effects of masks on speech recognition, much of this work was conducted early in the pandemic. Given that human listeners are able to adapt to a wide variety of novel contexts in speech perception, an open question concerns the extent to which listeners have adapted to masked speech during the pandemic. In order to evaluate this, we replicated Toscano and Toscano (PLOS ONE 16(2):e0246842, 2021), looking at the effects of several types of face masks on speech recognition in different levels of multi-talker babble noise. We also examined the effects of listeners' self-reported frequency of encounters with masked speech and the effects of the implementation of public mask mandates on speech recognition. Overall, we found that listeners' performance in the current experiment (with data collected in 2021) was similar to that of listeners in Toscano and Toscano (with data collected in 2020) and that performance did not differ based on mask experience. These findings suggest that listeners may have already adapted to masked speech by the time data were collected in 2020, are unable to adapt to masked speech, require additional context to be able to adapt, or that talkers also changed their productions over time. Implications for theories of perceptual learning in speech are discussed.
- The joint analysis of audio and video is a powerful tool that can be applied to various contexts, including action, speech, and sound recognition, audio-visual video parsing, emotion recognition in affective computing, and self-supervised training of deep learning models. Solving these problems often involves tackling core audio-visual tasks, such as audio-visual source localization, audio-visual correspondence, and audio-visual source separation, which can be combined in various ways to achieve the desired results. This paper provides a review of the literature in this area, discussing the advancements, history, and datasets of audio-visual learning methods for various application domains. It also presents an overview of the reported performances on standard datasets and suggests promising directions for future research.
- Most existing audio-text emotion recognition studies have focused on the computational modeling aspects, including strategies for fusing the modalities. An area that has received less attention is understanding the role of proper temporal synchronization between the modalities in the model performance. This study presents a transformer-based model designed with a word-chunk concept, which offers an ideal framework to explore different strategies to align text and speech. The approach creates chunks with alternative alignment strategies with different levels of dependency on the underlying lexical boundaries. A key contribution of this study is the multi-scale chunk alignment strategy, which generates random alignments to create the chunks without considering lexical boundaries. For every epoch, the approach generates a different alignment for each sentence, serving as an effective regularization method for temporal dependency. Our experimental results based on the MSP-Podcast corpus indicate that providing precise temporal alignment information to create the audio-text chunks does not improve the performance of the system. The attention mechanisms in the transformer-based approach are able to compensate for imperfect synchronization between the modalities. However, using exact lexical boundaries makes the system highly vulnerable to missing modalities. In contrast, the model trained with the proposed multi-scale chunk regularization strategy using random alignment can significantly increase its robustness against missing data and remain effective, even under a single audio-only emotion recognition task. The code is available at: https://github.com/winston-lin-wei-cheng/MultiScale-Chunk-Regularization
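  As a rough illustration of the multi-scale chunk regularization idea described in that abstract, the sketch below draws a fresh random chunking of an utterance's acoustic frames on every epoch, ignoring lexical boundaries, and mean-pools the frames within each chunk. The function names, feature dimensions, and pooling choice are assumptions made for illustration only; the authors' actual implementation is at the linked repository.

  ```python
  import numpy as np

  def random_chunk_boundaries(num_frames, num_chunks, rng):
      """Sample chunk boundaries uniformly at random, ignoring word boundaries.

      Returns a list of (start, end) frame indices covering the utterance.
      """
      # Choose num_chunks - 1 distinct interior cut points, then sort them.
      cuts = rng.choice(np.arange(1, num_frames), size=num_chunks - 1, replace=False)
      edges = np.concatenate(([0], np.sort(cuts), [num_frames]))
      return list(zip(edges[:-1], edges[1:]))

  def pool_chunks(frame_features, boundaries):
      """Mean-pool frame-level features within each chunk to get chunk embeddings."""
      return np.stack([frame_features[s:e].mean(axis=0) for s, e in boundaries])

  # A new random alignment is drawn every epoch for each utterance, so the model
  # never sees the same audio-text chunking twice (the regularization effect).
  rng = np.random.default_rng()
  frames = np.random.randn(300, 64)  # e.g. 300 acoustic frames, 64-dim features (hypothetical sizes)
  for epoch in range(3):
      chunks = pool_chunks(frames, random_chunk_boundaries(len(frames), num_chunks=8, rng=rng))
      print(epoch, chunks.shape)  # (8, 64) chunk embeddings, different segmentation each epoch
  ```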
- Speech enhancement tasks have seen significant improvements with the advance of deep learning technology, but with the cost of increased computational complexity. In this study, we propose an adaptive boosting approach to learning locality sensitive hash codes, which represent audio spectra efficiently. We use the learned hash codes for single-channel speech denoising tasks as an alternative to a complex machine learning model, particularly to address the resource-constrained environments. Our adaptive boosting algorithm learns simple logistic regressors as the weak learners. Once trained, their binary classification results transform each spectrum of test noisy speech into a bit string. Simple bitwise operations calculate Hamming distance to find the K-nearest matching frames in the dictionary of training noisy speech spectra, whose associated ideal binary masks are averaged to estimate the denoising mask for that test mixture. Our proposed learning algorithm differs from AdaBoost in the sense that the projections are trained to minimize the distances between the self-similarity matrix of the hash codes and that of the original spectra, rather than the misclassification rate. We evaluate our discriminative hash codes on the TIMIT corpus with various noise types, and show comparative performance to deep learning methods in terms of denoising performance and complexity.
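  To make the hash-code lookup in that abstract concrete, here is a minimal sketch of the test-time denoising step: each spectrum is projected by the trained weak learners and thresholded into a bit string, Hamming distances to the dictionary codes select the K nearest training frames, and their ideal binary masks are averaged into the estimated mask. Array shapes, variable names, and the random "learned" projections below are placeholders, and NumPy elementwise comparisons stand in for the bitwise XOR/popcount operations the paper describes.

  ```python
  import numpy as np

  def spectra_to_bits(spectra, projections, biases):
      """Apply the learned linear projections (boosted weak learners) and threshold
      to turn each spectrum into a binary hash code of length num_bits."""
      return (spectra @ projections + biases > 0).astype(np.uint8)  # shape (N, num_bits)

  def knn_denoise(test_code, train_codes, train_masks, k=10):
      """Estimate a denoising mask for one test frame: find the K training frames
      with the smallest Hamming distance and average their ideal binary masks."""
      hamming = np.count_nonzero(train_codes != test_code, axis=1)  # Hamming distance per dictionary frame
      nearest = np.argsort(hamming)[:k]
      return train_masks[nearest].mean(axis=0)  # soft mask in [0, 1]

  # Toy example with random stand-ins for the learned projections (hypothetical sizes).
  rng = np.random.default_rng(0)
  num_bins, num_bits = 257, 64
  projections = rng.standard_normal((num_bins, num_bits))
  biases = rng.standard_normal(num_bits)

  train_spectra = rng.random((1000, num_bins))                       # dictionary of noisy training spectra
  train_masks = (rng.random((1000, num_bins)) > 0.5).astype(float)   # their ideal binary masks
  test_spectrum = rng.random((1, num_bins))

  train_codes = spectra_to_bits(train_spectra, projections, biases)
  test_code = spectra_to_bits(test_spectrum, projections, biases)[0]
  mask = knn_denoise(test_code, train_codes, train_masks, k=10)
  print(mask.shape)  # (257,) estimated mask to apply to the test spectrum
  ```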