Monologue versus Conversation: Differences in Emotion Perception and Acoustic Expressivity

Chien, Woan-Shiuan; Upadhyay, Shreya G.; Lin, Wei-Cheng; Wu, Ya-Tse; Su, Bo-Hao; Busso, Carlos; Lee, Chi-Chun

doi:10.1109/ACII55700.2022.9953814

Citation Details

Monologue versus Conversation: Differences in Emotion Perception and Acoustic Expressivity

Advancing speech emotion recognition (SER) de- pends highly on the source used to train the model, i.e., the emotional speech corpora. By permuting different design parameters, researchers have released versions of corpora that attempt to provide a better-quality source for training SER. In this work, we focus on studying communication modes of collection. In particular, we analyze the patterns of emotional speech collected during interpersonal conversations or monologues. While it is well known that conversation provides a better protocol for eliciting authentic emotion expressions, there is a lack of systematic analyses to determine whether conversational speech provide a “better-quality” source. Specifically, we examine this research question from three perspectives: perceptual differences, acoustic variability and SER model learning. Our analyses on the MSP- Podcast corpus show that: 1) rater’s consistency for conversation recordings is higher when evaluating categorical emotions, 2) the perceptions and acoustic patterns observed on conversations have properties that are better aligned with expected trends discussed in emotion literature, and 3) a more robust SER model can be trained from conversational data. This work brings initial evidences stating that samples of conversations may provide a better-quality source than samples from monologues for building a SER model. more »

Award ID(s):: 2016719

PAR ID:: 10441264

Author(s) / Creator(s):: Chien, Woan-Shiuan; Upadhyay, Shreya G.; Lin, Wei-Cheng; Wu, Ya-Tse; Su, Bo-Hao; Busso, Carlos; Lee, Chi-Chun

Date Published:: 2022-10-18

Journal Name:: International Conference on Affective Computing and Intelligent Interaction

Page Range / eLocation ID:: 1 to 7

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ACII55700.2022.9953814

More Like this