An Intelligent Infrastructure Toward Large Scale Naturalistic Affective Speech Corpora Collection

Upadhyay, Shreya G; Chien, Woan-Shiuan; Su, Bo-Hao; Goncalves, Lucas; Wu, Ya-Tse; Salman, Ali N; Busso, Carlos; Lee, Chi-Chun

doi:10.1109/ACII59096.2023.10388175

Citation Details

An Intelligent Infrastructure Toward Large Scale Naturalistic Affective Speech Corpora Collection

The field of speech emotion recognition (SER) aims to create scientifically rigorous systems that can reliably characterize emotional behaviors expressed in speech. A key aspect for building SER systems is to obtain emotional data that is both reliable and reproducible for practitioners. However, academic researchers encounter difficulties in accessing or collecting naturalistic, large-scale, reliable emotional recordings. Also, the best practices for data collection are not necessarily described or shared when presenting emotional corpora. To address this issue, the paper proposes the creation of an affective naturalistic database consortium (AndC) that can encourage multidisciplinary cooperation among researchers and practitioners in the field of affective computing. This paper’s contribution is twofold. First, it proposes the design of the AndC with a customizable-standard framework for intelligently-controlled emotional data collection. The focus is on leveraging naturalistic spontaneous record- ings available on audio-sharing websites. Second, it presents as a case study the development of a naturalistic large-scale Taiwanese Mandarin podcast corpus using the customizable- standard intelligently-controlled framework. The AndC will en- able research groups to effectively collect data using the provided pipeline and to contribute with alternative algorithms or data collection protocols. more »

Award ID(s):: 2016719

PAR ID:: 10533016

Author(s) / Creator(s):: Upadhyay, Shreya G; Chien, Woan-Shiuan; Su, Bo-Hao; Goncalves, Lucas; Wu, Ya-Tse; Salman, Ali N; Busso, Carlos; Lee, Chi-Chun

Corporate Creator(s):: na

Editor(s):: na

Publisher / Repository:: IEEE

Date Published:: 2023-09-10

ISBN:: 979-8-3503-2743-4

Page Range / eLocation ID:: 1 to 8

Subject(s) / Keyword(s):: Speech Emotion Recognition, Database Consortium, Affective Computing

Format(s):: Medium: X

Location:: Cambridge, MA, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ACII59096.2023.10388175

More Like this