Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and Children

Johnson, Alexander; Shetty, Vishwas; Ostendorf, Mari; and Alwan, Abeer

doi:10.1109/ICASSP49357.2023.10096614

Citation Details

Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and Children

This paper 1 presents a novel system which utilizes acoustic, phonological, morphosyntactic, and prosodic information for binary automatic dialect detection of African American English. We train this system utilizing adult speech data and then evaluate on both children’s and adults’ speech with unmatched training and testing scenarios. The proposed system combines novel and state-of-the-art architectures, including a multi-source transformer language model pre-trained on Twitter text data and fine-tuned on ASR transcripts as well as an LSTM acoustic model trained on self-supervised learning representations, in order to learn a comprehensive view of dialect. We show robust, explainable performance across recording conditions for different features for adult speech, but fusing multiple features is important for good results on children’s speech. more »

Award ID(s):: 2202585

PAR ID:: 10426181

Author(s) / Creator(s):: Johnson, Alexander; Shetty, Vishwas; Ostendorf, Mari; and Alwan, Abeer

Editor(s):: IEEE SIGNAL PROCESSING SOCIETY

Date Published:: 2023-06-04

Journal Name:: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Page Range / eLocation ID:: 1-5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICASSP49357.2023.10096614

More Like this