NSF PAR Search | NSF Public Access Repository

An exploratory study on dialect density estimation for children and adult's African American English

https://doi.org/10.1121/10.0025771

Johnson, Alexander; Shankar, Natarajan Balaji; Ostendorf, Mari; Alwan, Abeer (April 2024, The Journal of the Acoustical Society of America)

This paper evaluates an innovative framework for spoken dialect density prediction on children's and adults' African American English. A speaker's dialect density is defined as the frequency with which dialect-specific language characteristics occur in their speech. Rather than treating the presence or absence of a target dialect in a user's speech as a binary decision, instead, a classifier is trained to predict the level of dialect density to provide a higher degree of specificity in downstream tasks. For this, self-supervised learning representations from HuBERT, handcrafted grammar-based features extracted from ASR transcripts, prosodic features, and other feature sets are experimented with as the input to an XGBoost classifier. Then, the classifier is trained to assign dialect density labels to short recorded utterances. High dialect density level classification accuracy is achieved for child and adult speech and demonstrated robust performance across age and regional varieties of dialect. Additionally, this work is used as a basis for analyzing which acoustic and grammatical cues affect machine perception of dialect.

Full Text Available

Search for: All records