Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments

Dutta, Satwik; Tao, Sarah Anne; Reyna, Jacob C.; Hacker, Rebecca Elizabeth; Irvin, Dwight W.; Buzhardt, Jay F.; Hansen, John H.L.

doi:10.21437/Interspeech.2022-555

Citation Details

Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments

Monitoring child development in terms of speech/language skills has a long-term impact on their overall growth. As student diversity continues to expand in US classrooms, there is a growing need to benchmark social-communication engagement, both from a teacher-student perspective, as well as student-student content. Given various challenges with direct observation, deploying speech technology will assist in extracting meaningful information for teachers. These will help teachers to identify and respond to students in need, immediately impacting their early learning and interest. This study takes a deep dive into exploring various hybrid ASR solutions for low-resource spontaneous preschool (3-5yrs) children (with & without developmental delays) speech, being involved in various activities, and interacting with teachers and peers in naturalistic classrooms. Various out-of-domain corpora over a wide and limited age range, both scripted and spontaneous were considered. Acoustic models based on factorized TDNNs infused with Attention, and both N-gram and RNN language models were considered. Results indicate that young children have significantly different/ developing articulation skills as compared to older children. Out-of-domain transcripts of interactions between young children and adults however enhance language model performance. Overall transcription of such data, including various non-linguistic markers, poses additional challenges. more »

Award ID(s):: 1918032

PAR ID:: 10362772

Author(s) / Creator(s):: Dutta, Satwik; Tao, Sarah Anne; Reyna, Jacob C.; Hacker, Rebecca Elizabeth; Irvin, Dwight W.; Buzhardt, Jay F.; Hansen, John H.L.

Date Published:: 2022-09-18

Journal Name:: ISCA INTERSPEECH-2022

Page Range / eLocation ID:: 4322 to 4326

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.21437/Interspeech.2022-555

More Like this