End-to-End Word-Level Disfluency Detection and Classification in Children’s Reading Assessment

Venkatasubramaniam, Lavanya; Sunder, Vishal; Fosler-Lussier, Eric

doi:10.1109/ICASSP49357.2023.10095555

Citation Details

End-to-End Word-Level Disfluency Detection and Classification in Children’s Reading Assessment

Disfluency detection and classification on children’s speech has a great potential for teaching reading skills. Word-level assessment of children’s speech can help teachers to effectively gauge their students’ progress. Hence, we propose a novel attention-based model to perform word-level disfluency detection and classification in a fully end-to-end (E2E) manner making it fast and easy to use. We develop a word-level disfluency annotation scheme using which we annotate a dataset of children read speech, the reading races dataset (READR). We also annotate disfluencies in the existing CMU Kids corpus. The proposed model significantly outperforms traditional cascaded baselines, which use forced alignments, on both datasets. To deal with the inevitable class-imbalance in the datasets, we propose a novel technique called HiDeC (Hierarchical Detection and Classification) which yields a detection improvement of 23% and 16% and a classification improvement of 3.8% and 19.3% relative F1-score on the READR and CMU Kids datasets respectively. more »

Award ID(s):: 2008043

PAR ID:: 10439497

Author(s) / Creator(s):: Venkatasubramaniam, Lavanya; Sunder, Vishal; Fosler-Lussier, Eric

Date Published:: 2023-06-04

Journal Name:: International Conference on Acoustics, Speech and Signal Processing

Page Range / eLocation ID:: 1 to 5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICASSP49357.2023.10095555

More Like this