Title: Computational Modeling of the Segmentation of Sentence Stimuli From an Infant Word‐Finding Study
Abstract: Computational models of infant word-finding typically operate over transcriptions of infant-directed speech corpora. It is now possible to test models of word segmentation on speech materials, rather than transcriptions of speech. We propose that such modeling efforts be conducted over the speech of the experimental stimuli used in studies measuring infants' capacity for learning from spoken sentences. Correspondence with infant outcomes in such experiments is an appropriate benchmark for models of infants. We demonstrate such an analysis by applying the DP-Parse model of Algayres and colleagues to auditory stimuli used in infant psycholinguistic experiments by Pelucchi and colleagues. The DP-Parse model takes speech as input and creates multiple overlapping embeddings from each utterance. Prospective words are identified as clusters of similar embedded segments. This allows segmentation of each utterance into possible words, using a dynamic programming method that maximizes the frequency of constituent segments. We show that DP-Parse mimics American English learners' performance in extracting words from Italian sentences, favoring the segmentation of words with high syllabic transitional probability. This kind of computational analysis over actual stimuli from infant experiments may be helpful in tuning future models to match human performance.
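As a rough illustration of the dynamic-programming step described in the abstract, the sketch below segments a string into the substring sequence with the highest summed segment frequency. This is a text-based stand-in only: DP-Parse operates on speech-segment embeddings and cluster frequencies, not characters, and the function name and example counts here are invented for the illustration.

```python
from collections import Counter

def dp_segment(utterance, seg_counts, max_len=6):
    """Segment `utterance` into the substring sequence that maximizes
    the summed frequency of its constituent segments (toy version of
    DP-Parse's dynamic program, over characters instead of speech)."""
    n = len(utterance)
    best = [(-1, 0)] * (n + 1)   # (best score, backpointer) per prefix
    best[0] = (0, 0)
    for i in range(1, n + 1):
        for j in range(max(0, i - max_len), i):
            if best[j][0] < 0:
                continue
            # Unseen segments contribute 0, so every split is permitted.
            score = best[j][0] + seg_counts.get(utterance[j:i], 0)
            if score > best[i][0]:
                best[i] = (score, j)
    # Recover the winning segmentation by following backpointers.
    out, i = [], n
    while i > 0:
        j = best[i][1]
        out.append(utterance[j:i])
        i = j
    return out[::-1]

# Candidate "proto-word" counts, as if produced by embedding clustering.
counts = Counter({"fuga": 5, "bici": 4, "casa": 3, "fu": 1, "ga": 1})
print(dp_segment("fugabicicasa", counts))  # ['fuga', 'bici', 'casa']
```

Frequent candidate segments outscore any combination of their rarer sub-pieces, which is the intuition behind favoring high-transitional-probability words.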
Award ID(s):
1917608
PAR ID:
10541097
Author(s) / Creator(s):
Publisher / Repository:
Cognitive Science
Date Published:
Journal Name:
Cognitive Science
Volume:
48
Issue:
3
ISSN:
0364-0213
Subject(s) / Keyword(s):
Infant language Word recognition Speech segmentation Computational modeling Zero-resource speech Developmental psycholinguistics
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Van_Den_Heuvel, M; Wass, S V (Ed.)
    During everyday interactions, mothers and infants achieve behavioral synchrony at multiple levels. The ebb and flow of mother-infant physical proximity may be a central type of synchrony that establishes a common ground for infant-mother interaction. However, the role of proximity in language exchanges is relatively unstudied, perhaps because structured tasks—the common setup for observing infant-caregiver interactions—establish proximity by design. We video-recorded 100 mothers (U.S. Hispanic N = 50, U.S. Non-Hispanic N = 50) and their 13- to 23-month-old infants during natural activity at home (1 to 2 h per dyad), transcribed mother and infant speech, and coded proximity continuously (i.e., infant and mother within arm's reach). In both samples, dyads entered proximity in a bursty temporal pattern, with bouts of proximity interspersed with bouts of physical distance. As hypothesized, Non-Hispanic and Hispanic mothers produced more words and a greater variety of words when within arm's reach than out of arm's reach. Similarly, infants produced more utterances that contained words when close to their mother than when not. However, infants babbled equally often regardless of proximity, generating abundant opportunities to play with sounds. Physical proximity expands opportunities for language exchanges and infants' communicative word use, although babies accumulate massive practice babbling even when caregivers are not proximal.
  2. Existing topic modeling and text segmentation methodologies generally require large datasets for training, limiting their capabilities when only small collections of text are available. In this work, we reexamine the inter-related problems of “topic identification” and “text segmentation” for sparse document learning, when there is a single new text of interest. In developing a methodology to handle single documents, we face two major challenges. First is sparse information: with access to only one document, we cannot train traditional topic models or deep learning algorithms. Second is significant noise: a considerable portion of words in any single document will produce only noise and not help discern topics or segments. To tackle these issues, we design an unsupervised, computationally efficient methodology called Biclustering Approach to Topic modeling and Segmentation (BATS). BATS leverages three key ideas to simultaneously identify topics and segment text: (i) a new mechanism that uses word order information to reduce sample complexity, (ii) a statistically sound graph-based biclustering technique that identifies latent structures of words and sentences, and (iii) a collection of effective heuristics that remove noise words and award important words to further improve performance. Experiments on six datasets show that our approach outperforms several state-of-the-art baselines when considering topic coherence, topic diversity, segmentation, and runtime comparison metrics.
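As a flavor of single-document text segmentation, the sketch below places a boundary wherever lexical cohesion between adjacent sentences drops. This is not the BATS algorithm (which uses graph-based biclustering); the stopword filter, similarity measure, threshold, and example sentences are illustrative assumptions.

```python
import re

STOPWORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "it", "on"}

def content_words(sentence):
    """Lowercased tokens with stopwords (a crude noise filter) removed."""
    return {w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOPWORDS}

def segment_boundaries(sentences, threshold=0.1):
    """Start a new segment wherever the Jaccard overlap between
    adjacent sentences' content words falls below `threshold`.
    A minimal cohesion-based segmenter, not the BATS method."""
    boundaries = []
    for i in range(len(sentences) - 1):
        a, b = content_words(sentences[i]), content_words(sentences[i + 1])
        jaccard = len(a & b) / max(1, len(a | b))
        if jaccard < threshold:
            boundaries.append(i + 1)   # segment starts at sentence i+1
    return boundaries

doc = [
    "Solar panels convert sunlight into electricity.",
    "Panel efficiency depends on sunlight intensity.",
    "Pasta is boiled in salted water.",
    "Fresh pasta cooks faster than dried pasta.",
]
print(segment_boundaries(doc))  # [2]: a boundary before the pasta sentences
```

Even this crude version shows why noise words matter: without the stopword filter, function words inflate overlap and smear the topic boundary.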
  3. Traditionally, many text-mining tasks treat individual word-tokens as the finest meaningful semantic granularity. However, in many languages and specialized corpora, words are composed by concatenating semantically meaningful subword structures. Word-level analysis cannot leverage the semantic information present in such subword structures. With regard to word embedding techniques, this leads to not only poor embeddings for infrequent words in long-tailed text corpora but also weak capabilities for handling out-of-vocabulary words. In this paper, we propose MorphMine for unsupervised morpheme segmentation. MorphMine applies a parsimony criterion to hierarchically segment words into the fewest number of morphemes at each level of the hierarchy. This leads to longer shared morphemes at each level of segmentation. Experiments show that MorphMine segments words in a variety of languages into human-verified morphemes. Additionally, we experimentally demonstrate that utilizing MorphMine morphemes to enrich word embeddings consistently improves embedding quality on a variety of embedding evaluations and a downstream language modeling task.
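The parsimony criterion can be illustrated with a small dynamic program that splits a word into the fewest entries from a given morpheme lexicon. This is a simplification: MorphMine induces the lexicon itself, unsupervised, and segments hierarchically, whereas the lexicon below is hand-supplied for the example.

```python
def fewest_morphemes(word, lexicon):
    """Split `word` into the fewest lexicon entries (a parsimony
    criterion in the spirit of MorphMine; illustrative only)."""
    INF = float("inf")
    n = len(word)
    best = [INF] * (n + 1)   # fewest morphemes covering word[:i]
    back = [0] * (n + 1)
    best[0] = 0
    for i in range(1, n + 1):
        for j in range(i):
            if word[j:i] in lexicon and best[j] + 1 < best[i]:
                best[i], back[i] = best[j] + 1, j
    if best[n] == INF:
        return None            # word not covered by the lexicon
    pieces, i = [], n
    while i > 0:
        pieces.append(word[back[i]:i])
        i = back[i]
    return pieces[::-1]

# Hypothetical lexicon: parsimony prefers the 2-piece split over 3.
lexicon = {"un", "happi", "happiness", "ness", "happy"}
print(fewest_morphemes("unhappiness", lexicon))  # ['un', 'happiness']
```

Favoring the fewest pieces is what yields the longer shared morphemes the abstract mentions: `un + happiness` (2 pieces) beats `un + happi + ness` (3 pieces).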
  4. Computational models of distributional semantics can analyze a corpus to derive representations of word meanings in terms of each word’s relationship to all other words in the corpus. While these models are sensitive to topic (e.g., tiger and stripes) and synonymy (e.g., soar and fly), the models have limited sensitivity to part of speech (e.g., book and shirt are both nouns). By augmenting a holographic model of semantic memory with additional levels of representations, we present evidence that sensitivity to syntax is supported by exploiting associations between words at varying degrees of separation. We find that sensitivity to associations at three degrees of separation reinforces the relationships between words that share part-of-speech and improves the ability of the model to construct grammatical sentences. Our model provides evidence that semantics and syntax exist on a continuum and emerge from a unitary cognitive system. 
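A minimal sketch of why higher-order associations help: two words that never co-occur directly can still share contexts, and multiplying a co-occurrence matrix by its transpose counts those shared contexts. The toy directed graph below is invented for the example; the paper's holographic model is considerably richer and goes out to three degrees of separation.

```python
def matmul(A, B):
    """Plain list-of-lists matrix multiply (kept dependency-free)."""
    n, m, p = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(m)) for j in range(p)]
            for i in range(n)]

def transpose(A):
    return [list(row) for row in zip(*A)]

# Toy first-order co-occurrence: A[i][j] = 1 if word i precedes word j
# in the corpus ("dog runs", "dog sleeps", "cat runs", "cat sleeps").
words = ["dog", "cat", "runs", "sleeps"]
A = [
    [0, 0, 1, 1],   # dog
    [0, 0, 1, 1],   # cat
    [0, 0, 0, 0],   # runs
    [0, 0, 0, 0],   # sleeps
]

# Second-order association: S[i][j] counts right-contexts shared by
# words i and j. Nouns cluster together even though "dog" and "cat"
# never co-occur directly.
S = matmul(A, transpose(A))
print(S[0][1])  # 2: dog and cat share two contexts (runs, sleeps)
print(S[0][2])  # 0: dog and runs share none
```

Shared-context counts like `S` are one simple way part-of-speech structure emerges from distributional associations, consistent with the abstract's claim that syntax can be recovered from associations at varying degrees of separation.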
  5. Fearless Steps (FS) APOLLO is a 50,000+ hr audio resource established by CRSS-UTDallas capturing all communications between NASA-MCC personnel, backroom staff, and Astronauts across manned Apollo Missions. Such a massive audio resource, unlabeled and without metadata, provides limited benefit for communities outside Speech-and-Language Technology (SLT). Supplementing this audio with rich metadata developed using robust automated mechanisms to transcribe and highlight naturalistic communications can facilitate open research opportunities for SLT, speech sciences, education, and historical archival communities. In this study, we focus on customizing keyword spotting (KWS) and topic detection systems as an initial step towards conversational understanding. Extensive research in automatic speech recognition (ASR), speech activity, and speaker diarization using the manually transcribed 125 h FS Challenge corpus has demonstrated the need for robust domain-specific model development. A major challenge in training KWS systems and topic detection models is the availability of word-level annotations. Forced alignment schemes evaluated using state-of-the-art ASR show significant degradation in segmentation performance. This study explores challenges in extracting accurate keyword segments using existing sentence-level transcriptions and proposes domain-specific KWS-based solutions to detect conversational topics in audio streams.
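To see why sentence-level transcriptions alone yield coarse keyword segments, the sketch below estimates a keyword's time span by spreading a segment's duration uniformly over its characters. This is a deliberately naive baseline invented for illustration, not the study's method; it ignores speaking rate, pauses, and channel noise, which is exactly why word-level annotation is hard to recover this way.

```python
def naive_keyword_segment(transcript, keyword, seg_start, seg_end):
    """Guess a keyword's (start, end) time inside a sentence-level
    segment by assuming every character takes equal time.
    A crude stand-in for forced alignment, for illustration only."""
    total = len(transcript)
    dur = seg_end - seg_start
    pos = 0
    for w in transcript.split():
        start = seg_start + dur * pos / total
        end = seg_start + dur * (pos + len(w)) / total
        if w.lower() == keyword.lower():
            return (round(start, 2), round(end, 2))
        pos += len(w) + 1   # +1 accounts for the separating space
    return None

# Hypothetical 3-second segment with a known sentence transcription.
print(naive_keyword_segment("go for launch", "launch", 10.0, 13.0))
```

Any hesitation or drawl in the real audio shifts these boundaries, so KWS models trained on such segments inherit systematic timing error.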