A framework for evaluating speech representations
Listeners track distributions of speech sounds along perceptual dimensions. We introduce a method for evaluating hypotheses about what those dimensions are, using a cognitive model whose prior distribution is estimated directly from speech recordings. We use this method to evaluate two speaker normalization algorithms against human data. Simulations show that representations that are normalized across speakers predict human discrimination data better than unnormalized representations, consistent with previous research. Results further reveal differences across normalization methods in how well each predicts human data. This work provides a framework for evaluating hypothesized representations of speech and lays the groundwork for testing models of speech perception on natural speech recordings from ecologically valid settings.
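As a concrete illustration of this style of evaluation, the sketch below pairs Lobanov-style z-score normalization (one widely used speaker normalization method, named here only as an example) with a Gaussian-mixture prior estimated directly from speech features, and predicts the discriminability of two stimuli as the distance between their posterior means under an ideal-observer model with additive perceptual noise. The function names, the noise model, and all parameter values are illustrative assumptions, not the paper's implementation.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def lobanov_normalize(formants, speaker_ids):
    """Z-score formant features within each speaker (Lobanov-style
    normalization, one common speaker normalization method)."""
    normalized = np.empty_like(formants, dtype=float)
    for spk in np.unique(speaker_ids):
        mask = speaker_ids == spk
        mu = formants[mask].mean(axis=0)
        sd = formants[mask].std(axis=0)
        normalized[mask] = (formants[mask] - mu) / sd
    return normalized

def predicted_discriminability(x1, x2, prior, noise_var):
    """Ideal-observer-style prediction: listeners infer an intended target T
    from a noisy percept S; discriminability is the distance between
    posterior expectations E[T|S]. Assumes a Gaussian-mixture prior and
    isotropic perceptual noise (illustrative, not the paper's model)."""
    def posterior_mean(s):
        # component responsibilities for s (approximate: computed under the
        # prior rather than the noise-convolved marginal)
        resp = prior.predict_proba(s[None, :])[0]
        post = np.zeros_like(s)
        for k in range(prior.n_components):
            cov_k = prior.covariances_[k]
            # per-component posterior mean shrinks s toward the category mean
            gain = cov_k @ np.linalg.inv(cov_k + noise_var * np.eye(len(s)))
            post += resp[k] * (prior.means_[k] + gain @ (s - prior.means_[k]))
        return post
    return np.linalg.norm(posterior_mean(x1) - posterior_mean(x2))

# Toy usage: synthetic two-speaker, two-category "formant" data
rng = np.random.default_rng(0)
cats = rng.integers(0, 2, 400)
spk = rng.integers(0, 2, 400)
feats = cats[:, None] * [300.0, -400.0] + spk[:, None] * [150.0, 250.0] \
        + rng.normal(0, 40, (400, 2)) + [500.0, 1500.0]
norm = lobanov_normalize(feats, spk)
prior = GaussianMixture(n_components=2, covariance_type="full").fit(norm)
d = predicted_discriminability(norm[0], norm[1], prior, noise_var=0.1)
print(f"predicted discriminability: {d:.2f}")
```

Model-predicted distances like `d` can then be correlated with human discrimination data, once for each candidate representation, which is the comparison the abstract describes.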
- Award ID(s): 1320410
- PAR ID: 10057883
- Date Published:
- Journal Name: Proceedings of the Annual Conference of the Cognitive Science Society
- ISSN: 1069-7977
- Page Range / eLocation ID: 1919-1924
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- When we vocalize, our brain distinguishes self-generated sounds from external ones. A corollary discharge signal supports this function in animals; in humans, however, its exact origin and temporal dynamics remain unknown. We report electrocorticographic (ECoG) recordings in neurosurgical patients and a novel connectivity approach based on Granger causality that reveals major neural communications. We find a reproducible source for corollary discharge across multiple speech production paradigms, localized to ventral speech motor cortex before speech articulation. The uncovered discharge predicts the degree of auditory cortex suppression during speech, its well-documented consequence. These results reveal the human corollary discharge source and timing, with far-reaching implications for speech motor control as well as auditory hallucinations in human psychosis.
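The connectivity approach itself is novel to that paper, but the underlying idea can be illustrated with textbook pairwise Granger causality: does one channel's past improve prediction of another channel's future beyond that channel's own past? A minimal sketch on synthetic data (the channel roles and lag are made up for illustration):

```python
import numpy as np
from statsmodels.tsa.stattools import grangercausalitytests

# Synthetic example: "auditory" is a delayed, noisy copy of "motor",
# so motor activity should Granger-cause auditory activity at lag 5.
rng = np.random.default_rng(0)
motor = rng.standard_normal(2000)
auditory = np.roll(motor, 5) + 0.5 * rng.standard_normal(2000)

# statsmodels tests whether the second column Granger-causes the first
data = np.column_stack([auditory, motor])
results = grangercausalitytests(data, maxlag=8)

f_stat, p_value, _, _ = results[5][0]["ssr_ftest"]
print(f"lag 5: F = {f_stat:.1f}, p = {p_value:.3g}")  # p should be tiny
```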
- Integrating spatial context into large language models (LLMs) has the potential to revolutionize human-computer interaction, particularly in wearable devices. In this work, we present a novel system architecture that incorporates spatial speech understanding into LLMs, enabling contextually aware and adaptive applications for wearable technologies. Our approach leverages microstructure-based spatial sensing to extract precise Direction of Arrival (DoA) information using a monaural microphone. To address the lack of an existing dataset for microstructure-assisted speech recordings, we synthetically create a dataset called OmniTalk from the LibriSpeech corpus. This spatial information is fused with linguistic embeddings from OpenAI's Whisper model, allowing each modality to learn complementary contextual representations. The fused embeddings are aligned with the input space of the LLaMA-3.2 3B model and fine-tuned with the lightweight adaptation technique LoRA to optimize for on-device processing.
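As a sketch of what such a fusion layer might look like, the module below embeds a scalar DoA angle with a small MLP, broadcasts it across the speech encoder's time axis, and projects the concatenation to the LLM's hidden size (3072 for LLaMA-3.2 3B). The dimensions, layer choices, and names are illustrative assumptions, not the system's actual architecture.

```python
import torch
import torch.nn as nn

class SpatialSpeechFusion(nn.Module):
    """Illustrative fusion of a Direction-of-Arrival (DoA) estimate with
    speech-encoder embeddings, projected to an LLM's hidden size.
    Dimensions and layers are assumptions, not the paper's spec."""
    def __init__(self, speech_dim=768, llm_dim=3072, doa_dim=64):
        super().__init__()
        self.doa_embed = nn.Sequential(          # embed a scalar angle
            nn.Linear(1, doa_dim), nn.GELU(), nn.Linear(doa_dim, doa_dim)
        )
        self.project = nn.Linear(speech_dim + doa_dim, llm_dim)

    def forward(self, speech_emb, doa_angle):
        # speech_emb: (batch, seq, speech_dim), e.g. Whisper encoder states
        # doa_angle: (batch, 1), estimated direction of arrival
        doa = self.doa_embed(doa_angle).unsqueeze(1)   # (batch, 1, doa_dim)
        doa = doa.expand(-1, speech_emb.size(1), -1)   # broadcast over time
        fused = torch.cat([speech_emb, doa], dim=-1)
        return self.project(fused)                      # LLM input embeddings

# Usage sketch
model = SpatialSpeechFusion()
out = model(torch.randn(2, 100, 768), torch.randn(2, 1))
print(out.shape)  # torch.Size([2, 100, 3072])
```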
- We create a reusable Transformer, BrainBERT, for intracranial field potential recordings, bringing modern representation learning approaches to neuroscience. Much like in NLP and speech recognition, this Transformer enables classifying complex concepts, i.e., decoding neural data, with higher accuracy and with much less data by being pretrained in an unsupervised manner on a large corpus of unannotated neural recordings. Our approach generalizes to new subjects with electrodes in new positions and to unrelated tasks, showing that the representations robustly disentangle the neural signal. Just as in NLP one can study language by investigating what a language model learns, this approach enables investigating the brain by studying what a model of the brain learns. As a first step along this path, we demonstrate a new analysis of the intrinsic dimensionality of the computations in different areas of the brain. To construct BrainBERT, we combine super-resolution spectrograms of neural data with an approach designed for generating contextual representations of audio by masking. In the future, far more concepts will be decodable from neural recordings by using representation learning, potentially unlocking the brain like language models unlocked language.
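The masked-reconstruction objective this describes can be sketched in a few lines: hide a random subset of spectrogram time bins, encode the corrupted input, and penalize reconstruction error only at the masked positions. The masking fraction, encoder size, and spectrogram shape below are placeholders, not BrainBERT's actual configuration.

```python
import torch
import torch.nn as nn

def mask_spectrogram(spec, mask_frac=0.15, rng=None):
    """Zero out a random subset of time bins of a (time, freq) spectrogram
    and return the mask; the model learns to reconstruct the masked bins.
    Fraction and masking scheme are illustrative assumptions."""
    rng = rng or torch.Generator().manual_seed(0)
    mask = torch.rand(spec.size(0), generator=rng) < mask_frac
    masked = spec.clone()
    masked[mask] = 0.0        # zero whole time steps
    return masked, mask

# BERT-style objective: reconstruction loss on masked positions only
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=128, nhead=8, batch_first=True),
    num_layers=2,
)
spec = torch.randn(200, 128)                  # (time, freq) neural spectrogram
masked, mask = mask_spectrogram(spec)
recon = encoder(masked.unsqueeze(0)).squeeze(0)
loss = ((recon[mask] - spec[mask]) ** 2).mean()
print(loss.item())
```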
- Neural speech tracking has advanced our understanding of how our brains rapidly map an acoustic speech signal onto linguistic representations and ultimately meaning. It remains unclear, however, how speech intelligibility is related to the corresponding neural responses. Many studies addressing this question vary the level of intelligibility by manipulating the acoustic waveform, but this makes it difficult to cleanly disentangle the effects of intelligibility from underlying acoustical confounds. Here, using magnetoencephalography recordings, we study neural measures of speech intelligibility by manipulating intelligibility while keeping the acoustics strictly unchanged. Acoustically identical degraded speech stimuli (three-band noise-vocoded, ~20 s duration) are presented twice, but the second presentation is preceded by the original (nondegraded) version of the speech. This intermediate priming, which generates a "pop-out" percept, substantially improves the intelligibility of the second degraded speech passage. We investigate how intelligibility and acoustical structure affect acoustic and linguistic neural representations using multivariate temporal response functions (mTRFs). As expected, behavioral results confirm that perceived speech clarity is improved by priming. mTRF analysis reveals that auditory (speech envelope and envelope onset) neural representations are not affected by priming but only by the acoustics of the stimuli (bottom-up driven). Critically, our findings suggest that segmentation of sounds into words emerges with better speech intelligibility, and most strongly at the later (~400 ms latency) word processing stage, in prefrontal cortex, in line with engagement of top-down mechanisms associated with priming. Taken together, our results show that word representations may provide some objective measures of speech comprehension.
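The TRF at the heart of this analysis is standardly estimated as a regularized linear mapping from time-lagged stimulus features (e.g., the speech envelope) to the neural response. A minimal ridge-regression sketch on synthetic data, with the lag range and regularization strength as placeholder values:

```python
import numpy as np

def lagged_design(stimulus, max_lag):
    """Build a time-lagged design matrix: column `lag` holds the stimulus
    delayed by `lag` samples (zero-padded at the start)."""
    n = len(stimulus)
    X = np.zeros((n, max_lag + 1))
    for lag in range(max_lag + 1):
        X[lag:, lag] = stimulus[: n - lag]
    return X

def estimate_trf(stimulus, response, max_lag=50, ridge=1.0):
    """Ridge-regression TRF: w = (X'X + aI)^{-1} X'y. The lag range and
    regularization strength are placeholders, not the paper's settings."""
    X = lagged_design(stimulus, max_lag)
    XtX = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(XtX, X.T @ response)

# Usage sketch: speech envelope -> one MEG channel's response
rng = np.random.default_rng(0)
envelope = rng.standard_normal(5000)
true_trf = np.exp(-np.arange(51) / 10.0)      # toy response kernel
response = lagged_design(envelope, 50) @ true_trf \
           + 0.1 * rng.standard_normal(5000)
w = estimate_trf(envelope, response)          # recovers the toy kernel
print(np.round(w[:5], 2))
```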