Title: EARSHOT: A minimal network model of human speech recognition that operates on real speech
Despite the lack of invariance problem (the many-to-many mapping between acoustics and percepts), we experience phonetic constancy and typically perceive what a speaker intends. Models of human speech recognition have side-stepped this problem, working with abstract, idealized inputs and deferring the challenge of working with real speech. In contrast, automatic speech recognition powered by deep learning networks has allowed robust, real-world speech recognition. However, the complexities of deep learning architectures and training regimens make it difficult to use them to provide direct insights into mechanisms that may support human speech recognition. We developed a simple network that borrows one element from automatic speech recognition (long short-term memory nodes, which provide dynamic memory for short and long spans). This allows the network to learn to map real speech from multiple talkers to semantic targets with high accuracy. Internal representations emerge that resemble phonetically organized responses in human superior temporal gyrus, suggesting that the model develops a distributed phonological code despite no explicit training on phonetic or phonemic targets. The ability to work with real speech is a major advance for cognitive models of human speech recognition.
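As a concrete illustration of the mechanism the abstract describes, the following is a minimal sketch of an LSTM network that maps acoustic frames to a semantic target pattern. It assumes PyTorch, log-mel spectrogram inputs, and sparse random pseudo-semantic vectors; the layer sizes, loss, and targets are illustrative and not the authors' exact configuration.

```python
# Minimal sketch of an EARSHOT-like network: one LSTM layer mapping acoustic
# frames to a semantic (lexical) pattern. Sizes and targets are assumptions.
import torch
import torch.nn as nn

class EarshotLikeNet(nn.Module):
    """One LSTM layer over spectrogram frames, read out as a semantic vector."""
    def __init__(self, n_mels: int = 80, hidden: int = 512, n_semantic: int = 300):
        super().__init__()
        self.lstm = nn.LSTM(input_size=n_mels, hidden_size=hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_semantic)

    def forward(self, spec: torch.Tensor) -> torch.Tensor:
        # spec: (batch, time, n_mels); the model emits a semantic estimate each frame
        h, _ = self.lstm(spec)
        return torch.sigmoid(self.out(h))

# Toy training step: push the output toward the word's semantic vector at every frame.
model = EarshotLikeNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
spec = torch.randn(4, 120, 80)                # 4 tokens, 120 frames of 80 mel bands
target = (torch.rand(4, 300) < 0.05).float()  # sparse pseudo-semantic vectors
loss = nn.functional.binary_cross_entropy(
    model(spec), target.unsqueeze(1).expand(-1, 120, -1))
loss.backward()
optimizer.step()
```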
Award ID(s):
1754284
PAR ID:
10137127
Author(s) / Creator(s):
Date Published:
Journal Name:
Proceedings of the Cognitive Science Society
Page Range / eLocation ID:
2248-2253
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract: Despite the lack of invariance problem (the many-to-many mapping between acoustics and percepts), human listeners experience phonetic constancy and typically perceive what a speaker intends. Most models of human speech recognition (HSR) have side-stepped this problem, working with abstract, idealized inputs and deferring the challenge of working with real speech. In contrast, carefully engineered deep learning networks allow robust, real-world automatic speech recognition (ASR). However, the complexities of deep learning architectures and training regimens make it difficult to use them to provide direct insights into mechanisms that may support HSR. In this brief article, we report preliminary results from a two-layer network that borrows one element from ASR, long short-term memory nodes, which provide dynamic memory for a range of temporal spans. This allows the model to learn to map real speech from multiple talkers to semantic targets with high accuracy, with a human-like timecourse of lexical access and phonological competition. Internal representations emerge that resemble phonetically organized responses in human superior temporal gyrus, suggesting that the model develops a distributed phonological code despite no explicit training on phonetic or phonemic targets. The ability to work with real speech is a major advance for cognitive models of HSR.
  2. Audio-based human activity recognition (HAR) is popular because many human activities have unique sound signatures that can be detected with machine learning (ML). Audio-based ML HAR pipelines often use common featurization techniques, such as converting time-domain signals to the frequency domain (via an FFT) and extracting various statistical and spectral features to train ML models. Some of these approaches also claim privacy benefits by preventing the identification of human speech. However, recent deep learning-based automatic speech recognition (ASR) models pose new privacy challenges to these featurization techniques. In this paper, we systematically evaluate various featurization approaches for audio data, assessing their privacy risks through speech-intelligibility metrics (phoneme error rate, PER, and word error rate, WER) while considering the utility tradeoff in terms of ML-based activity recognition accuracy. Our findings reveal the susceptibility of these approaches to speech content recovery when exposed to recent ASR models, especially under re-tuning or retraining conditions. Notably, fine-tuned ASR models achieved an average PER of 39.99% and WER of 44.43% in speech recognition for these approaches. To address these privacy concerns, we propose Kirigami, a lightweight ML-based audio speech filter that removes human speech segments, reducing the efficacy of ASR models (70.48% PER and 101.40% WER) while maintaining HAR accuracy (76.0%). We show that Kirigami can be implemented on common edge microcontrollers with limited computational capabilities and memory, providing a path to deployment on a variety of IoT devices. Finally, we conducted a real-world user study and showed the robustness of Kirigami on a laptop and an ARM Cortex-M4F microcontroller under three different background noises.
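For context, the following is a minimal sketch of the kind of FFT-based featurization such HAR pipelines use, assuming NumPy and 16 kHz mono audio; the specific features, frame sizes, and classifiers evaluated in the paper may differ.

```python
# Minimal sketch: convert time-domain audio frames to frequency-domain
# statistical/spectral features suitable for training an ML activity classifier.
import numpy as np

def spectral_features(frame: np.ndarray, sr: int = 16000) -> np.ndarray:
    """Return a small frequency-domain feature vector for one audio frame."""
    windowed = frame * np.hanning(len(frame))
    mag = np.abs(np.fft.rfft(windowed))                  # magnitude spectrum
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    power = mag ** 2
    total = power.sum() + 1e-12
    centroid = (freqs * power).sum() / total             # spectral centroid
    rolloff = freqs[np.searchsorted(np.cumsum(power), 0.85 * total)]  # 85% roll-off
    flatness = np.exp(np.log(mag + 1e-12).mean()) / (mag.mean() + 1e-12)
    return np.array([centroid, rolloff, flatness, mag.mean(), mag.std()])

# Example: features for 32 ms frames of a one-second clip, ready for a classifier.
audio = np.random.randn(16000)
frames = audio[: 16000 - 16000 % 512].reshape(-1, 512)
X = np.stack([spectral_features(f) for f in frames])
```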
  3. State-of-the-art models in automatic speech recognition have shown remarkable improvements due to modern self-supervised (SSL) transformer-based architectures such as wav2vec 2.0 (Baevski et al., 2020). However, how these models encode phonetic information is still not well understood. We explore whether SSL speech models display a linguistic property that characterizes human speech perception: language specificity. We show that while wav2vec 2.0 displays an overall language-specificity effect when tested on Hindi vs. English, it does not resemble human speech perception when tested on finer-grained differences in Hindi speech contrasts.
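For reference, here is a minimal sketch of extracting frame-level representations from a pretrained wav2vec 2.0 model, assuming the Hugging Face transformers library and the facebook/wav2vec2-base checkpoint; the probing stimuli and analysis in the paper are considerably more involved.

```python
# Minimal sketch: frame-level wav2vec 2.0 embeddings that could be compared
# across contrasting stimuli (e.g., Hindi vs. English minimal pairs).
import numpy as np
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base")
model = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-base")
model.eval()

def frame_embeddings(waveform_16khz):
    """Return frame-level hidden states for one utterance."""
    inputs = extractor(waveform_16khz, sampling_rate=16000, return_tensors="pt")
    with torch.no_grad():
        return model(**inputs).last_hidden_state.squeeze(0)  # (frames, dim)

# Usage with one second of toy audio; real probes would use controlled stimuli.
emb = frame_embeddings(np.random.randn(16000).astype(np.float32))
```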
  4. The way listeners perceive speech sounds is largely determined by the language(s) they were exposed to as a child. For example, native speakers of Japanese have a hard time discriminating between American English /ɹ/ and /l/, a phonetic contrast that has no equivalent in Japanese. Such effects are typically attributed to knowledge of sounds in the native language, but quantitative models of how these effects arise from linguistic knowledge are lacking. One possible source for such models is Automatic Speech Recognition (ASR) technology. We implement models based on two types of systems from the ASR literature—hidden Markov models (HMMs) and the more recent, and more accurate, neural network systems—and ask whether, in addition to showing better performance, the neural network systems also provide better models of human perception. We find that while both types of systems can account for Japanese natives’ difficulty with American English /ɹ/ and /l/, only the neural network system successfully accounts for Japanese natives’ facility with Japanese vowel length contrasts. Our work provides a new example, in the domain of speech perception, of an often observed correlation between task performance and similarity to human behavior. 
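To make the modeling idea concrete, the following is a minimal sketch of scoring how discriminable two stimuli are from a model's frame-level phone posteriors; the symmetrized-KL frame distance, DTW alignment, and toy posteriorgrams are illustrative assumptions, not the paper's exact procedure.

```python
# Minimal sketch: a discrimination score between two stimuli computed from
# frame-level phone posteriors, in the spirit of using ASR systems as models
# of perception. A model that collapses /r/-/l/ yields smaller distances.
import numpy as np

def dtw_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Average frame distance along the best DTW alignment of two posteriorgrams.

    a, b: arrays of shape (frames, n_phones); rows are probability distributions.
    """
    eps = 1e-9
    kl = lambda p, q: np.sum(p * np.log((p + eps) / (q + eps)), axis=-1)
    # Symmetrized KL divergence between every pair of frames: shape (Ta, Tb).
    cost = kl(a[:, None, :], b[None, :, :]) + kl(b[None, :, :], a[:, None, :])
    Ta, Tb = cost.shape
    acc = np.full((Ta + 1, Tb + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, Ta + 1):
        for j in range(1, Tb + 1):
            acc[i, j] = cost[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1])
    return acc[Ta, Tb] / (Ta + Tb)

# Toy posteriorgrams with different dominant phone classes.
rng = np.random.default_rng(0)
def posteriorgram(frames, peak):
    p = rng.random((frames, 10))
    p[:, peak] += 5.0
    return p / p.sum(axis=1, keepdims=True)

print(dtw_distance(posteriorgram(40, peak=3), posteriorgram(45, peak=4)))
```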
  5. The ability to identify speech with similar emotional content is valuable to many applications, including speech retrieval, surveillance, and emotional speech synthesis. While current formulations in speech emotion recognition based on classification or regression are not appropriate for this task, solutions based on preference learning offer appealing alternatives. This paper aims to find speech samples that are emotionally similar to an anchor speech sample provided as a query. This novel formulation opens interesting research questions. How well can a machine complete this task? How does the accuracy of automatic algorithms compare to the performance of a human performing this task? This study addresses these questions by training a deep learning model using a triplet loss function, mapping the acoustic features into an embedding that is discriminative for this task. The network receives an anchor speech sample and two competing speech samples, and the task is to determine which of the candidate speech samples conveys emotional content closest to that of the anchor. By comparing the results from our model with human perceptual evaluations, this study demonstrates that the proposed approach performs very close to humans in retrieving samples with similar emotional content.
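As an illustration of the triplet-loss formulation described above, the following is a minimal sketch assuming PyTorch and precomputed utterance-level acoustic feature vectors; the feature set, network size, and margin are illustrative assumptions, not the paper's exact system.

```python
# Minimal sketch: train an embedding with a triplet loss so that emotionally
# similar utterances land closer together than dissimilar ones.
import torch
import torch.nn as nn

class EmotionEmbedder(nn.Module):
    """Map an utterance-level acoustic feature vector to a similarity embedding."""
    def __init__(self, n_feats: int = 128, dim: int = 64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(n_feats, 256), nn.ReLU(), nn.Linear(256, dim))

    def forward(self, x):
        return nn.functional.normalize(self.net(x), dim=-1)

model = EmotionEmbedder()
criterion = nn.TripletMarginLoss(margin=0.2)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Anchor/positive share emotional content; negative differs (toy random data here).
anchor, positive, negative = (torch.randn(32, 128) for _ in range(3))
loss = criterion(model(anchor), model(positive), model(negative))
loss.backward()
optimizer.step()

# At query time, candidates can be ranked by distance between their embeddings
# and the anchor's embedding (cosine or Euclidean, since embeddings are normalized).
```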