When does a gradual learning rule translate into gradual learning performance? This paper studies a gradient-ascent Maximum Entropy phonotactic learner, as applied to two- alternative forced-choice performance expressed as log-odds. The main result is that slow initial performance cannot accelerate later if the initial weights are near zero, but can if they are not. Stated another way, abrupt- ness in this learner is an effect of transfer, either from Universal Grammar in the form of an initial weighting, or from previous learning in the form of an acquired weighting.
more »
« less
Can you judge what you don’t hear? Perception as a source of gradient wordlikeness judgements
A key observation about wordlikeness judgements, going back to some of the earliest work on the topic is that they are gradient in the sense that nonce words tend to form a cline of acceptability. In recent years, such gradience has been modelled as stemming from a gradient phonotactic grammar or from a lexical similarity effect. In this article, we present two experiments that suggest that at least some of the observed gradience stems from gradience in perception. More generally, the results raise the possibility that the gradience observed in wordlikeness tasks may not come from a gradient phonotactic/phonological grammar.
more »
« less
- Award ID(s):
- 2213960
- PAR ID:
- 10575009
- Publisher / Repository:
- Glossa
- Date Published:
- Journal Name:
- Glossa: a journal of general linguistics
- Volume:
- 8
- Issue:
- 1
- ISSN:
- 2397-1835
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
To ascertain the importance of phonetic information in the form of phonological distinctive features for the purpose of segment-level phonotactic acquisition, we compare the performance of two recurrent neural network models of phonotactic learning: one that has access to distinctive features at the start of the learning process, and one that does not. Though the predictions of both models are significantly correlated with human judgments of non-words, the feature-naive model significantly outperforms the feature-aware one in terms of probability assigned to a held-out test set of English words, suggesting that distinctive features are not obligatory for learning phonotactic patterns at the segment level.more » « less
-
Tim Hunter; Brandon Prickett (Ed.)Models of phonotactics include subsegmental representations in order to generalize to unattested sequences. These representations can be encoded in at least two ways: as discrete, phonetically-based features, or as continuous, distribution-based representations induced from the statistical patterning of sounds. Because phonological theory typically assumes that representations are discrete, past work has reduced continuous representations to discrete ones, which eliminates potentially relevant information. In this paper we present a model of phonotactics that can use continuous representations directly, and show that this approach yields competitive performance on modeling experimental judgments of English sonority sequencing. The proposed model broadens the space of possible phonotactic models by removing requirements for discrete features, and is a step towards an integrated picture of phonotactic learning based on distributional statistics and continuous representations.more » « less
-
null (Ed.)Purpose The experiment reported here compared two hypotheses for the poor statistical and artificial grammar learning often seen in children and adults with developmental language disorder (DLD; also known as specific language impairment). The procedural learning deficit hypothesis states that implicit learning of rule-based input is impaired, whereas the sequential pattern learning deficit hypothesis states that poor performance is only seen when learners must implicitly compute sequential dependencies. The current experiment tested learning of an artificial grammar that could be learned via feature activation, as observed in an associatively organized lexicon, without computing sequential dependencies and should therefore be learnable on the sequential pattern learning deficit hypothesis, but not on the procedural learning deficit hypothesis. Method Adults with DLD and adults with typical language development (TD) listened to consonant–vowel–consonant–vowel familiarization words from one of two artificial phonological grammars: Family Resemblance (two out of three features) and a control (exclusive OR, in which both consonants are voiced OR both consonants are voiceless) grammar in which no learning was predicted for either group. At test, all participants rated 32 test words as to whether or not they conformed to the pattern in the familiarization words. Results Adults with DLD and adults with TD showed equal and robust learning of the Family Resemblance grammar, accepting significantly more conforming than nonconforming test items. Both groups who were familiarized with the Family Resemblance grammar also outperformed those who were familiarized with the OR grammar, which, as predicted, was learned by neither group. Conclusion Although adults and children with DLD often underperform, compared to their peers with TD, on statistical and artificial grammar learning tasks, poor performance appears to be tied to the implicit computation of sequential dependencies, as predicted by the sequential pattern learning deficit hypothesis.more » « less
-
This study evaluates the malleability of adults' perception of probabilistic phonotactic (biphone) probabilities, building on a body of literature on statistical phonotactic learning. It was first replicated that listeners categorize phonetic continua as sounds that create higher-probability sequences in their native language. Listeners were also exposed to skewed distributions of biphone contexts, which resulted in the enhancement or reversal of these effects. Thus, listeners dynamically update biphone probabilities (BPs) and bring this to bear on perception of ambiguous acoustic information. These effects can override long-term BP effects rooted in native language experience.more » « less