skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Incremental learning of lexically specific morphophonology: an integrative approach
Abstract Acquisition and processing results indicate idiosyncratic, lexical knowledge interacts with productive, grammatical knowledge in systematic ways. Evidence from language processing demonstrates that higher frequency and less productive complex words are more likely to be retrieved holistically from the mental lexicon rather than being decomposed into constituents. Meanwhile, acquisition findings in both natural and artificial languages provide important evidence about the time course of learning and the role that lexical conditioning plays during development. Finally, work in experimental phonology demonstrates adult native speakers have productive knowledge of lexicalized morphophonological patterns, with learners extending lexical trends stochastically to novel forms. Although there have been significant computational advances in modeling the learning of such gradient grammatical generalizations, less attention has been devoted to modeling the joint learning of both the lexical and grammatical components of these systems, their interactions, and their connections to the processing and acquisition findings. This paper examines concrete predictions that two families of existing models of lexical and grammatical learning make across all three of these empirical domains. I show the models already align well with existing processing evidence on lexicalization effects, identify aspects of their predictions requiring further attention, and discuss implications for linguistic theory and computational modeling.  more » « less
Award ID(s):
2140826
PAR ID:
10677536
Author(s) / Creator(s):
 
Publisher / Repository:
De Gruyter Brill
Date Published:
Journal Name:
Linguistics Vanguard
ISSN:
2199-174X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Cascadilla Press (Ed.)
    The morphosyntactic information in grammatical number marking may be a useful cue for children in the process of acquiring number words. A language with dual marking, like Slovenian, may help children to bootstrap the meaning of the word “two” by drawing their attention to sets of two as a referent of language. If the dual marker indeed facilitates number learning, then we hypothesized that “two” should be acquired earlier in populations exposed to the dual marker; the dual should be learned before “two”; and knowledge of the dual form should be correlated with knowledge of “two”. We tested these hypotheses by having Slovenian and English-speaking children complete the Give-a-Number and Give-Morphology tasks. We analyzed the Give-Morphology in a new way, using stricter criteria to determine that children “know” the morphological markers than simple percent correct. In this sample, Slovenian children exposed to the dual marker did not show evidence of knowing “two” (i.e., being 2-knowers) at very young ages or earlier than English-speaking children. Knowledge of the dual marker did not precede nor correlate with the acquisition of “two”; indeed, the dual form was only acquired after the singular and plural. These analyses were conducted using an open data set with more Slovenian 2-knowers, yielding similar results. These findings present challenges for the view that grammatical number plays a role in number acquisition. This theory requires articulation about how a dual-marked language can facilitate number acquisition if children do not notice or learn the dual form. The information in grammatical number marking may be a useful cue for children in the process of acquiring number words. A language with dual marking, like Slovenian, may help children to bootstrap the meaning of the word “two” by drawing their attention to sets of two as a referent of language. If the dual marker indeed facilitates number learning, we hypothesized that “two” should be acquired earlier in populations exposed to the dual marker; the dual should be learned before “two”; and knowledge of the dual form should be correlated with knowledge of “two”. We tested these hypotheses by having Slovenian and English-speaking children complete the Give-a-Number and Give-Morphology tasks. We analyzed the Give-Morphology in a new way, using stricter criteria to determine that children “know” the morphological markers than simple percent correct. In this sample, Slovenian children exposed to the dual marker did not show evidence of knowing “two” (i.e., being 2-knowers) at very young ages or earlier than English-speaking children. Knowledge of the dual marker did not precede nor correlate with the acquisition of “two”. Indeed, the dual form was acquired only after the singular and plural. Parallel analyses were also conducted using an open data set with more Slovenian 2-knowers, yielding similar results. These findings present challenges for the claim that grammatical number plays a role in number acquisition. Specifically, this theory requires better articulation about how a dual-marked language can facilitate number acquisition if children do not notice or learn the dual form. 
    more » « less
  2. Language understanding involves processing text with both the grammatical and 2 common-sense contexts of the text fragments. The text “I went to the grocery store 3 and brought home a car” requires both the grammatical context (syntactic) and 4 common-sense context (semantic) to capture the oddity in the sentence. Contex5 tualized text representations learned by Language Models (LMs) are expected to 6 capture a variety of syntactic and semantic contexts from large amounts of training 7 data corpora. Recent work such as ERNIE has shown that infusing the knowl8 edge contexts, where they are available in LMs, results in significant performance 9 gains on General Language Understanding (GLUE) benchmark tasks. However, 10 to our knowledge, no knowledge-aware model has attempted to infuse knowledge 11 through top-down semantics-driven syntactic processing (Eg: Common-sense to 12 Grammatical) and directly operated on the attention mechanism that LMs leverage 13 to learn the data context. We propose a learning framework Top-Down Language 14 Representation (TDLR) to infuse common-sense semantics into LMs. In our 15 implementation, we build on BERT for its rich syntactic knowledge and use the 16 knowledge graphs ConceptNet and WordNet to infuse semantic knowledge. 
    more » « less
  3. Pronouns that precede their antecedents are called cataphors. Upon encountering a cataphor, comprehenders engage in an active, eager search for its antecedent later in the sentence. The processing of cataphoric dependencies has been used to probe comprehenders’ incremental expectations and how grammatical knowledge guides those expectations. Past experiments have investigated whether knowledge of Binding Principles B and C constrains comprehenders’ initial expectations for coreference or whether grammatical knowledge applies with a delay. Although studies suggest that the Binding Principles B and C influence the earliest stages of cataphor resolution, recent findings have called into question whether the main experimental paradigm used to test the time-course of constraint-sensitivity – self-paced reading – has the necessary temporal precision to adjudicate between competing accounts. In this paper, we investigate the application of Principle B during cataphor processing using eye-tracking-while-reading, an experimental paradigm with more fine-grained temporal sensitivity. Our results converge with prior findings that Principle B strongly constrains active cataphor resolution. We end by discussing how to model Principle B sensitivity via predictions made at the level of the discourse representation. 
    more » « less
  4. Convolutional neural networks trained using manually generated labels are commonly used for semantic or instance segmentation. In precision agriculture, automated flower detection methods use supervised models and post-processing techniques that may not perform consistently as the appearance of the flowers and the data acquisition conditions vary. We propose a self-supervised learning strategy to enhance the sensitivity of segmentation models to different flower species using automatically generated pseudo-labels. We employ a data augmentation and refinement approach to improve the accuracy of the model predictions. The augmented semantic predictions are then converted to panoptic pseudo-labels to iteratively train a multi-task model. The self-supervised model predictions can be refined with existing post-processing approaches to further improve their accuracy. An evaluation on a multi-species fruit tree flower dataset demonstrates that our method outperforms state-of-the-art models without computationally expensive post-processing steps, providing a new baseline for flower detection applications. 
    more » « less
  5. Computational models of learning can be powerful tools to test educational technologies, automate the authoring of instructional software, and advance theories of learning. These mechanistic models of learning, which instantiate computational theories of the learning process, are capable of making predictions about learners’ performance in instructional technologies given only the technology itself without fitting any parameters to existing learners’ data. While these so call “zero-parameter” models have been successful in modeling student learning in intelligent tutoring systems they still show systematic deviation from human learning performance. One deviation stems from the computational models’ lack of prior knowledge—all models start off as a blank slate—leading to substantial differences in performance at the first practice opportunity. In this paper, we explore three different strategies for accounting for prior knowledge within computational models of learning and the effect of these strategies on the predictive accuracy of these models. 
    more » « less