Prominent sociolinguistic theories of language mixing have posited that single-word insertions of one language into the other are the result of a distinct process than multi-word alternations between two languages given that the former overwhelmingly surface morphosyntactically integrated into the surrounding language. To date, this distinction has not been tested in comprehension. The present study makes use of pupillometry to examine the online processing of single-word insertions and multi-word alternations by highly proficient Spanish-English bilinguals in Puerto Rico. Participants heard sentences containing target noun/adjective pairs (1) in unilingual Spanish, (2) where the Spanish noun was replaced with its English translation equivalent, followed by a Spanish post-nominal adjective, and (3) where both the noun and adjective appeared in English with the adjective occurring in the English pre-nominal position. Both types of language mixing elicit larger pupillary responses when compared to unilingual Spanish speech, though the magnitude of this difference depends on the grammatical gender of the target noun. Importantly, single-word insertions and multi-word alternations did not differ from one another. Taken together, these findings suggest that morphosyntactic integration is not the defining feature of single-word insertions, at least in comprehension, and that the comprehension system is tuned to the distributional properties of bilingual speech.
more »
« less
Informational goals, sentence structure, and comparison class inference
Understanding a gradable adjective (e.g., big) requires making reference to a comparison class, a set of objects or entities against which the referent is implicitly compared (e.g., big for a Great Dane), but how do listeners decide upon a comparison class? Simple models of semantic composition stipulate that the adjective combines with a noun, which necessarily be- comes the comparison class (e.g., “That Great Dane is big” means big for a Great Dane). We investigate an alternative hypothesis built on the idea that the utility of a noun in an adjectival utterance can be either for reference (getting the listener to attend to the right object) or predication (describing a property of the referent). Therefore, we hypothesize that when the presence of a noun N can be explained away by its utility in reference (e.g., being in the subject position: “That N is big”), it is less likely to set the comparison class. Across three pre-registered experiments, we find evidence that listeners use the noun as a cue to infer comparison classes consistent with a trade-off between reference and predication. This work highlights the complexity of the relation between the form of an utterance and its meaning.
more »
« less
- Award ID(s):
- 1911790
- PAR ID:
- 10159025
- Date Published:
- Journal Name:
- Proceedings of the Annual Conference of the Cognitive Science Society
- ISSN:
- 1069-7977
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Nominal compounds (N-N) and noun-adjective (N-Adj) sequences share a distinctive morphotonology and behave as (extended) prosodic words in four Bozo languages studied. Input N and Adj stems of various tone melodies are scanned for tonal characteristics that classify the numerous melodies into just two melodic superclasses for initials. Separately, finals are also scanned and classified. The criterial tonal feature varies from language to language and from initial to final; it may be the leftmost tone element or a configuration (level versus contour). Tone overlays are then associated with the initial, the final, or both jointly. In some cases, the lexical melody of the initial is overwritten locally, but is expressed at a distance by determining or at least influencing the overlay on the final.In the neighboring isolate Bangime, a structurally similar scan-classify-overlay system is at work in definite and possessed NPs.In Bozo and Bangime, an overlaid tone pattern may differ from or even invert the (lexical) melody. However, because overlays are associated with melodic superclasses, they allow partial recovery of melodies by listeners. The scan-classify-overlay model is distinct both from ordinary tonal morphophonology (which directly operates on lexical tones) and from true replacive tonal ablaut (which irrecoverably erases melodies).more » « less
-
How do children learn to connect expressions (e.g “that red apple”) to the real-world objects they refer to? The dominant view in developmental psychology is that children rely primarily on descriptive information encoded in content words (red, apple). In contrast, linguistic semantic theories of adult language attribute primacy to the grammar (e.g. words like that, another), which first establish the status of potential referents within the discourse context (old, new) before descriptive information can factor in. These theories predict that reference can succeed even when the description does not match the referent. We explore this novel prediction in adults and children. Over three experiments, we found that (i) adults relied on the articles to establish the referent, even when the noun description did not fit, consistent with grammar-first accounts; (ii) consistent with description-first accounts, and contrary to adult behavior, 3-5yo children prioritized the descriptions provided by the nouns, despite being sensitive to grammatical information.more » « less
-
Learning representations of words in a continuous space is perhaps the most fundamental task in NLP, however words interact in ways much richer than vector dot product similarity can provide. Many relationships between words can be expressed set-theoretically, for example, adjective-noun compounds (eg. “red cars”⊆“cars”) and homographs (eg. “tongue”∩“body” should be similar to “mouth”, while “tongue”∩“language” should be similar to “dialect”) have natural set-theoretic interpretations. Box embeddings are a novel region-based representation which provide the capability to perform these set-theoretic operations. In this work, we provide a fuzzy-set interpretation of box embeddings, and learn box representations of words using a set-theoretic training objective. We demonstrate improved performance on various word similarity tasks, particularly on less common words, and perform a quantitative and qualitative analysis exploring the additional unique expressivity provided by Word2Box.more » « less
-
When a language offers multiple options for expressing the same meaning, what principles govern a speaker’s choice? Two well-known principles proposed for explaining wideranging speaker preference are Uniform Information Density and Availability-Based Production. Here we test the predictions of these theories in a previously uninvestigated case of speaker choice. Russian has two ways of expressing the comparative: an EXPLICIT option (Ona bystree chem ja/She fast- COMP than me-NOM) and a GENITIVE option (Ona bystree menya/She fast-COMP me-GEN). We lay out several potential predictions of each theory for speaker choice in the Russian comparative construction, including effects of postcomparative word predictability, phrase length, syntactic complexity, and semantic association between the comparative adjective and subsequent noun. In a corpus study, we find that the explicit construction is used preferentially when the postcomparative noun phrase is longer, has a relative clause, and is less semantically associated with the comparative adjective. A follow-up production experiment using visual scene stimuli to elicit comparative sentences replicates the corpus finding that Russian native speakers prefer the explicit form when post-comparative phrases are longer. These findings offer no clear support for the predictions of Uniform Information Density, but are broadly supportive of Availability- Based Production, with the explicit option serving as an unreduced form that eases speakers’ planning of complex or lowavailability utterances. Code for this study is availablemore » « less
An official website of the United States government

