skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 10:00 PM ET on Thursday, March 12 until 2:00 AM ET on Friday, March 13 due to maintenance. We apologize for the inconvenience.


Title: Informational goals, sentence structure, and comparison class inference
Understanding a gradable adjective (e.g., big) requires making reference to a comparison class, a set of objects or entities against which the referent is implicitly compared (e.g., big for a Great Dane), but how do listeners decide upon a comparison class? Simple models of semantic composition stipulate that the adjective combines with a noun, which necessarily be- comes the comparison class (e.g., “That Great Dane is big” means big for a Great Dane). We investigate an alternative hypothesis built on the idea that the utility of a noun in an adjectival utterance can be either for reference (getting the listener to attend to the right object) or predication (describing a property of the referent). Therefore, we hypothesize that when the presence of a noun N can be explained away by its utility in reference (e.g., being in the subject position: “That N is big”), it is less likely to set the comparison class. Across three pre-registered experiments, we find evidence that listeners use the noun as a cue to infer comparison classes consistent with a trade-off between reference and predication. This work highlights the complexity of the relation between the form of an utterance and its meaning.  more » « less
Award ID(s):
1911790
PAR ID:
10159025
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Proceedings of the Annual Conference of the Cognitive Science Society
ISSN:
1069-7977
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Prominent sociolinguistic theories of language mixing have posited that single-word insertions of one language into the other are the result of a distinct process than multi-word alternations between two languages given that the former overwhelmingly surface morphosyntactically integrated into the surrounding language. To date, this distinction has not been tested in comprehension. The present study makes use of pupillometry to examine the online processing of single-word insertions and multi-word alternations by highly proficient Spanish-English bilinguals in Puerto Rico. Participants heard sentences containing target noun/adjective pairs (1) in unilingual Spanish, (2) where the Spanish noun was replaced with its English translation equivalent, followed by a Spanish post-nominal adjective, and (3) where both the noun and adjective appeared in English with the adjective occurring in the English pre-nominal position. Both types of language mixing elicit larger pupillary responses when compared to unilingual Spanish speech, though the magnitude of this difference depends on the grammatical gender of the target noun. Importantly, single-word insertions and multi-word alternations did not differ from one another. Taken together, these findings suggest that morphosyntactic integration is not the defining feature of single-word insertions, at least in comprehension, and that the comprehension system is tuned to the distributional properties of bilingual speech. 
    more » « less
  2. Nominal compounds (N-N) and noun-adjective (N-Adj) sequences share a distinctive morphotonology and behave as (extended) prosodic words in four Bozo languages studied. Input N and Adj stems of various tone melodies are scanned for tonal characteristics that classify the numerous melodies into just two melodic superclasses for initials. Separately, finals are also scanned and classified. The criterial tonal feature varies from language to language and from initial to final; it may be the leftmost tone element or a configuration (level versus contour). Tone overlays are then associated with the initial, the final, or both jointly. In some cases, the lexical melody of the initial is overwritten locally, but is expressed at a distance by determining or at least influencing the overlay on the final.In the neighboring isolate Bangime, a structurally similar scan-classify-overlay system is at work in definite and possessed NPs.In Bozo and Bangime, an overlaid tone pattern may differ from or even invert the (lexical) melody. However, because overlays are associated with melodic superclasses, they allow partial recovery of melodies by listeners. The scan-classify-overlay model is distinct both from ordinary tonal morphophonology (which directly operates on lexical tones) and from true replacive tonal ablaut (which irrecoverably erases melodies). 
    more » « less
  3. How do children learn to connect expressions (e.g “that red apple”) to the real-world objects they refer to? The dominant view in developmental psychology is that children rely primarily on descriptive information encoded in content words (red, apple). In contrast, linguistic semantic theories of adult language attribute primacy to the grammar (e.g. words like that, another), which first establish the status of potential referents within the discourse context (old, new) before descriptive information can factor in. These theories predict that reference can succeed even when the description does not match the referent. We explore this novel prediction in adults and children. Over three experiments, we found that (i) adults relied on the articles to establish the referent, even when the noun description did not fit, consistent with grammar-first accounts; (ii) consistent with description-first accounts, and contrary to adult behavior, 3-5yo children prioritized the descriptions provided by the nouns, despite being sensitive to grammatical information. 
    more » « less
  4. Learning representations of words in a continuous space is perhaps the most fundamental task in NLP, however words interact in ways much richer than vector dot product similarity can provide. Many relationships between words can be expressed set-theoretically, for example, adjective-noun compounds (eg. “red cars”⊆“cars”) and homographs (eg. “tongue”∩“body” should be similar to “mouth”, while “tongue”∩“language” should be similar to “dialect”) have natural set-theoretic interpretations. Box embeddings are a novel region-based representation which provide the capability to perform these set-theoretic operations. In this work, we provide a fuzzy-set interpretation of box embeddings, and learn box representations of words using a set-theoretic training objective. We demonstrate improved performance on various word similarity tasks, particularly on less common words, and perform a quantitative and qualitative analysis exploring the additional unique expressivity provided by Word2Box. 
    more » « less
  5. To begin learning their language, infants must locate words in the speech signal. Some models of word discovery presuppose that the discovery process depends on identifying phonetic segments (phones) in speech. To test the plausibility of models arguing that infants can reliably categorize consonants in speech, adult native speakers were asked to identify the consonant in vowel-consonant-vowel sequences extracted from spontaneous English infant-directed speech. Listeners could consistently identify some instances of consonants (for example, correctly indicating that an /s/ was an /s/). But many tokens (about half) were not consistently identifiable. Performance was significantly worse for codas than onsets. Providing the full utterance context in low-pass-filtered form did not aid recognition, nor did familiarization with the talker. In a second task, listeners were barely above chance in guessing whether a consonant was a word onset or a word-final coda. Performance on infant-directed speech was not markedly better than performance on a comparison set of adult-directed speech consonants. Erroneous responses frequently had little systematic resemblance to the correct answer. The results suggest that it is not plausible that infants can parse most utterances exhaustively into strings of uttered speech sounds and feed those strings into a statistical clustering mechanism. 
    more » « less