NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Rational Sentence Interpretation in Mandarin Chinese

https://doi.org/10.1111/cogs.13383

Zhan, Meilin; Chen, Sihan; Levy, Roger; Lu, Jiayi; Gibson, Edward (December 2023, Cognitive Science)

Abstract Previous work has shown that English native speakers interpret sentences as predicted by a noisy‐channel model: They integrate both the real‐world plausibility of the meaning—the prior—and the likelihood that the intended sentence may be corrupted into the perceived sentence. In this study, we test the noisy‐channel model in Mandarin Chinese, a language taxonomically different from English. We present native Mandarin speakers sentences in a written modality (Experiment 1) and an auditory modality (Experiment 2) in three pairs of syntactic alternations. The critical materials are literally implausible but require differing numbers and types of edits in order to form more plausible sentences. Each sentence is followed by a comprehension question that allows us to infer whether the speakers interpreted the item literally, or made an inference toward a more likely meaning. Similar to previous research on related English constructions, Mandarin participants made the most inferences for implausible materials that could be inferred as plausible by deleting a single morpheme or inserting a single morpheme. Participants were less likely to infer a plausible meaning for materials that could be inferred as plausible by making an exchange across a preposition. And participants were least likely to infer a plausible meaning for materials that could be inferred as plausible by making an exchange across a main verb. Moreover, we found more inferences in written materials than spoken materials, possibly a result of a lack of word boundaries in written Chinese. Overall, the fact that the results were so similar to those found in related constructions in English suggests that the noisy‐channel proposal is robust.
more » « less
Full Text Available
Eye Movement Traces of Linguistic Knowledge in Native and Non-Native Reading

https://doi.org/10.1162/opmi_a_00084

Berzak, Yevgeni; Levy, Roger (January 2023, Open Mind)

Abstract The detailed study of eye movements in reading has shed considerable light into how language processing unfolds in real time. Yet eye movements in reading remain inadequately studied in non-native (L2) readers, even though much of the world’s population is multilingual. Here we present a detailed analysis of the quantitative functional influences of word length, frequency, and predictability on eye movement measures in reading in a large, linguistically diverse sample of non-native English readers. We find many similar qualitative effects as in L1 readers, but crucially also a proficiency-sensitive “lexicon-context tradeoff”. The most proficient L2 readers’ eye movements approach an L1 pattern, but as L2 proficiency diminishes, readers’ eye movements become less sensitive to a word’s predictability in context and more sensitive to word frequency, which is context-invariant. This tradeoff supports a rational, experience-dependent account of how context-driven expectations are deployed in L2 language processing.
more » « less
Full Text Available
A Cross-Linguistic Pressure for Uniform Information Density in Word Order

https://doi.org/10.1162/tacl_a_00589

Clark, Thomas Hikaru; Meister, Clara; Pimentel, Tiago; Hahn, Michael; Cotterell, Ryan; Futrell, Richard; Levy, Roger (January 2023, Transactions of the Association for Computational Linguistics)

Abstract While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: The uniform information density (UID) hypothesis, which holds that information should be spread evenly throughout an utterance. Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically. To this end, we use computational models to test whether real orders lead to greater information uniformity than counterfactual orders. In our empirical study of 10 typologically diverse languages, we find that: (i) among SVO languages, real word orders consistently have greater uniformity than reverse word orders, and (ii) only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders. These findings are compatible with a pressure for information uniformity in the development and usage of natural languages.1
more » « less
Full Text Available
Testing the Predictions of Surprisal Theory in 11 Languages

https://doi.org/10.1162/tacl_a_00612

Wilcox, Ethan G.; Pimentel, Tiago; Meister, Clara; Cotterell, Ryan; Levy, Roger P. (January 2023, Transactions of the Association for Computational Linguistics)

Abstract Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times, (ii) whether expected surprisal, i.e., contextual entropy, is predictive of reading times, and (iii) whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to date between information theory and incremental language processing across languages.
more » « less
Full Text Available
Large-scale evidence for logarithmic effects of word predictability on reading time

https://doi.org/10.1073/pnas.2307876121

Shain, Cory; Meister, Clara; Pimentel, Tiago; Cotterell, Ryan; Levy, Roger (March 2024, Proceedings of the National Academy of Sciences)

Full Text Available
Image-conditioned human language comprehension and psychometric benchmarking of visual language models

https://doi.org/10.18653/v1/2024.conll-1.34

Pushpita, Subha Nawer; Levy, Roger P (January 2024, Association for Computational Linguistics)

Full Text Available
How adults understand what young children say

https://doi.org/10.1038/s41562-023-01698-3

Meylan, Stephan C.; Foushee, Ruthe; Wong, Nicole H.; Bergelson, Elika; Levy, Roger P. (December 2023, Nature Human Behaviour)

Full Text Available
The effect of context on noisy-channel sentence comprehension

https://doi.org/10.1016/j.cognition.2023.105503

Chen, Sihan; Nathaniel, Sarah; Ryskin, Rachel; Gibson, Edward (September 2023, Cognition)

Full Text Available
It is not what you say but how you say it: Evidence from Russian shows robust effects of the structural prior on noisy channel inferences.

https://doi.org/10.1037/xlm0001244

Poliak, Moshe; Ryskin, Rachel; Braginsky, Mika; Gibson, Edward (May 2023, Journal of Experimental Psychology: Learning, Memory, and Cognition)

Full Text Available
Using Computational Models to Test Syntactic Learnability

https://doi.org/10.1162/ling_a_00491

Wilcox, Ethan Gotlieb; Futrell, Richard; Levy, Roger (April 2023, Linguistic Inquiry)

We studied the learnability of English filler-gap dependencies and the “island” constraints on them by assessing the generalizations made by autoregressive (incremental) language models that use deep learning to predict the next word given preceding context. Using factorial tests inspired by experimental psycholinguistics, we found that models acquire not only the basic contingency between fillers and gaps, but also the unboundedness and hierarchical constraints implicated in the dependency. We evaluated a model’s acquisition of island constraints by demonstrating that its expectation for a filler-gap contingency is attenuated within an island environment. Our results provide empirical evidence against the argument from the poverty of the stimulus for this particular structure.
more » « less
Full Text Available

« Prev Next »

Search for: All records