We present a game-theoretic model of pragmatics that we call ReCo (for Regularized Conventions). This model formulates pragmatic communication as a game in which players are rewarded for communicating successfully and penalized for deviating from a shared, “default” semantics. As a result, players assign utterances context-dependent meanings that jointly optimize communicative success and naturalness with respect to speakers’ and listeners’ background knowledge of language. By using established game-theoretic tools to compute equilibrium strategies for this game, we obtain principled pragmatic language generation procedures with formal guarantees of communicative success. Across several datasets capturing real and idealized human judgments about pragmatic implicature, ReCo matches, or slightly improves upon, predictions made by Iterated Best Response and Rational Speech Acts models of language understanding.
more »
« less
Respect the code: Speakers expect novel conventions to generalize within but not across social group boundaries
Speakers use different language to communicate with partners in different communities. But how do we learn and represent which conventions to use with which partners? In this paper, we argue that solving this challenging computational problem requires speakers to supplement their lexical representations with knowledge of social group structure. We formalize this idea by extending a recent hierarchical Bayesian model of convention formation with an intermediate layer explicitly representing the latent communities each partner belongs to, and derive predictions about how conventions formed within a group ought to extend to new in-group and out-group members. We then present evidence from two behavioral experiments testing these predictions using a minimal group paradigm. Taken together, our findings provide a first step toward a formal framework for understanding the interplay between language use and social group knowledge.
more »
« less
- Award ID(s):
- 1911835
- PAR ID:
- 10285683
- Date Published:
- Journal Name:
- Proceedings of the Annual Conference of the Cognitive Science Society
- ISSN:
- 1069-7977
- Page Range / eLocation ID:
- 2232-2238
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The study of language shift, the replacement of one language by another in a community, or subgroup of a speech community, is a prime topic for sociolinguistic analysis: shift is almost always the result of social factors. This paper argues for focusing research on the study of shift in process and, to that end, studying the different kinds of speakers in shifting communities. The prevalent response to massive, global language shift by linguists is language documentation. Although the need for documentation is clear, there have been inadvertent consequences: valorizing last speakers, promoting linguistic purism, and devaluing L2 language learners who, in many communities, represent the future of the language. The urgency of documenting and describing languages with relatively small numbers of elderly speakers has led the linguistic community to focus almost exclusively on such groups and ignore both larger speech communities in earlier stages of shift, and overlook the wide range of speaker types in shift communities. From a social standpoint, the result is that we are often failing to do the language work in precisely those communities where reversing language shift is still relatively easy. From a scientific standpoint, we are missing the opportunity to study language change in process, and missing the chance to study speaker variation in a shift situation. Variation in proficiency and performance across shifting speakers is not random but systematic and correlates with a set of social and cognitive factors.more » « less
-
This paper problematizes the assessment of speakers’ proficiency in endangered language communities. We focus in particular on processes of lexical production and elicitation as proxies for full proficiency assessment. Among linguists, it is standard to assess a speaker’s knowledge of specific lexical items in order to set a baseline for further data collection and research. Yet, as we argue in this paper, such tests can give the false impression that speakers do not know their language, since such tests do not distinguish between what speakers can recall in a particular moment and what they do not know because they did not acquire it. The endangered language context in particular calls for a more fine-tuned interpretation of lexical knowledge, given the high degree of idiolectal variation and lack of a community-based standard language. Drawing on fieldwork with Chukchi and Even Indigenous communities in northeastern Russia, we analyze lexical items that speakers claim to not remember. We then distinguish different reasons that are given for not remembering and consider their implications for speakers’ proficiency. Finally, we conclude with two recommendations for improving elicitation and language assessment tests.more » « less
-
Communication tools such as email facilitate communication and collaboration between speakers of different languages, who use two primary strategies—English as a common language and machine translation (MT) tools—to help them overcome language barriers. However, each of these communication strategies creates its own challenges for cross-lingual communication. In this paper, we compare how people’s interpretations of an email sender’s social intention, and their evaluation of the email and the senders, differ when using a common language versus MT in email communication. We conducted an online experiment in which monolingual native English speakers read and rated request emails written by native English speakers, emails written by bilingual Chinese speakers in English, and emails written in Chinese then machine-translated into English. We found that participants interpreted the social intentions of the email sender less accurately for machine-translated emails than for emails written by non-native speakers in English. Participants also rated the senders and emails less positively overall for machine-translated emails compared to emails written by non-native speakers in English. Based on these findings, we suggest design possibilities that could better aid multilingual communication.more » « less
-
The study of language variation examines how language varies between and within different groups of speakers, shedding light on how we use language to construct identities and how social contexts affect language use. A common method is to identify instances of a certain linguistic feature - say, the zero copula construction - in a corpus, and analyze the feature’s distribution across speakers, topics, and other variables, to either gain a qualitative understanding of the feature’s function or systematically measure variation. In this paper, we explore the challenging task of automatic morphosyntactic feature detection in low-resource English varieties. We present a human-in-the-loop approach to generate and filter effective contrast sets via corpus-guided edits. We show that our approach improves feature detection for both Indian English and African American English, demonstrate how it can assist linguistic research, and release our fine-tuned models for use by other researchers.more » « less
An official website of the United States government

