NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Do *they* mean ‘us’? Interpreting Referring Expression variation under Intergroup Bias

https://doi.org/10.18653/v1/2024.findings-emnlp.571

Govindarajan, Venkata S; Zang, Matianyu; Mahowald, Kyle; Beaver, David; Li, Junyi Jessy (November 2024, Findings of the Association for Computational Linguistics: EMNLP 2024, Association for Computational Linguistics)

The variations between in-group and out-group speech (intergroup bias) are subtle and could underlie many social phenomena like stereotype perpetuation and implicit bias. In this paper, we model intergroup bias as a tagging task on English sports comments from forums dedicated to fandom for NFL teams. We curate a dataset of over 6 million game-time comments from opposing perspectives (the teams in the game), each comment grounded in a non-linguistic description of the events that precipitated these comments (live win probabilities for each team). Expert and crowd annotations justify modeling the bias through tagging of implicit and explicit referring expressions and reveal the rich, contextual understanding of language and the world required for this task. For large-scale analysis of intergroup variation, we use LLMs for automated tagging, and discover that LLMs occasionally perform better when prompted with linguistic descriptions of the win probability at the time of the comment, rather than numerical probability. Further, large-scale tagging of comments using LLMs uncovers linear variations in the form of referent across win probabilities that distinguish in-group and out-group utterances.
more » « less
Full Text Available
Counterfactual Probing for the Influence of Affect and Specificity on Intergroup Bias

Govindarajan, Venkata Subrahmanyan; Beaver, David; Mahowald, Kyle; Li, Junyi Jessy (July 2023, Findings of the Association for Computational Linguistics: ACL 2023)

While existing work on studying bias in NLP focuses on negative or pejorative language use, Govindarajan et al. (2023) offer a revised framing of bias in terms of intergroup social context, and its effects on language behavior. In this paper, we investigate if two pragmatic features (specificity and affect) systematically vary in different intergroup contexts — thus connecting this new framing of bias to language output. Preliminary analysis finds modest correlations between specificity and affect of tweets with supervised intergroup relationship (IGR) labels. Counterfactual probing further reveals that while neural models finetuned for predicting IGR reliably use affect in classification, the model’s usage of specificity is inconclusive.
more » « less
Full Text Available
How people talk about each other: Modeling Generalized Intergroup Bias and Emotion

Govindarajan, Venkata Subrahmanyan; Atwell, Katherine; Sinno, Barea; Alikhani, Malihe; Beaver, David I.; Li, Junyi Jessy (May 2023, Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics)

Current studies of bias in NLP rely mainly on identifying (unwanted or negative) bias towards a specific demographic group. While this has led to progress recognizing and mitigating negative bias, and having a clear notion of the targeted group is necessary, it is not always practical. In this work we extrapolate to a broader notion of bias, rooted in social science and psychology literature. We move towards predicting interpersonal group relationship (IGR) - modeling the relationship between the speaker and the target in an utterance - using fine-grained interpersonal emotions as an anchor. We build and release a dataset of English tweets by US Congress members annotated for interpersonal emotion - the first of its kind, and ‘found supervision’ for IGR labels; our analyses show that subtle emotional signals are indicative of different biases. While humans can perform better than chance at identifying IGR given an utterance, we show that neural models perform much better; furthermore, a shared encoding between IGR and interpersonal perceived emotion enabled performance gains in both tasks.
more » « less
Full Text Available
Reproducible research in linguistics: A position statement on data citation and attribution in our field

https://doi.org/10.1515/ling-2017-0032

Berez-Kroeker, Andrea L.; Gawne, Lauren; Kung, Susan Smythe; Kelly, Barbara F.; Heston, Tyler; Holton, Gary; Pulsifer, Peter; Beaver, David I.; Chelliah, Shobhana; Dubinsky, Stanley; et al (January 2018, Linguistics)

Abstract This paper is a position statement on reproducible research in linguistics, including data citation and attribution, that represents the collective views of some 41 colleagues. Reproducibility can play a key role in increasing verification and accountability in linguistic research, and is a hallmark of social science research that is currently under-represented in our field. We believe that we need to take time as a discipline to clearly articulate our expectations for how linguistic data are managed, cited, and maintained for long-term access.
more » « less
Full Text Available

Search for: All records