NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The unappreciated role of intent in algorithmic moderation of abusive content on social media

https://doi.org/10.37016/mr-2020-180

Wang, Xinyu; Koneru, Sai; Venkit, Pranav Narayanan; Frischmann, Brett; Rajtmajer, Sarah (July 2025, Harvard Kennedy School Misinformation Review)

A significant body of research is dedicated to developing language models that can detect various types of online abuse, for example, hate speech, cyberbullying. However, there is a disconnect between platform policies, which often consider the author's intention as a criterion for content moderation, and the current capabilities of detection models, which typically lack efforts to capture intent. This paper examines the role of intent in the moderation of abusive content. Specifically, we review state-of-the-art detection models and benchmark training datasets to assess their ability to capture intent. We propose changes to the design and development of automated detection and moderation systems to improve alignment with ethical and policy conceptualizations of these abuses.
more » « less
Free, publicly-accessible full text available July 29, 2026
Toward Context-Aware Privacy Enhancing Technologies for Online Self-Disclosure

https://doi.org/10.1609/hcomp.v12i1.31599

Du, Tingting; Kim, Jiyoon; Squicciarini, Anna; Rajtmajer, Sarah (October 2024, Proceedings of the AAAI Conference on Human Computation and Crowdsourcing)

Voluntary sharing of personal information is at the heart of user engagement on social media and central to platforms' business models. From the users' perspective, so-called self-disclosure is closely connected with both privacy risks and social rewards. Prior work has studied contextual influences on self-disclosure, from platform affordances and interface design to user demographics and perceived social capital. Our work takes a mixed-methods approach to understand the contextual information which might be integrated in the development of privacy-enhancing technologies. Through observational study of several Reddit communities, we explore the ways in which topic of discussion, group norms, peer effects, and audience size are correlated with personal information sharing. We then build and test a prototype privacy-enhancing tool that exposes these contextual factors. Our work culminates in a browser extension that automatically detects instances of self-disclosure in Reddit posts at the time of posting and provides additional context to users before they post to support enhanced privacy decision-making. We share this prototype with social media users, solicit their feedback, and outline a path forward for privacy-enhancing technologies in this space.
more » « less
Full Text Available
Have LLMs Reopened the Pandora’s Box of AI-Generated Fake News?

https://doi.org/10.18653/v1/2025.naacl-long.142

Wang, Xinyu; Zhang, Wenbo; Koneru, Sai; Guo, Hangzhi; Mingole, Bonam; Sundar, S Shyam; Rajtmajer, Sarah; Yadav, Amulya (January 2025, Association for Computational Linguistics)

Full Text Available
Inside the echo chamber: Linguistic underpinnings of misinformation on Twitter

https://doi.org/10.1145/3614419.3644009

Wang, Xinyu; Li, Jiayi; Rajtmajer, Sarah (May 2024, ACM)

Full Text Available
Online Self-Disclosure, Social Support, and User Engagement During the COVID-19 Pandemic

https://doi.org/10.1145/3617654

Lee, Jooyoung; Rajtmajer, Sarah; Srivatsavaya, Eesha; Wilson, Shomir (December 2023, ACM Transactions on Social Computing)

We investigate relationships between online self-disclosure and received social support and user engagement during the COVID-19 crisis. We crawl a total of 2,399 posts and 29,851 associated comments from the r/COVID19_support subreddit and manually extract fine-grained personal information categories and types of social support sought from each post. We develop a BERT-based ensemble classifier to automatically identify types of support offered in users’ comments. We then analyze the effect of personal information sharing and posts’ topical, lexical, and sentiment markers on the acquisition of support and five interaction measures (submission scores, the number of comments, the number of unique commenters, the length and sentiments of comments). Our findings show that: (1) users were more likely to share their age, education, and location information when seeking both informational and emotional support as opposed to pursuing either one; (2) while personal information sharing was positively correlated with receiving informational support when requested, it did not correlate with emotional support; (3) as the degree of self-disclosure increased, information support seekers obtained higher submission scores and longer comments, whereas emotional support seekers’ self-disclosure resulted in lower submission scores, fewer comments, and fewer unique commenters; and (4) post characteristics affecting audience response differed significantly based on types of support sought by post authors. These results provide empirical evidence for the varying effects of self-disclosure on acquiring desired support and user involvement online during the COVID-19 pandemic. Furthermore, this work can assist support seekers hoping to enhance and prioritize specific types of social support and user engagement.
more » « less
Full Text Available
Information Operations in Turkey: Manufacturing Resilience with Free Twitter Accounts

https://doi.org/10.1609/icwsm.v17i1.22175

Merhi, Maya; Rajtmajer, Sarah; Lee, Dongwon (June 2023, Proceedings of the International AAAI Conference on Web and Social Media)

Following the 2016 US elections Twitter launched their Information Operations (IO) hub where they archive account activity connected to state linked information operations. In June 2020, Twitter took down and released a set of accounts linked to Turkey's ruling political party (AKP). We investigate these accounts in the aftermath of the takedown to explore whether AKP-linked operations are ongoing and to understand the strategies they use to remain resilient to disruption. We collect live accounts that appear to be part of the same network, ~30% of which have been suspended by Twitter since our collection. We create a BERT-based classifier that shows similarity between these two networks, develop a taxonomy to categorize these accounts, find direct sequel accounts between the Turkish takedown and the live accounts, and find evidence that Turkish IO actors deliberately construct their network to withstand large-scale shutdown by utilizing explicit and implicit signals of coordination. We compare our findings from the Turkish operation to Russian and Chinese IO on Twitter and find that Turkey's IO utilizes a unique group structure to remain resilient. Our work highlights the fundamental imbalance between IO actors quickly and easily creating free accounts and the social media platforms spending significant resources on detection and removal, and contributes novel findings about Turkish IO on Twitter.
more » « less
Full Text Available
Content Sharing Design for Social Welfare in Networked Disclosure Game

Jia, Feiran; Qiu, Chenxi; Rajtmajer, Sarah; Squicciarini, Anna (July 2023, In Uncertainty in Artificial Intelligence, Proceedings of Machine Learning Research)

This work models the costs and benefits of per- sonal information sharing, or self-disclosure, in online social networks as a networked disclosure game. In a networked population where edges rep- resent visibility amongst users, we assume a leader can influence network structure through content promotion, and we seek to optimize social wel- fare through network design. Our approach con- siders user interaction non-homogeneously, where pairwise engagement amongst users can involve or not involve sharing personal information. We prove that this problem is NP-hard. As a solution, we develop a Mixed-integer Linear Programming algorithm, which can achieve an exact solution, and also develop a time-efficient heuristic algo- rithm that can be used at scale. We conduct nu- merical experiments to demonstrate the properties of the algorithms and map theoretical results to a dataset of posts and comments in 2020 and 2021 in a COVID-related Subreddit community where privacy risks and sharing tradeoffs were particularly pronounced.
more » « less
The Contribution of Verified Accounts to Self-Disclosure in COVID-Related Twitter Conversations

https://doi.org/10.1609/icwsm.v16i1.19394

Du, Tingting; Umar, Prasanna; Rajtmajer, Sarah; Squicciarini, Anna (June 2022, Proceedings of the International AAAI Conference on Web and Social Media)

On Twitter, so-called verified accounts represent celebrities and organizations of public interest, selected by Twitter based on criteria for both activity and notability. Our work seeks to understand the involvement and influence of these accounts in patterns of self-disclosure, namely, voluntary sharing of personal information. In a study of 3 million COVID-19 related tweets, we present a comparison of self-disclosure in verified vs ordinary users. We discuss evidence of peer effects on self-disclosing behaviors and analyze topics of conversation associated with these practices.
more » « less
Full Text Available
A study of self-disclosure during the Coronavirus pandemic

https://doi.org/10.5210/fm.v26i7.11555

Blose, Taylor; Umar, Prasanna; Squicciarini, Anna; Rajtmajer, Sarah (July 2021, First Monday)

We study observed incidence of self-disclosure in a large set of tweets representing user-led English-language conversation about the Coronavirus pandemic. Using an unsupervised approach to detect voluntary disclosure of personal information, we provide early evidence that situational factors surrounding the Coronavirus pandemic may impact individuals’ privacy calculus. Text analyses reveal topical shift toward supportiveness and support-seeking in self-disclosing conversation on Twitter. We run a comparable analysis of tweets from Hurricane Harvey to provide context for observed effects and suggest opportunities for further study.
more » « less
Full Text Available
When Smart Systems Fail: The Ethics of Cyber–Physical Critical Infrastructure Risk

https://doi.org/10.1109/TTS.2021.3058605

Grady, Caitlin; Rajtmajer, Sarah; Dennis, Lauren (March 2021, IEEE Transactions on Technology and Society)
null (Ed.)
Full Text Available

Search for: All records