NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Using Authorship Verification to Mitigate Abuse in Online Communities

Weerasinghe, J.; Singh, R.; Greenstadt, R. (May 2022, Proceedings of the International AAAI Conference on Weblogs and Social Media)

Social media has become an important method for information sharing. This has also created opportunities for bad actors to easily spread disinformation and manipulate public opinion. This paper explores the possibility of applying Authorship Verification on online communities to mitigate abuse by analyzing the writing style of online accounts to identify accounts managed by the same person. We expand on our similarity-based authorship verification approach, previously applied on large fanfictions, and show that it works in open-world settings, shorter documents, and is largely topic-agnostic. Our expanded model can link Reddit accounts based on the writing style of only 40 comments with an AUC of 0.95, and the performance increases to 0.98 given more content. We apply this model on a set of suspicious Reddit accounts associated with the disinformation campaign surrounding the 2016 U.S. presidential election and show that the writing style of these accounts are inconsistent, indicating that each account was likely maintained by multiple individuals. We also apply this model to Reddit user accounts that commented on the WallStreetBets subreddit around the 2021 GameStop short squeeze and show that a number of account pairs share very similar writing styles. We also show that this approach can link accounts across Reddit and Twitter with an AUC of 0.91 even when training data is very limited.
more » « less
Full Text Available
Conspiracy Brokers: Understanding the Monetization of YouTube Conspiracy Theories

https://doi.org/10.1145/3485447.3512142

Ballard, Cameron; Goldstein, Ian; Mehta, Pulak; Smothers, Genesis; Take, Kejsi; Zhong, Victoria; Greenstadt, Rachel; Lauinger, Tobias; McCoy, Damon (April 2022, WWW '22: Proceedings of the ACM Web Conference 2022)

Full Text Available
Understanding engagement with U.S. (mis)information news sources on Facebook

https://doi.org/10.1145/3487552.3487859

Edelson, Laura; Nguyen, Minh-Kha; Goldstein, Ian; Goga, Oana; McCoy, Damon; Lauinger, Tobias (November 2021, n ACM Internet Measurement Conference (IMC ’21))

Full Text Available
Understanding Incentivized Mobile App Installs on Google Play Store

https://doi.org/10.1145/3419394.3423662

Farooqi, Shehroze; Feal, Álvaro; Lauinger, Tobias; McCoy, Damon; Shafiq, Zubair; Vallina-Rodriguez, Narseo (October 2020, Proceedings of the ACM Internet Measurement Conference)
null (Ed.)
Full Text Available
A Security Analysis of the Facebook Ad Library

https://doi.org/10.1109/SP40000.2020.00084

Edelson, Laura; Lauinger, Tobias; McCoy, Damon (May 2020, 2020 IEEE Symposium on Security and Privacy (SP))

Full Text Available
The Pod People: Understanding Manipulation of Social Media Popularity via Reciprocity Abuse

https://doi.org/10.1145/3366423.3380256

Weerasinghe, Janith; Flanigan, Bailey; Stein, Aviel; McCoy, Damon; Greenstadt, Rachel (April 2020, WWW '20: Proceedings of The Web Conference)

Full Text Available
Are Anonymity-Seekers Just like Everybody Else? An Analysis of Contributions to Wikipedia from Tor

https://doi.org/10.1109/40000.2020.00053

Tran, Chau; Champion, Kaylea; Forte, Andrea; Hill, Benjamin Mako; Greenstadt, Rachel (January 2020, IEEE Symposium on Security and Privacy (SP))

User-generated content sites routinely block contributions from users of privacy-enhancing proxies like Tor because of a perception that proxies are a source of vandalism, spam, and abuse. Although these blocks might be effective, collateral damage in the form of unrealized valuable contributions from anonymity seekers is invisible. One of the largest and most important user-generated content sites, Wikipedia, has attempted to block contributions from Tor users since as early as 2005. We demonstrate that these blocks have been imperfect and that thousands of attempts to edit on Wikipedia through Tor have been successful. We draw upon several data sources and analytical techniques to measure and describe the history of Tor editing on Wikipedia over time and to compare contributions from Tor users to those from other groups of Wikipedia users. Our analysis suggests that although Tor users who slip through Wikipedia's ban contribute content that is more likely to be reverted and to revert others, their contributions are otherwise similar in quality to those from other unregistered participants and to the initial contributions of registered users.
more » « less
Full Text Available
Feature Vector Difference based Neural Network and Logistic Regression Models for Authorship Verification

Weerasinghe, Janith; Greenstadt, Rachel (January 2020, CEUR workshop proceedings)
Cappellato, Linda; Eickhoff, Carsten; Ferro, Nicola; Névéol, Aurélie (Ed.)
This paper describes the approach we took to create a machine learning model for the PAN 2020 Authorship Verification Task. For each document pair, we extracted stylometric features from the documents and used the absolute difference between the feature vectors as input to our classifier. We created two models: a Logistic Regression Model trained on a small dataset, and a Neural Network based model trained on the large dataset. These models achieved AUCs of 0.939 and 0.953 on the small and large datasets, making them the second-best models on both datasets submitted to the shared task.
more » « less
Full Text Available

Search for: All records