skip to main content


Title: News and Geolocated Social Media Accurately Measure Protest Size Variation
Larger protests are more likely to lead to policy changes than small ones are, but whether or not attendance estimates provided in news or generated from social media are biased is an open question. This letter closes the question: news and geolocated social media data generate accurate estimates of protest size variation. This claim is substantiated using cellphone location data from more than 10 million individuals during the 2017 United States Women’s March protests. These cellphone estimates correlate strongly with those provided in news media as well as three size estimates generated using geolocated tweets, one text-based and two based on images. Inferences about protest attendance from these estimates match others’ findings about the Women’s March.  more » « less
Award ID(s):
1831848
NSF-PAR ID:
10299070
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
American Political Science Review
Volume:
114
Issue:
4
ISSN:
0003-0554
Page Range / eLocation ID:
1343 to 1351
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    The effect that different police protest management methods have on protesters’ physical and mental trauma is still not well understood and is a matter of debate. In this paper, we take a two-pronged approach to gain insight into this issue. First, we perform statistical analysis on time series data of protests provided by ACLED and spanning the period of time from January 1, 2020, until March 13, 2021. After observing the data, it becomes apparent that employing kinetic impact projectiles is correlated with an increase in protests in the following days. Moreover, it serves as a more accurate indicator of the subsequent death toll compared to the mere number of protests. This leads to the conclusion that the utilization of less-lethal weapons appears to provoke rather than quell protests, exhibiting an inflammatory effect. Next, we provide a mathematical framework to model modern, but well-established social psychology research on compliance theory and crowd dynamics. Our results show that understanding the heterogeneity of the crowd is key for protests that lead to a reduction of social tension and minimization of physical and mental trauma in protesters.

     
    more » « less
  2. We call for a relational approach to constructing protest event data from news sources to provide tools for detecting and correcting errors and for capturing the relations among events and between events and the texts describing them. We address two problems with most protest event datasets: (1) inconsistencies and errors in identifying events and (2) disconnect between data structures and what is known about how protests and media accounts of protests are produced. Relational data structures can capture the theoretically important structuring of events into campaigns and episodes and media attention cascades and cycles. Relational data structures support richer theorizing about the interplay of protests and their representations in news media discourses. We present preliminary illustrative data about Black protests from these new procedures to demonstrate the value of this approach. 
    more » « less
  3. This paper introduces the Multimodal Chile & Venezuela Protest Event Dataset (MMCHIVED). MMCHIVED contains city-day event data using a new source of data, text and images shared on social media. These data enables the improved measurement of theoretically important variables such as protest size, protester and state violence, protester demographics, and emotions. In Venezuela, MMCHIVED records many more protests than existing datasets. In Chile, it records slightly more events than the Armed Conflict Location and Events Dataset (ACLED). These extra events are from small cities far from Caracas and Santiago, an improvement of coverage over datasets that rely on newspapers, and the paper confirms they are true positives. While MMCHIVED covers protest events in Chile and Venezuela, the approach used in the paper is generalizable and could generate protest event data in 107 countries containing 97.14% of global GDP and 82.7% of the world's population. 
    more » « less
  4. Abstract

    We analyze social media activity during one of the largest protest mobilizations in US history to examine ideological asymmetries in the posting of news content. Using an unprecedented combination of four datasets (tracking offline protests, social media activity, web browsing, and the reliability of news sources), we show that there is no evidence of unreliable sources having any prominent visibility during the protest period, but we do identify asymmetries in the ideological slant of the sources shared on social media, with a clear bias towards right-leaning domains. These results support the “amplification of the right” thesis, which points to the structural conditions (social and technological) that lead to higher visibility of content with a partisan bent towards the right. Our findings provide evidence that right-leaning sources gain more visibility on social media and reveal that ideological asymmetries manifest themselves even in the context of movements with progressive goals.

     
    more » « less
  5. Protest event analysis is an important method for the study of collective action and social movements and typically draws on traditional media reports as the data source. We introduce collective action from social media (CASM)—a system that uses convolutional neural networks on image data and recurrent neural networks with long short-term memory on text data in a two-stage classifier to identify social media posts about offline collective action. We implement CASM on Chinese social media data and identify more than 100,000 collective action events from 2010 to 2017 (CASM-China). We evaluate the performance of CASM through cross-validation, out-of-sample validation, and comparisons with other protest data sets. We assess the effect of online censorship and find it does not substantially limit our identification of events. Compared to other protest data sets, CASM-China identifies relatively more rural, land-related protests and relatively few collective action events related to ethnic and religious conflict. 
    more » « less