skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: CASM: A Deep-Learning Approach for Identifying Collective Action Events with Text and Image Data from Social Media
Protest event analysis is an important method for the study of collective action and social movements and typically draws on traditional media reports as the data source. We introduce collective action from social media (CASM)—a system that uses convolutional neural networks on image data and recurrent neural networks with long short-term memory on text data in a two-stage classifier to identify social media posts about offline collective action. We implement CASM on Chinese social media data and identify more than 100,000 collective action events from 2010 to 2017 (CASM-China). We evaluate the performance of CASM through cross-validation, out-of-sample validation, and comparisons with other protest data sets. We assess the effect of online censorship and find it does not substantially limit our identification of events. Compared to other protest data sets, CASM-China identifies relatively more rural, land-related protests and relatively few collective action events related to ethnic and religious conflict.  more » « less
Award ID(s):
1831481
PAR ID:
10182271
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Sociological Methodology
Volume:
49
Issue:
1
ISSN:
0081-1750
Page Range / eLocation ID:
1 to 57
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Protest is a collective action problem and can be modeled as a coordination game in which people take an action with the potential to achieve shared mutual benefits. In game-theoretic contexts, successful coordination requires that people know each others' willingness to participate, and that this information is common knowledge among a sufficient number of people. We develop an agent-based model of collective action that was the first to combine social structure and individual incentives. Another novel aspect of the model is that a social network increases in density (i.e., new graph edges are formed) over time. The model studies the formation of common knowledge through local interactions and the characterizing social network structures. We use four real-world, data-mined social networks (Facebook, Wikipedia, email, and peer-to-peer networks) and one scale-free network, and conduct computational experiments to study contagion dynamics under different conditions. 
    more » « less
  2. Bae, K-H; Feng, B; Kim, S; Lazarova-Molnar, S; Zheng, Z; Roeder, T; Thiesing, R. (Ed.)
    Protest is a collective action problem and can be modeled as a coordination game in which people take an action with the potential to achieve shared mutual benefits. In game-theoretic contexts, successful coordination requires that people know each others’ willingness to participate, and that this information is common knowledge among a sufficient number of people. We develop an agent-based model of collective action that was the first to combine social structure and individual incentives. Another novel aspect of the model is that a social network increases in density (i.e., new graph edges are formed) over time. The model studies the formation of common knowledge through local interactions and the characterizing social network structures. We use four real-world, data-mined social networks (Facebook, Wikipedia, email, and peer-to-peer networks) and one scale-free network, and conduct computational experiments to study contagion dynamics under different conditions. 
    more » « less
  3. This paper introduces the Multimodal Chile & Venezuela Protest Event Dataset (MMCHIVED). MMCHIVED contains city-day event data using a new source of data, text and images shared on social media. These data enables the improved measurement of theoretically important variables such as protest size, protester and state violence, protester demographics, and emotions. In Venezuela, MMCHIVED records many more protests than existing datasets. In Chile, it records slightly more events than the Armed Conflict Location and Events Dataset (ACLED). These extra events are from small cities far from Caracas and Santiago, an improvement of coverage over datasets that rely on newspapers, and the paper confirms they are true positives. While MMCHIVED covers protest events in Chile and Venezuela, the approach used in the paper is generalizable and could generate protest event data in 107 countries containing 97.14% of global GDP and 82.7% of the world's population. 
    more » « less
  4. null (Ed.)
    Web-based interactions allow agents to coordinate and to take actions (change state) jointly, i.e., to participate in collective action such as a protest, facilitating spread of contagion to large groups within networked populations. In game theoretic contexts, coordination requires that agents share common knowledge about each other. Common knowledge emerges within a group when each member knows the states and the types (preferences) of the other members, and critically, each member knows that everyone else has this information. Hence, these models of common knowledge and coordination on communication networks are fundamentally different from influence-based unilateral contagion models, such as those devised by Granovetter and Centola. Common knowledge arises in many settings in practice, yet there are few operational models that can be used to compute contagion dynamics. Moreover, these models utilize different mechanisms for driving contagion. We evaluate the three mechanisms of a common knowledge model that can represent web-based communication among groups of people on Facebook. We evaluate these mechanisms on five social (media) networks with wide-ranging properties. We demonstrate that different mechanisms can produce widely varying behaviors in terms of the extent of contagion spreading and the speed of contagion transmission. 
    more » « less
  5. Abstract We analyze social media activity during one of the largest protest mobilizations in US history to examine ideological asymmetries in the posting of news content. Using an unprecedented combination of four datasets (tracking offline protests, social media activity, web browsing, and the reliability of news sources), we show that there is no evidence of unreliable sources having any prominent visibility during the protest period, but we do identify asymmetries in the ideological slant of the sources shared on social media, with a clear bias towards right-leaning domains. These results support the “amplification of the right” thesis, which points to the structural conditions (social and technological) that lead to higher visibility of content with a partisan bent towards the right. Our findings provide evidence that right-leaning sources gain more visibility on social media and reveal that ideological asymmetries manifest themselves even in the context of movements with progressive goals. 
    more » « less