skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: MMCHIVED: Multimodal Chile and Venezuela Protest Event Data
This paper introduces the Multimodal Chile & Venezuela Protest Event Dataset (MMCHIVED). MMCHIVED contains city-day event data using a new source of data, text and images shared on social media. These data enables the improved measurement of theoretically important variables such as protest size, protester and state violence, protester demographics, and emotions. In Venezuela, MMCHIVED records many more protests than existing datasets. In Chile, it records slightly more events than the Armed Conflict Location and Events Dataset (ACLED). These extra events are from small cities far from Caracas and Santiago, an improvement of coverage over datasets that rely on newspapers, and the paper confirms they are true positives. While MMCHIVED covers protest events in Chile and Venezuela, the approach used in the paper is generalizable and could generate protest event data in 107 countries containing 97.14% of global GDP and 82.7% of the world's population.  more » « less
Award ID(s):
1831848
PAR ID:
10410157
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Proceedings of the International AAAI Conference on Web and Social Media
Volume:
16
ISSN:
2162-3449
Page Range / eLocation ID:
1332 to 1341
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. We call for a relational approach to constructing protest event data from news sources to provide tools for detecting and correcting errors and for capturing the relations among events and between events and the texts describing them. We address two problems with most protest event datasets: (1) inconsistencies and errors in identifying events and (2) disconnect between data structures and what is known about how protests and media accounts of protests are produced. Relational data structures can capture the theoretically important structuring of events into campaigns and episodes and media attention cascades and cycles. Relational data structures support richer theorizing about the interplay of protests and their representations in news media discourses. We present preliminary illustrative data about Black protests from these new procedures to demonstrate the value of this approach. 
    more » « less
  2. Protest event analysis is an important method for the study of collective action and social movements and typically draws on traditional media reports as the data source. We introduce collective action from social media (CASM)—a system that uses convolutional neural networks on image data and recurrent neural networks with long short-term memory on text data in a two-stage classifier to identify social media posts about offline collective action. We implement CASM on Chinese social media data and identify more than 100,000 collective action events from 2010 to 2017 (CASM-China). We evaluate the performance of CASM through cross-validation, out-of-sample validation, and comparisons with other protest data sets. We assess the effect of online censorship and find it does not substantially limit our identification of events. Compared to other protest data sets, CASM-China identifies relatively more rural, land-related protests and relatively few collective action events related to ethnic and religious conflict. 
    more » « less
  3. SUMMARY We recently found the original Omori seismograms recorded at Hongo, Tokyo, of the 1922 Atacama, Chile, earthquake (MS = 8.3) in the historical seismogram archive of the Earthquake Research Institute (ERI) of the University of Tokyo. These recordings enable a quantitative investigation of long-period seismic radiation from the 1922 earthquake. We document and provide interpretation of these seismograms together with a few other seismograms from Mizusawa, Japan, Uppsala, Sweden, Strasbourg, France, Zi-ka-wei, China and De Bilt, Netherlands. The 1922 event is of significant historical interest concerning the cause of tsunami, discovery of G wave, and study of various seismic phase and first-motion data. Also, because of its spatial proximity to the 1943, 1995 and 2015 great earthquakes in Chile, the 1922 event provides useful information on similarity and variability of great earthquakes on a subduction-zone boundary. The 1922 source region, having previously ruptured in 1796 and 1819, is considered to have significant seismic hazard. The focus of this paper is to document the 1922 seismograms so that they can be used for further seismological studies on global subduction zones. Since the instrument constants of the Omori seismographs were only incompletely documented, we estimate them using the waveforms of the observed records, a calibration pulse recorded on the seismogram and the waveforms of better calibrated Uppsala Wiechert seismograms. Comparison of the Hongo Omori seismograms with those of the 1995 Antofagasta, Chile, earthquake (Mw = 8.0) and the 2015 Illapel, Chile, earthquake (Mw = 8.3) suggests that the 1922 event is similar to the 1995 and 2015 events in mechanism (i.e. on the plate boundary megathrust) and rupture characteristics (i.e. not a tsunami earthquake) with Mw = 8.6 ± 0.25. However, the initial fine scale rupture process varies significantly from event to event. The G1 and G2, and R1 and R2 of the 1922 event are comparable in amplitude, suggesting a bilateral rupture, which is uncommon for large megathrust earthquakes. 
    more » « less
  4. A common claim about the affluent democracies is that protest is trending, becoming more legitimate and widely used by all political contenders. In the new democracies, protest is seen as having contributed to democratization, but growing apathy has led to protest decline while in authoritarian regimes protest may be spurring more democratization. Assessing these ideas requires comparative trend data covering 15 or more years but constructing such data confronts problems. The major problem is that the most available survey item asks “have you ever joined (lawful) demonstrations,” making it difficult to time when this protest behavior occurred. We advance a novel method for timing these “ever” responses by focusing on young adults (aged 18-23 years), who are likely reporting on participation within the past 5 years. Drawing on the Survey Data Recycling harmonized data set, we use a multilevel model including harmonization and survey quality controls to create predicted probabilities for young adult participation (576 surveys, 119 countries, 1966-2010). Aggregating these to create country-year rate estimates, these compare favorably with overlapping estimates from surveys asking about “the past 5 years or so” and event data from the PolDem project. Harmonization and survey quality controls improve these predicted values. These data provide 15+ years trend estimates for 60 countries, which we use to illustrate the possibilities of estimating comparative protest trends. 
    more » « less
  5. Abstract Understanding a tsunami source and its impact is vital to assess a tsunami hazard. Thanks to the efforts of the tsunami survey teams, high‐quality tsunami run‐up data exist for contemporary events. Still, it has not been widely used to infer a tsunami source and its impact mainly due to the computational burden of the tsunami forward model. In this study, we propose a TRRF‐INV (Tsunami Run‐up Response Function‐based INVersion) model that can provide probabilistic estimates of a near‐field tsunami source and tsunami run‐up distribution from a small number of run‐up records. We tested the TRRF‐INV model with synthetic tsunami scenarios in northern Chile and applied it to the 2014 Iquique, Chile, tsunami event as a case study. The results demonstrated that the TRRF‐INV model can provide a reasonable tsunami source estimate to first order and estimate tsunami run‐up distribution well. Moreover, the case‐study results agree well with the United States Geological Survey report and the global Centroid Moment Tensor solution. We also analyzed the performance of the TRRF‐INV model depending on the number and the uncertainty of run‐up records. We believe that the TRRF‐INV model has the potential for supporting accurate hazard assessment by (1) providing new insights from tsunami run‐up records into the tsunami source and its impact, (2) using the TRRF‐INV model as a tool to support existing tsunami inversion models, and (3) estimating a tsunami source and its impact for ancient events where no data other than estimated run‐up from sediment deposit data exist. 
    more » « less