Detecting Media Self-Censorship without Explicit Training Data

Tao, Rongrong; Zhou, Baojian; Chen, Feng; Mares, David; Butler, Patrick; Ramakrishnan, Naren; Kennedy, Ryan

doi:10.1137/1.9781611976236.62

The motives and means of explicit state censorship have been well studied, both quantitatively and qualitatively. Self-censorship by media outlets, however, has received far less attention, mostly because it is difficult to detect systematically. We develop a novel approach that identifies news media self-censorship by using social media as a sensor. The approach combines a hypothesis testing framework for identifying and evaluating censored clusters of keywords with a near-linear-time algorithm, GraphDPD, that finds the highest-scoring clusters as indicators of censorship. We evaluate the accuracy of our framework against other state-of-the-art algorithms using both semi-synthetic and real-world data from Mexico and Venezuela during 2014. These tests demonstrate the capacity of our framework to identify self-censorship and to provide an indicator of broader media freedom. The results of this study lay the foundation for the detection and study of self-censorship and for policy responses to it.
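
The idea of using social media as a sensor can be illustrated with a toy sketch. This is not GraphDPD and not the paper's test statistic; the function names, the difference-of-ratios score, and the sample counts are all assumptions made for illustration. It ranks keywords by how much their social media volume rises inside a time window while their news volume falls.

```python
from statistics import mean

EPS = 1e-9  # avoids division by zero when a baseline is empty of mentions


def censorship_score(social, news, window):
    """Toy score (hypothetical, not the paper's statistic).

    social, news: per-day mention counts for one keyword.
    window: (start, end) day indices, end exclusive.
    Returns the social media lift minus the news lift inside the window,
    each measured relative to the days outside the window.
    """
    start, end = window
    inside = range(start, end)
    outside = [t for t in range(len(social)) if t < start or t >= end]
    social_lift = mean(social[t] for t in inside) / (mean(social[t] for t in outside) + EPS)
    news_lift = mean(news[t] for t in inside) / (mean(news[t] for t in outside) + EPS)
    return social_lift - news_lift


def rank_keywords(counts, window):
    """Sort keywords from most to least suspicious under the toy score."""
    scores = {kw: censorship_score(s, n, window) for kw, (s, n) in counts.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Made-up counts: "protest" spikes on social media while news coverage drops.
counts = {
    "protest": ([10, 10, 40, 40, 10], [5, 5, 1, 1, 5]),
    "football": ([10, 10, 10, 10, 10], [5, 5, 5, 5, 5]),
}
ranking = rank_keywords(counts, window=(2, 4))
```

A per-keyword score like this ignores the cluster structure that the abstract emphasizes; the paper's framework scores connected clusters of keywords rather than isolated terms.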

Award ID(s): 1954376; 1750911

PAR ID: 10223465

Author(s) / Creator(s): Tao, Rongrong; Zhou, Baojian; Chen, Feng; Mares, David; Butler, Patrick; Ramakrishnan, Naren; Kennedy, Ryan

Editor(s): Demeniconi, Carlotta; Chawla, Nitesh V.

Date Published: 2020-07-20

Journal Name: Proceedings of the 2020 SIAM International Conference on Data Mining

Page Range / eLocation ID: 550-558

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text: Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1137/1.9781611976236.62