NSF PAR Search | NSF Public Access Repository


Prediction of People’s Emotional Response towards Multi-modal News

Gao, Ge; Paik, Sejin; Reardon, Carley; Zhao, Yanling; Guo, Lei; Ishwar, Prakash; Betke, Margrit; Wijaya, Derry Tanti (November 2022, Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))

We aim to develop methods for understanding how multimedia news exposure can affect people’s emotional responses, and we especially focus on news content related to gun violence, a very important yet polarizing issue in the U.S. We created the dataset NEmo+ by significantly extending the U.S. gun violence news-to-emotions dataset, BU-NEmo, from 320 to 1,297 news headline and lead image pairings and collecting 38,910 annotations in a large crowdsourcing experiment. In curating the NEmo+ dataset, we developed methods to identify news items that will trigger similar versus divergent emotional responses. For news items that trigger similar emotional responses, we compiled them into the NEmo+-Consensus dataset. We benchmark models on this dataset that predict a person’s dominant emotional response toward the target news item (single-label prediction). On the full NEmo+ dataset, containing news items that would lead to both differing and similar emotional responses, we also benchmark models for the novel task of predicting the distribution of evoked emotional responses in humans when presented with multi-modal news content. Our single-label and multi-label prediction models outperform baselines by large margins across several metrics.
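To make the two prediction targets in this abstract concrete, the following is a minimal sketch of how a set of crowd annotations for one news item could be turned into a distribution over emotions and a single dominant label. It is an illustration only, not the authors' pipeline; the emotion label set and the function names are hypothetical and do not come from the NEmo+ dataset.

```python
from collections import Counter

# Hypothetical label set for illustration; the actual NEmo+ labels may differ.
EMOTIONS = ["anger", "fear", "sadness", "neutral"]


def emotion_distribution(annotations, labels=EMOTIONS):
    """Return the fraction of annotators who chose each emotion label."""
    if not annotations:
        raise ValueError("at least one annotation is required")
    counts = Counter(annotations)
    total = len(annotations)
    return {label: counts[label] / total for label in labels}


def dominant_emotion(distribution):
    """Return the label with the highest share (the single-label target)."""
    return max(distribution, key=distribution.get)


# Toy annotations for one headline and lead image pairing.
toy_annotations = ["anger", "anger", "sadness", "fear"]
toy_distribution = emotion_distribution(toy_annotations)
toy_dominant = dominant_emotion(toy_distribution)
```

In this sketch, the distribution is the target for the distribution-prediction task, while the dominant label is the target for single-label prediction on items where annotators largely agree.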
Full Text Available
An Unsupervised Approach to Discover Media Frames

Lai, Sha; Jiang, Yanru; Guo, Lei; Betke, Margrit; Ishwar, Prakash; Wijaya, Derry Tanti (June 2022, Proceedings of The LREC 2022 workshop on Natural Language Processing for Political Sciences)

Media framing refers to highlighting certain aspects of an issue in the news to promote a particular interpretation to the audience. Supervised learning has often been used to recognize frames in news articles, requiring a known pool of frames for a particular issue, which must be identified by communication researchers through thorough manual content analysis. In this work, we devise an unsupervised learning approach to discover the frames in news articles automatically. Given a set of news articles for a given issue, e.g., gun violence, our method first extracts frame elements from these articles using related Wikipedia articles and the Wikipedia category system. It then uses a community detection approach to identify frames from these frame elements. We discuss the effectiveness of our approach by comparing the frames it generates in an unsupervised manner to the domain-expert-derived frames for the issue of gun violence, for which a supervised learning model for frame recognition exists.
Full Text Available
Cultural and Geographical Influences on Image Translatability of Words across Languages

https://doi.org/10.18653/v1/2021.naacl-main.19

Khani, Nikzad; Tourni, Isidora; Rasooli, Mohammad Sadegh; Callison-Burch, Chris; Wijaya, Derry Tanti (January 2021, The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)
Neural Machine Translation (NMT) models have been observed to produce poor translations when there are few or no parallel sentences to train the models. In the absence of parallel data, several approaches have turned to the use of images to learn translations. Since images of words, e.g., horse, may be unchanged across languages, translations can be identified via images associated with words in different languages that have a high degree of visual similarity. However, translating via images has been shown to improve upon text-only models only marginally. To better understand when images are useful for translation, we study image translatability of words, which we define as the translatability of words via images, by measuring intra- and inter-cluster similarities of image representations of words that are translations of each other. We find that images of words are not always invariant across languages, and that language pairs with shared culture, meaning having either a common language family, ethnicity or religion, have improved image translatability (i.e., have more similar images for similar words) compared to language pairs without shared culture, regardless of their geographic proximity. In addition, in line with previous works that show images help more in translating concrete words, we found that concrete words have improved image translatability compared to abstract ones.
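The intra- and inter-cluster comparison mentioned in this abstract can be sketched as follows. This is a hedged illustration that assumes each word is represented by a small cluster of image feature vectors and that similarity is mean pairwise cosine similarity; the abstract does not specify the exact measure, so the function names and the toy vectors are assumptions.

```python
import math
from itertools import combinations, product


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def intra_similarity(cluster):
    """Mean pairwise similarity among images of one word in one language."""
    pairs = list(combinations(cluster, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def inter_similarity(cluster_a, cluster_b):
    """Mean similarity between images of a word and images of its translation."""
    pairs = list(product(cluster_a, cluster_b))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


# Toy 2-D "image features" for a word and its translation (made-up values).
source_images = [[1.0, 0.0], [0.8, 0.6]]
target_images = [[0.0, 1.0], [0.6, 0.8]]

source_intra = intra_similarity(source_images)
cross_inter = inter_similarity(source_images, target_images)
```

Under these assumptions, an inter-cluster similarity close to the intra-cluster similarities would suggest that the word looks alike in both languages, while a much lower inter-cluster value would suggest that its images differ across languages.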
Full Text Available
IndoCollex: A Testbed for Morphological Transformation of Indonesian Word Colloquialism

https://doi.org/10.18653/v1/2021.findings-acl.280

Wibowo, Haryo Akbarianto; Nityasya, Made Nindyatama; Akyürek, Afra Feyza; Fitriany, Suci; Aji, Alham Fikri; Prasojo, Radityo Eko; Wijaya, Derry Tanti (January 2021, Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021)
The Indonesian language is heavily riddled with colloquialism, whether in written or spoken forms. In this paper, we identify a class of Indonesian colloquial words that have undergone morphological transformations from their standard forms, categorize their word formations, and propose a benchmark dataset of Indonesian Colloquial Lexicons (IndoCollex) consisting of informal words on Twitter expertly annotated with their standard forms and their word formation types/tags. We evaluate several models for character-level transduction to perform morphological word normalization on this testbed to understand their failure cases and provide baselines for future work. As IndoCollex catalogues word formation phenomena that are also present in the non-standard text of other languages, it can also provide an attractive testbed for methods tailored for cross-lingual word normalization and non-standard word formation.
Full Text Available
Multi-Label and Multilingual News Framing Analysis

https://doi.org/10.18653/v1/2020.acl-main.763

Akyürek, Afra Feyza; Guo, Lei; Elanwar, Randa; Ishwar, Prakash; Betke, Margrit; Wijaya, Derry Tanti (January 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)
Full Text Available
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage

https://doi.org/10.18653/v1/2021.findings-emnlp.339

Tourni, Isidora; Guo, Lei; Daryanto, Taufiq Husada; Zhafransyah, Fabian; Halim, Edward Edberg; Jalal, Mona; Chen, Boqi; Lai, Sha; Hu, Hengchang; Betke, Margrit; et al (January 2021, Findings of the Association for Computational Linguistics: 2021 Conference on Empirical Methods in Natural Language Processing. November 2021, pages 4037-4050, Punta Cana, Dominican Republic.)

News media structure their reporting of events or issues using certain perspectives. When describing an incident involving gun violence, for example, some journalists may focus on mental health or gun regulation, while others may emphasize the discussion of gun rights. Such perspectives are called “frames” in communication research. We study, for the first time, the value of combining lead images and their contextual information with text to identify the frame of a given news article. We observe that using multiple modes of information (article- and image-derived features) improves prediction of news frames over any single mode of information when the images are relevant to the frames of the headlines. We also observe that frame-image relevance is related to the ease of conveying frames via images, which we call frame concreteness. Additionally, we release the first multimodal news framing dataset related to gun violence in the U.S., curated and annotated by communication researchers. The dataset will allow researchers to further examine the use of multiple information modalities for studying media framing.
Full Text Available
OpenFraming: Open-sourced Tool for Computational Framing Analysis of Multilingual Data

https://doi.org/10.18653/v1/2021.emnlp-demo.28

Bhatia, Vibhu; Akavoor, Vidya Prasad; Paik, Sejin; Guo, Lei; Jalal, Mona; Smith, Alyssa; Tofu, David Assefa; Halim, Edward Edberg; Sun, Yimeng; Betke, Margrit; et al (January 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Online and Punta Cana, Dominican Republic)

When journalists cover a news story, they can cover the story from multiple angles or perspectives. These perspectives are called “frames,” and usage of one frame or another may influence public perception and opinion of the issue at hand. We develop a web-based system for analyzing frames in multilingual text documents. We propose and guide users through a five-step end-to-end computational framing analysis framework grounded in media framing theory in communication research. Users can use the framework to analyze multilingual text data, starting from the exploration of frames in the user’s corpora and a review of previous framing literature (steps 1-3) to frame classification (step 4) and prediction (step 5). The framework combines unsupervised and supervised machine learning and leverages a state-of-the-art (SoTA) multilingual language model, which can significantly enhance frame prediction performance while requiring only a small sample of manual annotations. Through the interactive website, anyone can perform the proposed computational framing analysis, making advanced computational analysis available to researchers without a programming background and bridging the digital divide within the communication research discipline in particular and the academic community in general. The system is available online at http://www.openframing.org, via an API http://www.openframing.org:5000/docs/, or through our GitHub page https://github.com/vibss2397/openFraming.
Full Text Available
Detecting Frames in News Headlines and Its Application to Analyzing News Framing Trends Surrounding U.S. Gun Violence

https://doi.org/10.18653/v1/K19-1047

Liu, Siyi; Guo, Lei; Mays, Kate; Betke, Margrit; Wijaya, Derry Tanti (January 2019, Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL))
Different news articles about the same topic often offer a variety of perspectives: an article written about gun violence might emphasize gun control, while another might promote 2nd Amendment rights, and yet a third might focus on mental health issues. In communication research, these different perspectives are known as “frames”, which, when used in news media, will influence the opinion of their readers in multiple ways. In this paper, we present a method for effectively detecting frames in news headlines. Our training and performance evaluation are based on a new dataset of news headlines related to the issue of gun violence in the United States. This Gun Violence Frame Corpus (GVFC) was curated and annotated by journalism and communication experts. Our proposed approach sets a new state-of-the-art performance for multiclass news frame detection, significantly outperforming a recent baseline by 35.9% absolute difference in accuracy. We apply our frame detection approach in a large-scale study of 88k news headlines about the coverage of gun violence in the U.S. between 2016 and 2018.
Full Text Available
