Title: Target Span Detection for Implicit Harmful Content
Identifying the targets of hate speech is a crucial step in grasping the nature of such speech and, ultimately, in improving the detection of offensive posts on online forums. Much harmful content on online platforms uses implicit language, especially when targeting vulnerable and protected groups, such as describing stereotypical characteristics instead of naming the target explicitly, which makes such language harder to detect and mitigate. In this study, we focus on identifying the implied targets of hate speech, a step essential for recognizing subtler hate speech and enhancing the detection of harmful content on digital platforms. We define a new task aimed at identifying the targets even when they are not explicitly stated. To address this task, we collect and annotate target spans in three prominent implicit hate speech datasets: SBIC, DynaHate, and IHC. We call the resulting merged collection Implicit-Target-Span. The annotations are produced with a pooling method that combines matching scores based on human annotations and Large Language Models (LLMs). Our experiments indicate that Implicit-Target-Span provides a challenging test bed for target span detection methods.
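To make the pooling idea concrete, here is a minimal sketch of agreement-based span pooling. It assumes spans are given as sets of token indices and uses token-overlap F1 as the matching score; the function names, the threshold, and the scoring itself are illustrative, not the paper's exact procedure.

```python
# Illustrative sketch of pooling candidate target spans by agreement.
# Token-overlap F1 is a common span-matching score; the paper's exact
# matching score may differ.

def overlap_f1(span_a: set, span_b: set) -> float:
    """Token-overlap F1 between two spans given as sets of token indices."""
    if not span_a or not span_b:
        return 0.0
    overlap = len(span_a & span_b)
    if overlap == 0:
        return 0.0
    precision = overlap / len(span_b)
    recall = overlap / len(span_a)
    return 2 * precision * recall / (precision + recall)

def pool_spans(candidates: list, threshold: float = 0.5) -> list:
    """Keep candidate spans whose mean agreement with the other
    candidates (human- or LLM-proposed) clears the threshold."""
    pooled = []
    for i, span in enumerate(candidates):
        scores = [overlap_f1(span, other)
                  for j, other in enumerate(candidates) if j != i]
        if scores and sum(scores) / len(scores) >= threshold:
            pooled.append(span)
    return pooled

# Candidates for one post: two human annotations and one LLM suggestion,
# each a set of token positions marking the implied target.
human_1 = {4, 5, 6}
human_2 = {4, 5}
llm_out = {3, 4, 5, 6}
print(pool_spans([human_1, human_2, llm_out]))
```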
Award ID(s):
2106282
PAR ID:
10542773
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
ISBN:
9798400706813
Page Range / eLocation ID:
117 to 122
Subject(s) / Keyword(s):
Hate speech detection, Pooling, Dataset
Format(s):
Medium: X
Location:
Washington DC USA
Sponsoring Org:
National Science Foundation
More Like this
1. With the spread of hate speech on social media in recent years, automatic hate speech detection has become a crucial task and has attracted attention from various communities. This task aims to recognize online posts (e.g., tweets) that contain hateful information. The peculiarities of language on social media, such as short and poorly written content, make it difficult to learn semantics and capture discriminative features of hate speech. Previous studies have utilized additional resources, such as sentiment hashtags, to improve the performance of hate speech detection. Hashtags have been added as input features, serving either as sentiment lexicons or as extra context information. However, our close investigation shows that directly leveraging these features without considering their context may introduce noise to classifiers. In this paper, we propose a novel approach that leverages sentiment hashtags to enhance hate speech detection in a natural language inference framework. We design a novel framework, SRIC, that simultaneously performs two tasks: (1) semantic relation inference between online posts and sentiment hashtags, and (2) sentiment classification on these posts. The semantic relation inference encourages the model to encode sentiment-indicative information into representations of online posts. We conduct extensive experiments on two real-world datasets and demonstrate the effectiveness of our proposed framework compared with state-of-the-art representation learning models.
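A minimal sketch of the two-head, multi-task setup SRIC describes, assuming a shared encoder feeding a relation-inference head over (post, hashtag) pairs and a classification head over posts. The architecture details (GRU encoder, dimensions, losses) are illustrative stand-ins, not the authors' implementation.

```python
# Multi-task sketch in the spirit of SRIC (not the authors' code):
# a shared text encoder feeds two heads, one inferring the semantic
# relation between a post and a sentiment hashtag, one classifying
# the post itself. Sizes below are illustrative assumptions.
import torch
import torch.nn as nn

class SRICSketch(nn.Module):
    def __init__(self, vocab_size=30522, dim=256, n_relations=3, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.encoder = nn.GRU(dim, dim, batch_first=True)
        self.relation_head = nn.Linear(2 * dim, n_relations)  # post vs. hashtag
        self.class_head = nn.Linear(dim, n_classes)           # hate / not hate

    def encode(self, token_ids):
        _, h = self.encoder(self.embed(token_ids))
        return h.squeeze(0)  # (batch, dim)

    def forward(self, post_ids, hashtag_ids):
        post = self.encode(post_ids)
        tag = self.encode(hashtag_ids)
        relation_logits = self.relation_head(torch.cat([post, tag], dim=-1))
        class_logits = self.class_head(post)
        return relation_logits, class_logits

model = SRICSketch()
post = torch.randint(0, 30522, (4, 32))  # batch of post token ids
tag = torch.randint(0, 30522, (4, 8))    # batch of hashtag token ids
rel_logits, cls_logits = model(post, tag)
# Train with a joint loss, e.g. CE(rel_logits, rel_y) + CE(cls_logits, cls_y).
```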
  2. Abstract

Humans use language toward hateful ends, inciting violence and genocide, intimidating and denigrating others based on their identity. Despite efforts to better address the language of hate in the public sphere, the psychological processes involved in hateful language remain unclear. In this work, we hypothesize that morality and hate are concomitant in language. In a series of studies, we find evidence in support of this hypothesis using language from a diverse array of contexts, including the use of hateful language in propaganda to inspire genocide (Study 1), hateful slurs as they occur in large text corpora across a multitude of languages (Study 2), and hate speech on social-media platforms (Study 3). In post hoc analyses focusing on particular moral concerns, we found that the type of moral content invoked through hate speech varied by context, with Purity language prominent in hateful propaganda and online hate speech, and Loyalty language invoked in hateful slurs across languages. Our findings provide a new psychological lens for understanding hateful language and point to further research into the intersection of morality and hate, with practical implications for mitigating hateful rhetoric online.

     
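Studies of this kind typically quantify moral content by matching text against moral-concern dictionaries. The sketch below illustrates that style of analysis with tiny invented word lists for Purity and Loyalty; the study's actual lexicons and methods may differ.

```python
# Illustrative lexicon-matching analysis: count how often words from
# moral-concern lexicons appear in a document. The word lists below are
# invented stand-ins, not the study's actual dictionaries.
import re
from collections import Counter

MORAL_LEXICONS = {
    "purity": {"pure", "filth", "disgust", "sacred", "contaminate"},
    "loyalty": {"loyal", "betray", "traitor", "ally", "solidarity"},
}

def moral_counts(text: str) -> Counter:
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for concern, words in MORAL_LEXICONS.items():
        counts[concern] = sum(1 for t in tokens if t in words)
    return counts

print(moral_counts("They betray us and contaminate everything sacred."))
# Counter({'purity': 2, 'loyalty': 1})
```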
3. Hateful comments are prevalent on social media platforms. Although tools for automatically detecting, flagging, and blocking such false, offensive, and harmful content online have lately matured, such reactive and brute-force methods alone provide short-term and superficial remedies while the perpetrators persist. With the public availability of large language models that can generate articulate, engaging synthetic content at scale, there are concerns about the rapid dissemination of such malicious content on the web. There is now a need to focus on deeper, long-term solutions that engage with the human perpetrator behind the content to change their viewpoint, or at least bring down the rhetoric, using persuasive means. To that end, we propose defining and experimenting with controllable strategies for generating counterarguments to hateful comments in online conversations. We experiment with controlling response generation using features based on (i) argument structure and reasoning-based Walton argument schemes, (ii) counter-argument speech acts, and (iii) human characteristics-based qualities such as Big-5 personality traits and human values. Using automatic and human evaluations, we determine the best combination of features that generate fluent, argumentative, and logically sound arguments for countering hate. We further share the developed computational models for automatically annotating text with such features, and a silver-standard annotated version of an existing hate speech dialog corpus.
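One simple way to realize feature-controlled generation is to prepend control codes to the generator's input. The sketch below builds such a prompt from the three feature families named above; the template, feature values, and function name are illustrative assumptions, not the authors' setup.

```python
# Sketch of feature-controlled counterargument generation. The bracketed
# control codes mirror the abstract's three feature families (argument
# scheme, counter-speech act, human qualities); values are illustrative.

def build_prompt(hateful_comment: str,
                 scheme: str = "argument-from-consequences",
                 speech_act: str = "refutation",
                 persona: str = "high-agreeableness") -> str:
    return (
        f"[scheme={scheme}] [act={speech_act}] [persona={persona}]\n"
        f"Hateful comment: {hateful_comment}\n"
        f"Write a fluent, logically sound counterargument:"
    )

prompt = build_prompt("They're all criminals and should be banned.")
print(prompt)
# Feed `prompt` to a conditioned generator (e.g., a fine-tuned seq2seq
# model) trained so the control codes steer the style of the output.
```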
4. Detecting harmful content on social media, such as Twitter, is made difficult by the fact that the seemingly simple yes/no classification conceals a significant amount of complexity. Unfortunately, while several datasets have been collected for training classifiers on hate and offensive speech, datasets labeled at a finer granularity of target classes and specific targets are scarce. In this paper, we introduce THOS, a dataset of 8.3k tweets manually labeled with fine-grained annotations about the target of the message. We demonstrate that this dataset makes it feasible to train classifiers, based on Large Language Models, to perform classification at this level of granularity.
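As a rough illustration of what such fine-grained target classification looks like in code, the sketch below scores a tweet against a small target-class label set with a pretrained transformer. The label names and checkpoint are invented for illustration; THOS's actual taxonomy and training details are in the paper.

```python
# Sketch of fine-grained target classification with a pretrained
# transformer. The label set and checkpoint are illustrative; the head
# would be fine-tuned on target-annotated tweets before real use.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

TARGET_LABELS = ["religion", "ethnicity", "gender", "individual", "none"]

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=len(TARGET_LABELS)
)

tweet = "an example tweet to classify"
inputs = tokenizer(tweet, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
print(TARGET_LABELS[logits.argmax(dim=-1).item()])
```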
5. Islamophobia, prejudice against the Muslim community, is present on social media platforms. In addition to harming victims, it also hurts the reputation of platforms that claim to provide a safe online environment for all users. The volume of social media content makes manual review impossible, so it is important to find automated solutions to combat hate speech on these platforms. Machine learning approaches have been used in the literature to automate hate speech detection. In this paper, we use deep learning techniques to detect Islamophobia on Reddit, and topic modeling to analyze the content and reveal topics in comments identified as Islamophobic. Topics we identified include the Islamic dress code, religious practices, marriage, and politics. The highest detection performance was achieved with BERT-base+CNN, with an F1-score of 0.92.

     
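A minimal sketch of a BERT+CNN classifier in the spirit of the best-performing model: contextual token embeddings feed a 1-D convolution, max-pooling, and a linear head. The layer sizes, frozen encoder, and checkpoint are illustrative choices, not the authors' exact configuration.

```python
# BERT+CNN sketch: contextual token embeddings from a frozen BERT-base
# feed a 1-D convolution, max-pooling, and a binary classifier. Layer
# sizes and the frozen encoder are illustrative assumptions.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class BertCNN(nn.Module):
    def __init__(self, n_classes=2, n_filters=128, kernel=3):
        super().__init__()
        self.bert = AutoModel.from_pretrained("bert-base-uncased")
        self.conv = nn.Conv1d(self.bert.config.hidden_size, n_filters, kernel)
        self.fc = nn.Linear(n_filters, n_classes)

    def forward(self, input_ids, attention_mask):
        with torch.no_grad():  # encoder kept frozen in this sketch
            hidden = self.bert(
                input_ids, attention_mask=attention_mask
            ).last_hidden_state                                # (B, T, H)
        feats = torch.relu(self.conv(hidden.transpose(1, 2)))  # (B, F, T')
        pooled = feats.max(dim=-1).values                      # (B, F)
        return self.fc(pooled)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
batch = tokenizer(["an example reddit comment"], return_tensors="pt")
logits = BertCNN()(batch["input_ids"], batch["attention_mask"])
```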