Title: Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech
Countermeasures that effectively fight the ever-increasing hate speech online without blocking freedom of speech are of great social interest. Natural Language Generation (NLG) is uniquely capable of developing scalable solutions. However, off-the-shelf NLG methods are primarily sequence-to-sequence neural models, and they are limited in that they generate commonplace, repetitive, and safe responses regardless of the hate speech (e.g., "Please refrain from using such language."), or irrelevant responses, making them ineffective for de-escalating hateful conversations. In this paper, we design a three-module pipeline approach to effectively improve diversity and relevance. Our proposed pipeline first generates various counterspeech candidates with a generative model to promote diversity, then filters out the ungrammatical ones using a BERT model, and finally selects the most relevant counterspeech response using a novel retrieval-based method. Extensive experiments on three representative datasets demonstrate the efficacy of our approach in generating diverse and relevant counterspeech.
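The abstract describes a generate-prune-select pipeline. Below is a minimal sketch of that idea, assuming Hugging Face transformers and sentence-transformers; the model names, the CoLA-based grammaticality filter, the sampling settings, and the similarity-based selection step are illustrative placeholders, not the authors' actual configuration.

```python
# Sketch of a generate -> prune -> select counterspeech pipeline.
# All model choices and thresholds below are assumptions for illustration.
from transformers import pipeline
from sentence_transformers import SentenceTransformer, util

generator = pipeline("text-generation", model="gpt2")                        # step 1: candidate generation
grammar_filter = pipeline("text-classification",
                          model="textattack/bert-base-uncased-CoLA")         # step 2: BERT acceptability filter
retriever = SentenceTransformer("all-MiniLM-L6-v2")                          # step 3: relevance scoring

def counterspeech(hate_post: str, reference_responses: list[str], n: int = 10) -> str:
    # 1) Generate: sample diverse candidates conditioned on the hateful post.
    outputs = generator(hate_post, max_new_tokens=40, num_return_sequences=n,
                        do_sample=True, top_p=0.9)
    candidates = [o["generated_text"][len(hate_post):].strip() for o in outputs]

    # 2) Prune: keep candidates the acceptability model labels as grammatical
    #    (for this CoLA checkpoint, LABEL_1 is assumed to mean "acceptable").
    kept = [c for c in candidates if grammar_filter(c)[0]["label"] == "LABEL_1"]
    kept = kept or candidates  # fall back if every candidate was filtered out

    # 3) Select: pick the candidate most similar to human-written counterspeech
    #    for comparable posts, a stand-in for the paper's retrieval-based step.
    cand_emb = retriever.encode(kept, convert_to_tensor=True)
    ref_emb = retriever.encode(reference_responses, convert_to_tensor=True)
    scores = util.cos_sim(cand_emb, ref_emb).max(dim=1).values
    return kept[int(scores.argmax())]
```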
Award ID(s):
1720268
PAR ID:
10292072
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Findings of the Association for Computational Linguistics
Page Range / eLocation ID:
134-149
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. Identifying the targets of hate speech is a crucial step in grasping the nature of such speech and, ultimately, in improving the detection of offensive posts on online forums. Much harmful content on online platforms uses implicit language, especially when targeting vulnerable and protected groups, such as referring to stereotypical characteristics instead of explicit target names, which makes such language harder to detect and mitigate. In this study, we focus on identifying implied targets of hate speech, essential for recognizing subtler hate speech and enhancing the detection of harmful content on digital platforms. We define a new task aimed at identifying the targets even when they are not explicitly stated. To address this task, we collect and annotate target spans in three prominent implicit hate speech datasets: SBIC, DynaHate, and IHC. We call the resulting merged collection Implicit-Target-Span. The collection is built using an innovative pooling method with matching scores based on human annotations and Large Language Models (LLMs). Our experiments indicate that Implicit-Target-Span provides a challenging test bed for target span detection methods.
  2. Abstract Humans use language toward hateful ends, inciting violence and genocide, intimidating and denigrating others based on their identity. Despite efforts to better address the language of hate in the public sphere, the psychological processes involved in hateful language remain unclear. In this work, we hypothesize that morality and hate are concomitant in language. In a series of studies, we find evidence in support of this hypothesis using language from a diverse array of contexts, including the use of hateful language in propaganda to inspire genocide (Study 1), hateful slurs as they occur in large text corpora across a multitude of languages (Study 2), and hate speech on social-media platforms (Study 3). In post hoc analyses focusing on particular moral concerns, we found that the type of moral content invoked through hate speech varied by context, with Purity language prominent in hateful propaganda and online hate speech and Loyalty language invoked in hateful slurs across languages. Our findings provide a new psychological lens for understanding hateful language and point to further research into the intersection of morality and hate, with practical implications for mitigating hateful rhetoric online.
  3. With the spreading of hate speech on social media in recent years, automatic detection of hate speech is becoming a crucial task and has attracted attention from various communities. This task aims to recognize online posts (e.g., tweets) that contain hateful information. The peculiarities of languages in social media, such as short and poorly written content, lead to the difficulty of learning semantics and capturing discriminative features of hate speech. Previous studies have utilized additional useful resources, such as sentiment hashtags, to improve the performance of hate speech detection. Hashtags are added as input features serving either as sentiment-lexicons or extra context information. However, our close investigation shows that directly leveraging these features without considering their context may introduce noise to classifiers. In this paper, we propose a novel approach to leverage sentiment hashtags to enhance hate speech detection in a natural language inference framework. We design a novel framework SRIC that simultaneously performs two tasks: (1) semantic relation inference between online posts and sentiment hashtags, and (2) sentiment classification on these posts. The semantic relation inference aims to encourage the model to encode sentiment-indicative information into representations of online posts. We conduct extensive experiments on two real-world datasets and demonstrate the effectiveness of our proposed framework compared with state-of-the-art representation learning models. 
  4. Abstract Social stereotypes negatively impact individuals’ judgments about different groups and may have a critical role in understanding language directed toward marginalized groups. Here, we assess the role of social stereotypes in the automated detection of hate speech in the English language by examining the impact of social stereotypes on annotation behaviors, annotated datasets, and hate speech classifiers. Specifically, we first investigate the impact of novice annotators’ stereotypes on their hate-speech-annotation behavior. Then, we examine the effect of normative stereotypes in language on the aggregated annotators’ judgments in a large annotated corpus. Finally, we demonstrate how normative stereotypes embedded in language resources are associated with systematic prediction errors in a hate-speech classifier. The results demonstrate that hate-speech classifiers reflect social stereotypes against marginalized groups, which can perpetuate social inequalities when propagated at scale. This framework, combining social-psychological and computational-linguistic methods, provides insights into sources of bias in hate-speech moderation, informing ongoing debates regarding machine learning fairness. 
  5. Large language models (LLMs) are fast becoming ubiquitous and have shown impressive performance in various natural language processing (NLP) tasks. Annotating data for downstream applications is a resource-intensive task in NLP. Recently, the use of LLMs as a cost-effective data annotator for annotating data used to train other models or as an assistive tool has been explored. Yet, little is known regarding the societal implications of using LLMs for data annotation. In this work, focusing on hate speech detection, we investigate how using LLMs such as GPT-4 and Llama-3 for hate speech detection can lead to performance disparities across text dialects and racial bias in online hate detection classifiers. We used LLMs to predict hate speech in seven hate speech datasets and trained classifiers on the LLM annotations of each dataset. Using tweets written in African-American English (AAE) and Standard American English (SAE), we show that classifiers trained on LLM annotations assign tweets written in AAE to negative classes (e.g., hate, offensive, abuse, racism, etc.) at a higher rate than tweets written in SAE and that the classifiers have a higher false positive rate towards AAE tweets. We explore the effect of incorporating dialect priming in the prompting techniques used in prediction, showing that introducing dialect increases the rate at which AAE tweets are assigned to negative classes.
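The last related work (item 5) mentions dialect priming in the prompts used for LLM-based hate speech prediction. Below is a minimal, hypothetical sketch of what such a primed prediction call might look like using the OpenAI Python SDK; the prompt wording, label set, and model name are assumptions for illustration, not details taken from that study.

```python
# Sketch of dialect-primed prompting for hate speech labeling.
# The prompt text, labels, and model are placeholders, not the study's setup.
from openai import OpenAI

client = OpenAI()

def classify(tweet: str, dialect: str | None = None) -> str:
    # Optionally prepend the dialect information before asking for a label.
    prime = f"The following tweet is written in {dialect}. " if dialect else ""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user",
                   "content": prime + f"Label this tweet as HATE or NOT_HATE:\n{tweet}"}],
    )
    return resp.choices[0].message.content.strip()

# Example: compare predictions with and without dialect priming.
# classify("some tweet text")
# classify("some tweet text", dialect="African-American English")
```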