Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization (arXiv)

“The intention of hate speech normalization is not to support hate but instead to provide the users with a stepping stone towards non-hate while giving online platforms more time to monitor any improvement in the user’s behavior. To this end, we manually curated a parallel corpus – hate texts and their normalized counterparts (a normalized text is less hateful and more benign). We introduce NACL, a simple yet efficient hate speech normalization model that operates in three stages – first, it measures the hate intensity of the original sample; second, it identifies the hate span(s) within it; and finally, it reduces hate intensity by paraphrasing the hate spans. We perform extensive experiments to measure the efficacy of NACL via three-way evaluation (intrinsic, extrinsic, and human-study). We observe that NACL outperforms six baselines – NACL yields a score of 0.1365 RMSE for the intensity prediction, 0.622 F1-score in the span identification, and 82.27 BLEU and 80.05 perplexity for the normalized text generation. We further show the generalizability of NACL across other platforms (Reddit, Facebook, Gab). An interactive prototype of NACL was put together for the user study. Further, the tool is being deployed in a real-world setting at Wipro AI as a part of its mission to tackle harmful content on online platforms.”

https://arxiv.org/abs/2206.04007

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization (arXiv)

Like this:

Leave a Reply Cancel reply

LATEST NEWS

New on preventhate.org, 12 July 2026 (Policyinstitute.net)

UNESCO launches issue brief on Media and Information Literacy to counter hate speech in the digital age (UNESCO)

Five lessons from the No Hate Speech Week: what we heard, what we learned, what comes next (Council of Europe)

Hate speech levels across Europe alarming, stronger action needed (Council of Europe)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – December 2025 (II/II)

TAGS

preventhate.org | Policyinstitute.net

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization (arXiv)

Share this:

Like this:

Leave a Reply Cancel reply

New on preventhate.org, 12 July 2026 (Policyinstitute.net)

UNESCO launches issue brief on Media and Information Literacy to counter hate speech in the digital age (UNESCO)

Five lessons from the No Hate Speech Week: what we heard, what we learned, what comes next (Council of Europe)

Hate speech levels across Europe alarming, stronger action needed (Council of Europe)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – December 2025 (II/II)

TAGS