Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate (ACL Anthology)

Sep 17, 2022 #Algorithms, #Assorted

Detecting online hate is a complex task, and low-performing models have harmful consequences when used for sensitive applications such as content moderation. Emoji-based hate is an emerging challenge for automated detection. We present HatemojiCheck, a test suite of 3,930 short-form statements that allows us to evaluate performance on hateful language expressed with emoji. Using the test suite, we expose weaknesses in existing hate detection models. To address these weaknesses, we create the HatemojiBuild dataset using a human-and-model-in-the-loop approach. Models built with these 5,912 adversarial examples perform substantially better at detecting emoji-based hate, while retaining strong performance on text-only hate. Both HatemojiCheck and HatemojiBuild are made publicly available.
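A test suite like the one described is typically used by scoring a model separately on each category of test case, so that weaknesses on emoji-based inputs are not hidden by a strong overall average. The sketch below illustrates that idea only. The `SuiteCase` fields, the category names, the placeholder statements, and the toy `keyword_model` are all assumptions made for illustration; none of them come from HatemojiCheck or from the paper's models.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, NamedTuple


class SuiteCase(NamedTuple):
    text: str            # short-form statement, possibly containing emoji
    functionality: str   # hypothetical category label for the test case
    is_hateful: bool     # gold label


def accuracy_by_functionality(
    cases: Iterable[SuiteCase], predict: Callable[[str], bool]
) -> Dict[str, float]:
    """Return the share of correctly classified cases per functionality."""
    correct: Dict[str, int] = defaultdict(int)
    total: Dict[str, int] = defaultdict(int)
    for case in cases:
        total[case.functionality] += 1
        if predict(case.text) == case.is_hateful:
            correct[case.functionality] += 1
    return {name: correct[name] / total[name] for name in total}


# Placeholder cases with neutral stand-in text; not drawn from HatemojiCheck.
CASES = [
    SuiteCase("placeholder hateful statement", "text_only", True),
    SuiteCase("placeholder benign statement", "text_only", False),
    SuiteCase("placeholder hateful statement 🐍", "emoji_substitution", True),
    SuiteCase("placeholder benign statement 🎉", "emoji_substitution", False),
]


def keyword_model(text: str) -> bool:
    """Toy stand-in classifier: flags the keyword only in pure-ASCII text."""
    return "hateful" in text and text.isascii()


scores = accuracy_by_functionality(CASES, keyword_model)
for name, score in scores.items():
    print(f"{name}: {score:.2f}")
```

Reporting one score per category, rather than a single aggregate, is what lets a suite of this kind point at the specific kinds of input a model mishandles.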

https://aclanthology.org/2022.naacl-main.97/



preventhate.org | Policyinstitute.net
