Biases in evaluation datasets limit the real-world usefulness of current hate speech detection models. HateDay is a globally representative hate speech dataset built from a random sample of all tweets posted on September 21, 2022, covering eight languages and four English-speaking countries. The analysis shows that the prevalence and composition of hate speech vary substantially across languages and countries.

Evaluations on academic datasets significantly overestimate detection performance, which is especially poor for non-English languages. Two major failure modes stand out: distinguishing hate speech from merely offensive speech, and a mismatch between the target groups emphasized in academic datasets and the groups actually attacked in the wild. The authors conclude that publicly available models are insufficient for automated moderation and that effective detection still requires substantial human supervision. This underscores the importance of evaluating systems on data that reflects the complexity of global social media conversation.

https://aclanthology.org/2025.acl-long.115
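One reason academic benchmarks overestimate real-world performance is class imbalance: hate speech is rare in a random sample of tweets, so even a classifier with strong sensitivity and specificity yields low precision in deployment. A minimal sketch of this effect via Bayes' rule, using illustrative (assumed) operating-point and prevalence numbers rather than figures from the paper:

```python
def precision_at_prevalence(sensitivity, specificity, prevalence):
    """Precision of a fixed classifier applied to a population
    with the given positive-class prevalence (Bayes' rule)."""
    tp = sensitivity * prevalence                   # true-positive mass
    fp = (1 - specificity) * (1 - prevalence)       # false-positive mass
    return tp / (tp + fp)

# Assumed operating point: 90% sensitivity, 95% specificity.
# On a roughly balanced academic benchmark (prevalence ~50%):
balanced = precision_at_prevalence(0.90, 0.95, 0.50)
# On a random day of tweets where hate is rare (assumed ~1%):
in_the_wild = precision_at_prevalence(0.90, 0.95, 0.01)

print(f"{balanced:.2f}")      # ~0.95
print(f"{in_the_wild:.2f}")   # ~0.15
```

The same model drops from roughly 95% to roughly 15% precision purely because of the prevalence shift, which is why evaluation on representative samples like HateDay matters.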