Large-Scale Hate Speech Detection with Cross-Domain Transfer (arXiv)

Mar 16, 2022 #Algorithms

“In this study, we construct large-scale tweet datasets for hate speech detection in English and a low-resource language, Turkish, consisting of human-labeled 100k tweets per each. Our datasets are designed to have equal number of tweets distributed over five domains. The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection. The performance is also scalable to different training sizes, such that 98% of performance in English, and 97% in Turkish, are recovered when 20% of training instances are used. We further examine the generalization ability of cross-domain transfer among hate domains. We show that 96% of the performance of a target domain in average is recovered by other domains for English, and 92% for Turkish. Gender and religion are more successful to generalize to other domains, while sports fail most.”

https://arxiv.org/pdf/2203.01111.pdf



preventhate.org | Policyinstitute.net
