Large-Scale Hate Speech Detection with Cross-Domain Transfer (arXiv)

“In this study, we construct large-scale tweet datasets for hate speech detection in English and a low-resource language, Turkish, consisting of human-labeled 100k tweets per each. Our datasets are designed to have equal number of tweets distributed over five domains. The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection. The performance is also scalable to different training sizes, such that 98% of performance in English, and 97% in Turkish, are recovered when 20% of training instances are used. We further examine the generalization ability of cross-domain transfer among hate domains. We show that 96% of the performance of a target domain in average is recovered by other domains for English, and 92% for Turkish. Gender and religion are more successful to generalize to other domains, while sports fail most.”

https://arxiv.org/pdf/2203.01111.pdf

Leave a Reply

Your email address will not be published.

Related Post

Using Transfer-based Language Models to Detect Hateful and Offensive Language Online (Proceedings of the Fourth Workshop on Online Abuse and Harms)Using Transfer-based Language Models to Detect Hateful and Offensive Language Online (Proceedings of the Fourth Workshop on Online Abuse and Harms)

“The results indicate that the attention-based models profoundly confuse hate speech with offensive and normal language. However, the pre-trained models outperform state-of-the-art results in terms of accurately predicting the hateful

%d bloggers like this: