“In this study, we construct large-scale tweet datasets for hate speech detection in English and a low-resource language, Turkish, consisting of human-labeled 100k tweets per each. Our datasets are designed to have equal number of tweets distributed over five domains. The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection. The performance is also scalable to different training sizes, such that 98% of performance in English, and 97% in Turkish, are recovered when 20% of training instances are used. We further examine the generalization ability of cross-domain transfer among hate domains. We show that 96% of the performance of a target domain in average is recovered by other domains for English, and 92% for Turkish. Gender and religion are more successful to generalize to other domains, while sports fail most.”
Large-Scale Hate Speech Detection with Cross-Domain Transfer (arXiv)
Categories:
Related Post
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech (arXiv)Latent Hatred: A Benchmark for Understanding Implicit Hate Speech (arXiv)
“Despite much attention being paid to characterize and detect discriminatory speech, most work has focused on explicit or overt hate speech, failing to address a more pervasive form based on
Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)
“In this paper, we investigate the generalization capabilities of deep learning models to different target groups of hate speech under clean experimental settings. Furthermore, we assess the efficacy of three
Hate Speech and Counter Speech Detection: Conversational Context Does Matter (ACL Anthology)Hate Speech and Counter Speech Detection: Conversational Context Does Matter (ACL Anthology)
“This paper investigates the role of context in the annotation and detection of online hate and counter speech, where context is defined as the preceding comment in a conversation thread.