Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)

In this paper, we investigate the generalization capabilities of deep learning models to different target groups of hate speech under clean experimental settings. Furthermore, we assess the efficacy of three different strategies of unsupervised domain adaptation to improve these capabilities. Given the diversity of hate and its rapid dynamics in the online world (e.g. the evolution of new target groups like virologists during the COVID-19 pandemic), robustly detecting hate aimed at newly identified target groups is a highly relevant research question. We show that naively trained models suffer from a target group specific bias, which can be reduced via domain adaptation. We were able to achieve a relative improvement of the F1-score between 5.8% and 10.7% for out-of-domain target groups of hate speech compared to baseline approaches by utilizing domain adaptation.

https://aclanthology.org/2022.woah-1.4/

Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)

Byauthor

Like this:

By author

Related Post

Audio… How good is AI at detecting online hate? (The Alan Turing Institute)

Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models (PLOS ONE)

Large-Scale Hate Speech Detection with Cross-Domain Transfer (Papers with Code)

Leave a Reply Cancel reply

preventhate.org | Policyinstitute.net

Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)

Byauthor

Share this:

Like this:

By author

Related Post

Audio… How good is AI at detecting online hate? (The Alan Turing Institute)

Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models (PLOS ONE)

Large-Scale Hate Speech Detection with Cross-Domain Transfer (Papers with Code)

Leave a Reply Cancel reply