Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)

Byauthor

Aug 14, 2022 #Algorithms, #Generalities

In this paper, we investigate the generalization capabilities of deep learning models to different target groups of hate speech under clean experimental settings. Furthermore, we assess the efficacy of three different strategies of unsupervised domain adaptation to improve these capabilities. Given the diversity of hate and its rapid dynamics in the online world (e.g. the evolution of new target groups like virologists during the COVID-19 pandemic), robustly detecting hate aimed at newly identified target groups is a highly relevant research question. We show that naively trained models suffer from a target group specific bias, which can be reduced via domain adaptation. We were able to achieve a relative improvement of the F1-score between 5.8% and 10.7% for out-of-domain target groups of hate speech compared to baseline approaches by utilizing domain adaptation.

https://aclanthology.org/2022.woah-1.4/

By author

Algorithms Policies Society

Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation (ACL Anthology)

Byauthor

Like this:

By author

Related Post

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, July 2025 (I/II)

A Large Language Model-Based Approach for Multilingual Hate Speech Detection on Social Media (MDPI)

Trio Innovators @ DravidianLangTech 2025: Multimodal Hate Speech Detection in Dravidian Languages (ACL Anthology)

Leave a Reply Cancel reply

LATEST NEWS

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, July 2025 (I/II)

Experiences of online hate and abuse among women in politics (Ofcom)

A Large Language Model-Based Approach for Multilingual Hate Speech Detection on Social Media (MDPI)

Conspiracy to Commit: Information Pollution, Artificial Intelligence, and Real-World Hate Crime (arXiv)

Trio Innovators @ DravidianLangTech 2025: Multimodal Hate Speech Detection in Dravidian Languages (ACL Anthology)

Site Stats

preventhate.org | Policyinstitute.net

Byauthor

Share this:

Like this:

By author

Related Post

Leave a Reply Cancel reply