Causality Guided Disentanglement for Cross-Platform Hate Speech Detection (paperswithcode)

Byauthor

Aug 15, 2023 #Algorithms, #Assorted

Our research introduces a cross-platform hate speech detection model capable of being trained on one platform’s data and generalizing to multiple unseen platforms. To achieve good generalizability across platforms, one way is to disentangle the input representations into invariant and platform-dependent features. We also argue that learning causal relationships, which remain constant across diverse environments, can significantly aid in understanding invariant representations in hate speech. By disentangling input into platform-dependent features (useful for predicting hate targets) and platform-independent features (used to predict the presence of hate), we learn invariant representations resistant to distribution shifts. These features are then used to predict hate speech across unseen platforms.

https://paperswithcode.com/paper/causality-guided-disentanglement-for-cross

By author

Algorithms Assorted Policies

International Day for Countering Hate Speech 2025: Hate Speech and Artificial Intelligence nexus (United Nations)

Jun 22, 2025 author

Assorted Policies

Reducing the Emotional Distress of Content Moderators through LLM-based Target Substitution in Implicit and Explicit Hate-Speech (ACM)

Jun 22, 2025 author

Assorted Policies

Video… Preventing and combatting hate crime, including criminalised hate speech, in focus of a conference in Strasbourg (Council of Europe)

Jun 22, 2025 author

Causality Guided Disentanglement for Cross-Platform Hate Speech Detection (paperswithcode)

Byauthor

Like this:

By author

Related Post

International Day for Countering Hate Speech 2025: Hate Speech and Artificial Intelligence nexus (United Nations)

Reducing the Emotional Distress of Content Moderators through LLM-based Target Substitution in Implicit and Explicit Hate-Speech (ACM)

Video… Preventing and combatting hate crime, including criminalised hate speech, in focus of a conference in Strasbourg (Council of Europe)

Leave a Reply Cancel reply

LATEST NEWS

International Day for Countering Hate Speech 2025: Hate Speech and Artificial Intelligence nexus (United Nations)

Reducing the Emotional Distress of Content Moderators through LLM-based Target Substitution in Implicit and Explicit Hate-Speech (ACM)

Video… Preventing and combatting hate crime, including criminalised hate speech, in focus of a conference in Strasbourg (Council of Europe)

Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models? (arXiv)

Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets (arXiv)

Site Stats

preventhate.org | Policyinstitute.net

Byauthor

Share this:

Like this:

By author

Related Post

Leave a Reply Cancel reply