Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection (arXiv)

By author

Jul 28, 2023 #Algorithms, #Policies

In this paper, we present Rule By Example (RBE): a novel exemplar-based contrastive learning approach for learning from logical rules for the task of textual content moderation. RBE is capable of providing rule-grounded predictions, allowing for more explainable and customizable predictions compared to typical deep learning-based approaches. We demonstrate that our approach is capable of learning rich rule embedding representations using only a few data examples. Experimental results on 3 popular hate speech classification datasets show that RBE is able to outperform state-of-the-art deep learning classifiers as well as the use of rules in both supervised and unsupervised settings while providing explainable model predictions via rule-grounding.

https://arxiv.org/abs/2307.12935
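
To make the idea of a "rule-grounded prediction" concrete, here is a minimal sketch, not the paper's implementation. It assumes each rule is represented by a handful of exemplar texts, and it swaps the learned, contrastively trained embeddings described in the abstract for a plain bag-of-words cosine similarity. The rule names, exemplars, `embed`, `cosine`, `predict`, and the `threshold` value are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a learned encoder: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical rules, each described by a few exemplar texts (assumption).
RULES = {
    "insult_rule": ["you are a total idiot", "what an idiot you are"],
    "threat_rule": ["i will hurt you", "we will hurt them"],
}


def predict(text, threshold=0.5):
    """Return (flagged, matching_rule, score) for the closest rule exemplar."""
    query = embed(text)
    best_rule, best_score = None, 0.0
    for rule, exemplars in RULES.items():
        # A rule's score is its best-matching exemplar.
        score = max(cosine(query, embed(e)) for e in exemplars)
        if score > best_score:
            best_rule, best_score = rule, score
    flagged = best_score >= threshold
    # The rule is reported only when the text is flagged, which is what
    # grounds the decision in something a moderator can inspect.
    return flagged, (best_rule if flagged else None), best_score
```

Calling `predict("you are an idiot")` flags the text and names `insult_rule` as the reason, while a text sharing no words with any exemplar is not flagged and returns no rule. In the approach the abstract describes, the similarity would come from learned rule and text representations and not from word overlap.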
