“Despite much attention being paid to characterize and detect discriminatory speech, most work has focused on explicit or overt hate speech, failing to address a more pervasive form based on coded or indirect language. To fill this gap, this work introduces a theoretically-justified taxonomy of implicit hate speech and a benchmark corpus with fine-grained labels for each message and its implication. We present systematic analyses of our dataset using contemporary baselines to detect and explain implicit hate speech, and we discuss key features that challenge existing models.”
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech (arXiv)
Related Posts
Quantifying How Hateful Communities Radicalize Online Users (arXiv)
“We measure members’ usage of hate speech outside the studied community before and after they become active participants. Using Interrupted Time Series (ITS) analysis as a causal inference method, we
The Role of Context in Detecting the Target of Hate Speech (ACL Anthology)
“In this paper, we focus on detecting the target of hate speech in Dutch social media: whether a hateful Facebook comment is directed against migrants or not (i.e., against someone
FRA Report on artificial intelligence and discrimination (The IOI)
“Predictive policing shows how bias can amplify over time, potentially leading to discriminatory policing. If the police only go to one area based on predictions influenced by biased crime records,