Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets (arXiv)

Jun 22, 2025 #Algorithms, #Assorted, #Policies

Hate speech has spread with the growth of online platforms, so scalable detection techniques are needed. These systems, however, rely on human-labeled data, which frequently carries the annotators' biases. Previous studies have noted the problem, but the interplay between annotator characteristics and the characteristics of the hate target has not been thoroughly explored. The current study fills that gap, using a dataset rich in socio-demographic detail on both annotators and targets to show how human biases relate to the target's attributes. The analysis finds that these biases differ in both frequency and intensity. A comparison with persona-based LLMs shows that the models are biased as well, but that their tendencies differ greatly from those of human annotators. These results contribute to a better understanding of annotation bias and can guide the creation of more equitable AI-powered hate speech detection systems.

https://arxiv.org/abs/2410.07991
