Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization (arXiv)

Mar 2, 2025 #Algorithms

The study looks at how well LLMs do in identifying hate speech in a variety of geographical locations and multilingual datasets. The three elements of the novel assessment methodology that the researchers propose are resilience against adversarial text, geography-aware contextual detection, and binary classification. The authors assess Llama2 (13b), Codellama (7b), and DeepSeekCoder (6.7b) using 1,000 comments from five locations. With an F1-score of 52.18% and the greatest binary classification recall of 70.6%, Codellama outperformed DeepSeekCoder in terms of geographic sensitivity, identifying 63 out of 265 sites. 62.5% of manipulated samples were incorrectly identified by Llama2, illustrating the trade-offs between robustness, contextual knowledge, and accuracy. By highlighting important advantages and disadvantages, this study lays the groundwork for the creation of multilingual hate speech detection systems and offers suggestions for further study and use.

https://www.arxiv.org/abs/2502.19612

Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization (arXiv)

Like this:

Leave a Reply Cancel reply

LATEST NEWS

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – December 2025 (I/II)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – November 2025 (I/I)

New on preventhate.org | Policyinstitute.net, 17 November 2025

Meta Oversight Board’s Nascent Standard on Hate Speech: Towards Plural Standard Setting in International Human Rights Law (SSRN)

Coping with Digital Hostility: How Witnessing and Receiving Hate Speech Elicit Divergent Responses (SSRN)

preventhate.org | Policyinstitute.net

Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization (arXiv)

Share this:

Like this:

Leave a Reply Cancel reply

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – December 2025 (I/II)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – November 2025 (I/I)

New on preventhate.org | Policyinstitute.net, 17 November 2025

Meta Oversight Board’s Nascent Standard on Hate Speech: Towards Plural Standard Setting in International Human Rights Law (SSRN)

Coping with Digital Hostility: How Witnessing and Receiving Hate Speech Elicit Divergent Responses (SSRN)