Algorithms – Page 10 – preventhate.org

Sat. Mar 21st, 2026

A survey of textual cyber abuse detection using cutting-edge language models and large language models (arXiv)

21 January 2025

The study offers a thorough examination of the many types of abuse that are common on social media, with an…

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, January 2025 (I/II)

16 January 2025

Browse and read from a list of the 165 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…

Algorithms

The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration (ACL Anthology)

07 January 2025

In order to investigate the prospect of employing machine translation to create a parallel multilingual hate speech dataset, the researchers…

Algorithms

SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes (arXiv)

07 January 2025

Two multimodal hate speech datasets, MHS and MHS-Con, which record fine-grained hostile abstractions in everyday and confusing situations, are curated…

Algorithms Legal Policies Society

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (II/II)

31 December 2024

Browse and read from a list of the 148 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…

Algorithms Policies

Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models (ACL Anthology)

25 December 2024

The multimodal character of digital information makes it even more difficult to moderate hate speech (HS) in the dynamic online…

Algorithms Policies

The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse (arXiv)

25 December 2024

The authors use text embeddings from computational linguistics to develop and evaluate a approach for quantifying the distortions caused by…

Algorithms

Towards Efficient and Explainable Hate Speech Detection via Model Distillation (arXiv)

25 December 2024

To stop hatred and nasty language from spreading online, automatic detection is crucial. By identifying and elucidating hate speech, we…

Algorithms Assorted

ReZG: Retrieval-augmented zero-shot counter narrative generation for hate speech (Neurocomputing)

25 December 2024

In order to produce CNs with a high level of specificity for invisible targets, the authors suggest Retrieval-Augmented Zero-shot Generation…

Algorithms Assorted

Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF (ACL Anthology)

25 December 2024

The paper presents CoARL, a unique framework that models the pragmatic consequences of social biases in hostile remarks, hence improving…

Algorithms Legal Policies Society

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (I/II)

18 December 2024

Browse and read from a list of the 535 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…

Algorithms Legal

Hate Speech According to the Law: An Analysis for Effective Detection (arXiv)

12 December 2024

Hate speech is a problem that is not limited to the internet. Due to the issue’s practical consequences, the majority…

A survey of textual cyber abuse detection using cutting-edge language models and large language models (arXiv)

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, January 2025 (I/II)

The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration (ACL Anthology)

SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes (arXiv)

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (II/II)

Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models (ACL Anthology)

The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse (arXiv)

Towards Efficient and Explainable Hate Speech Detection via Model Distillation (arXiv)

ReZG: Retrieval-augmented zero-shot counter narrative generation for hate speech (Neurocomputing)

Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF (ACL Anthology)

Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (I/II)

Hate Speech According to the Law: An Analysis for Effective Detection (arXiv)

LATEST NEWS

“They’re Not So Separate After All” – Digital and Analog Dimensions of Radicalization (Policyinstitute.net)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – December 2025 (I/II)

Soft Security Resources: Press Articles, Documents, and Recordings on Countering Extremism, Hate Speech, and False Information – November 2025 (I/I)

New on preventhate.org | Policyinstitute.net, 17 November 2025

Meta Oversight Board’s Nascent Standard on Hate Speech: Towards Plural Standard Setting in International Human Rights Law (SSRN)

TAGS

preventhate.org | Policyinstitute.net

Category: Algorithms

TAGS