Algorithms Policies A survey of textual cyber abuse detection using cutting-edge language models and large language models (arXiv) 21 January 2025 The study offers a thorough examination of the many types of abuse that are common on social media, with an…
Algorithms Legal Policies Society Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, January 2025 (I/II) 16 January 2025 Browse and read from a list of the 165 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…
Algorithms The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration (ACL Anthology) 07 January 2025 In order to investigate the prospect of employing machine translation to create a parallel multilingual hate speech dataset, the researchers…
Algorithms SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes (arXiv) 07 January 2025 Two multimodal hate speech datasets, MHS and MHS-Con, which record fine-grained hostile abstractions in everyday and confusing situations, are curated…
Algorithms Legal Policies Society Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (II/II) 31 December 2024 Browse and read from a list of the 148 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…
Algorithms Policies Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models (ACL Anthology) 25 December 2024 The multimodal character of digital information makes it even more difficult to moderate hate speech (HS) in the dynamic online…
Algorithms Policies The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse (arXiv) 25 December 2024 The authors use text embeddings from computational linguistics to develop and evaluate a approach for quantifying the distortions caused by…
Algorithms Towards Efficient and Explainable Hate Speech Detection via Model Distillation (arXiv) 25 December 2024 To stop hatred and nasty language from spreading online, automatic detection is crucial. By identifying and elucidating hate speech, we…
Algorithms Assorted ReZG: Retrieval-augmented zero-shot counter narrative generation for hate speech (Neurocomputing) 25 December 2024 In order to produce CNs with a high level of specificity for invisible targets, the authors suggest Retrieval-Augmented Zero-shot Generation…
Algorithms Assorted Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF (ACL Anthology) 25 December 2024 The paper presents CoARL, a unique framework that models the pragmatic consequences of social biases in hostile remarks, hence improving…
Algorithms Legal Policies Society Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, December 2024 (I/II) 18 December 2024 Browse and read from a list of the 535 most popular LinkedIn resources dealing with soft counter-extremism, countering hate speech,…
Algorithms Legal Hate Speech According to the Law: An Analysis for Effective Detection (arXiv) 12 December 2024 Hate speech is a problem that is not limited to the internet. Due to the issue’s practical consequences, the majority…