Towards generalisable hate speech detection: a review on obstacles and solutions (arXiv)

“With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are, reason why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.”

https://arxiv.org/pdf/2102.08886.pdf

Leave a Reply

Your email address will not be published.

Related Post

Using Transfer-based Language Models to Detect Hateful and Offensive Language Online (Proceedings of the Fourth Workshop on Online Abuse and Harms)Using Transfer-based Language Models to Detect Hateful and Offensive Language Online (Proceedings of the Fourth Workshop on Online Abuse and Harms)

“The results indicate that the attention-based models profoundly confuse hate speech with offensive and normal language. However, the pre-trained models outperform state-of-the-art results in terms of accurately predicting the hateful

%d bloggers like this: