Researchers have developed datasets and machine learning algorithms that treat hate speech classification in text as a multi-label problem. This work presents the first thorough and methodical review of the scientific literature on this new field of study in English (N=46). The authors give a succinct summary of 28 datasets suitable for training multi-label classification models, highlighting notable variation in label set, size, meta-concept, annotation method, and inter-annotator agreement. Their examination of 24 articles that propose suitable classification models further establishes inconsistency in evaluation and a preference for designs based on Recurrent Neural Networks (RNNs) and Bidirectional Encoder Representations from Transformers (BERT). The authors identify four important open issues: limited and sparse datasets, imbalanced training data, dependence on crowdsourcing platforms, and insufficient methodological alignment. From these they develop ten proposals for further study.

https://arxiv.org/abs/2504.08609