The current study presents a comprehensive approach to detecting hate speech in Roman Urdu, a widely used but under-resourced language variant. Recognizing the rise of online hate speech facilitated by anonymity and unrestricted expression on social media, the research expands the Roman Urdu Hate Speech and Offensive Language Detection dataset to 30,955 instances, introducing a new “Racism” category. Using a combination of supervised and unsupervised machine learning, deep learning, and natural language processing techniques—including mBERT, which achieved 92% accuracy—the system effectively identifies abusive, religious, sexist, and racist language patterns. The work contributes to scalable hate speech mitigation in linguistically diverse digital environments. https://dl.acm.org/doi/10.1145/3768571 Share this: Click to print (Opens in new window) Print Click to share on Facebook (Opens in new window) Facebook Click to share on LinkedIn (Opens in new window) LinkedIn Click to share on Reddit (Opens in new window) Reddit Click to share on WhatsApp (Opens in new window) WhatsApp Click to share on Bluesky (Opens in new window) Bluesky Click to email a link to a friend (Opens in new window) Email Like this:Like Loading... Post navigation Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, September 2025 (I/II) AI-driven detection of hate speech on social media: a case study in the French language (HomeCluster Computing)