The current study presents a comprehensive approach to detecting hate speech in Roman Urdu, a widely used but under-resourced language variant. Recognizing the rise of online hate speech facilitated by anonymity and unrestricted expression on social media, the research expands the Roman Urdu Hate Speech and Offensive Language Detection dataset to 30,955 instances, introducing a new “Racism” category. Using a combination of supervised and unsupervised machine learning, deep learning, and natural language processing techniques—including mBERT, which achieved 92% accuracy—the system effectively identifies abusive, religious, sexist, and racist language patterns. The work contributes to scalable hate speech mitigation in linguistically diverse digital environments. https://dl.acm.org/doi/10.1145/3768571 Share this: Print (Opens in new window) Print Share on Facebook (Opens in new window) Facebook Share on LinkedIn (Opens in new window) LinkedIn Share on Reddit (Opens in new window) Reddit Share on WhatsApp (Opens in new window) WhatsApp Share on Bluesky (Opens in new window) Bluesky Email a link to a friend (Opens in new window) Email Like this:Like Loading... Post navigation Two Weeks in Soft Security: Free Resources on Countering Extremism, Hate, and Disinformation, September 2025 (I/II) AI-driven detection of hate speech on social media: a case study in the French language (HomeCluster Computing)