Our research introduces a cross-platform hate speech detection model capable of being trained on one platform’s data and generalizing to multiple unseen platforms. To achieve good generalizability across platforms, one way is to disentangle the input representations into invariant and platform-dependent features. We also argue that learning causal relationships, which remain constant across diverse environments, can significantly aid in understanding invariant representations in hate speech. By disentangling input into platform-dependent features (useful for predicting hate targets) and platform-independent features (used to predict the presence of hate), we learn invariant representations resistant to distribution shifts. These features are then used to predict hate speech across unseen platforms.https://paperswithcode.com/paper/causality-guided-disentanglement-for-crossShare this:FacebookXLike this:Like Loading... Post navigation Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation (catalycex) What Did You Learn To Hate? A Topic-Oriented Analysis of Generalization in Hate Speech Detection (ACL Anthology)