Multilingual Hate Speech Xlm Roberta
This is a fine-tuned XLM-RoBERTa model for multilingual hate speech detection, specifically trained on English and Serbian text. The model classifies text into 8 categories:
- Race: Racial discrimination and slurs - Sexual Orientation: Homophobic content, LGBTQ+ discrimination - Gender: Sexist content, misogyny, gender-based harassment - Physical Appearance: Body shaming, lookism, appearance-based harassment - Religion: Religious discrimination, islamophobia, antisemitism - Class: Classist content, economic discrimination - Disability: Ableist content, discrimination against disabled people - Appropriate: Non-hateful, normal conversation
- English: Comprehensive hate speech detection - Serbian: Native Serbian language support (Cyrillic and Latin scripts)
The model was fine-tuned on multilingual hate speech datasets including: - English hate speech datasets - Serbian hate speech datasets - Augmented examples for better multilingual performance
- Accuracy: High-confidence predictions with detailed explanations - Languages: English and Serbian with cross-lingual capabilities - Categories: 8-class classification including appropriate content
This model is designed for research and educational purposes. Results should be interpreted carefully and human judgment should always be applied for critical decisions. The system is designed to assist, not replace, human moderation.
Try the interactive demo: Multilingual Hate Speech Detector Space