XLM-RoBERTa Base Language Detection

Downloads (Hugging Face): 2.2M
360
Context: 514 (small context)
License: MIT
Updated: 11/3/2025 by papluca

Detects 20 languages: Arabic, Bulgarian, German, Greek, English, Spanish, French, Hindi, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swahili, Thai, Turkish, Urdu, Vietnamese, and Chinese. The model is based on XLM-RoBERTa and is licensed under MIT. It was trained on the papluca/language-identification dataset and evaluated with accuracy and F1 score.
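A minimal usage sketch, assuming the model is published on the Hugging Face Hub as `papluca/xlm-roberta-base-language-detection` (inferred from the title and author shown here) and that it returns ISO 639-1 codes such as `it` or `ja` as labels. Both are assumptions. The helper names `describe` and `detect_languages` are hypothetical and used only for illustration.

```python
from typing import Any, Dict, List

# Assumed Hub identifier, inferred from the model title and author name.
MODEL_ID = "papluca/xlm-roberta-base-language-detection"

# Assumed label set: ISO 639-1 codes for the 20 languages listed above.
LANGUAGE_NAMES: Dict[str, str] = {
    "ar": "Arabic", "bg": "Bulgarian", "de": "German", "el": "Greek",
    "en": "English", "es": "Spanish", "fr": "French", "hi": "Hindi",
    "it": "Italian", "ja": "Japanese", "nl": "Dutch", "pl": "Polish",
    "pt": "Portuguese", "ru": "Russian", "sw": "Swahili", "th": "Thai",
    "tr": "Turkish", "ur": "Urdu", "vi": "Vietnamese", "zh": "Chinese",
}


def describe(prediction: Dict[str, Any]) -> str:
    """Turn one {'label': ..., 'score': ...} prediction into readable text."""
    label = str(prediction["label"])
    score = float(prediction["score"])
    # Fall back to the raw label if it is not one of the assumed codes.
    return f"{LANGUAGE_NAMES.get(label, label)} ({score:.2f})"


def detect_languages(texts: List[str]) -> List[str]:
    """Classify each text and return one readable result per input."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import pipeline

    # Downloads the model weights on first use.
    classifier = pipeline("text-classification", model=MODEL_ID)
    # truncation=True keeps long inputs within the model's context limit.
    predictions = classifier(texts, truncation=True)
    return [describe(p) for p in predictions]
```

Calling `detect_languages(["Brevity is the soul of wit."])` should return one string per input text. The `transformers` package and a backend such as PyTorch need to be installed for that call to work.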

Other
OTHER
arXiv: 1911.02116

Quick Info

Released: 3/2/2022
Framework: OTHER

Resources