XLM-RoBERTa Base Language Detection
Downloads
Hugging Face
2.2M
HF Likes
Hugging Face
360
Context
Small context
514
License
MIT
Updated
11/3/2025
by
papluca
Detects the language of input text across 20 languages: Arabic, Bulgarian, German, Greek, English, Spanish, French, Hindi, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Swahili, Thai, Turkish, Urdu, Vietnamese, and Chinese. The model is based on XLM-RoBERTa (base size) and is licensed under MIT. It was trained on the papluca/language-identification dataset and evaluated using metrics such as accuracy and F1 score.
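As a rough illustration of how a model like this might be called, here is a minimal sketch using the Hugging Face `transformers` text-classification pipeline. It makes several assumptions not stated in the listing: that the repository ID is `papluca/xlm-roberta-base-language-detection`, that the model's labels are ISO 639-1 codes for the languages listed above, and that `transformers` with a backend such as PyTorch is installed. The helper names `top_language` and `detect_languages` are hypothetical and exist only for this example.

```python
from typing import Dict, List, Tuple

# Assumption: repository ID inferred from the author and model name in the listing.
MODEL_ID = "papluca/xlm-roberta-base-language-detection"

# Assumption: the model's labels are ISO 639-1 codes for the 20 listed languages.
LANGUAGE_NAMES: Dict[str, str] = {
    "ar": "Arabic", "bg": "Bulgarian", "de": "German", "el": "Greek",
    "en": "English", "es": "Spanish", "fr": "French", "hi": "Hindi",
    "it": "Italian", "ja": "Japanese", "nl": "Dutch", "pl": "Polish",
    "pt": "Portuguese", "ru": "Russian", "sw": "Swahili", "th": "Thai",
    "tr": "Turkish", "ur": "Urdu", "vi": "Vietnamese", "zh": "Chinese",
}


def top_language(scores: List[dict]) -> Tuple[str, float]:
    """Return (language name, score) for the highest-scoring label."""
    best = max(scores, key=lambda item: item["score"])
    # Fall back to the raw label if it is not one of the assumed codes.
    return LANGUAGE_NAMES.get(best["label"], best["label"]), best["score"]


def detect_languages(texts: List[str]) -> List[Tuple[str, float]]:
    """Classify each text; the model is downloaded on first use."""
    # Imported lazily so top_language works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=MODEL_ID)
    # top_k=None asks for a score per label; truncation=True clips inputs
    # that are longer than the model's context window.
    all_scores = classifier(texts, top_k=None, truncation=True)
    return [top_language(scores) for scores in all_scores]
```

Calling `detect_languages` with a list of strings should return one `(language, score)` pair per input. Keeping the label lookup in a separate pure function means it can be checked without downloading the model.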
Other
OTHER
arXiv: 1911.02116
Quick Info
Released
3/2/2022
Framework
OTHER