metadata
library_name: transformers
language:
- en
- fr
- it
- es
- ru
- uk
- tt
- ar
- hi
- ja
- zh
- he
- am
- de
license: openrail++
datasets:
- textdetox/multilingual_toxicity_dataset
metrics:
- f1
base_model:
- FacebookAI/xlm-roberta-large
Multilingual Toxicity Classifier for 15 Languages (2025)
This is an instance of xlm-roberta-large that was fine-tuned on binary toxicity classification task based on our updated (2025) dataset textdetox/multilingual_toxicity_dataset.
Now, the models covers 15 languages from various language families:
- English (en); F1:
- Russian (ru); F1:
- Ukrainian (uk); F1:
- German (de); F1:
- Spanish (es); F1:
- Arabic (ar); F1:
- Amharic (am); F1:
- Hindi (hi); F1:
- Chinese (zh); F1:
- Italian (it); F1:
- French (fr); F1:
- Hinglish (hin); F1:
- Hebrew (he); F1:
- Japanese (ja); F1:
- Tatar (tt); F1:
Citation
The model is prepared for TextDetox 2025 Shared Task evaluation.
Citation TBD soon.