library_name: transformers
language:
  - en
  - fr
  - it
  - es
  - ru
  - uk
  - tt
  - ar
  - hi
  - ja
  - zh
  - he
  - am
  - de
license: openrail++
datasets:
  - textdetox/multilingual_toxicity_dataset
metrics:
  - f1
base_model:
  - FacebookAI/xlm-roberta-large

Multilingual Toxicity Classifier for 15 Languages (2025)

This is an instance of xlm-roberta-large fine-tuned for binary toxicity classification on our updated (2025) dataset, textdetox/multilingual_toxicity_dataset.

The model now covers 15 languages from various language families:

  • English (en); F1:
  • Russian (ru); F1:
  • Ukrainian (uk); F1:
  • German (de); F1:
  • Spanish (es); F1:
  • Arabic (ar); F1:
  • Amharic (am); F1:
  • Hindi (hi); F1:
  • Chinese (zh); F1:
  • Italian (it); F1:
  • French (fr); F1:
  • Hinglish (hin); F1:
  • Hebrew (he); F1:
  • Japanese (ja); F1:
  • Tatar (tt); F1:
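
Usage

The following is a minimal sketch of how the classifier might be called through the transformers library; it is not an official snippet from the authors. `MODEL_ID` is a hypothetical placeholder that must be replaced with this model's actual Hub repository id, and the helper names `softmax` and `classify` are introduced here for illustration only. The sketch reads label names from the model config rather than assuming which output index means "toxic".

```python
import math

# Hypothetical placeholder: replace with the actual Hub repository id of this model.
MODEL_ID = "<hub-repo-id-of-this-model>"


def softmax(logits):
    """Convert a list of raw logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(texts, model_id=MODEL_ID):
    """Return a (label, probability) pair for each input text."""
    # Imported lazily so that softmax() can be used without these packages.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Tokenize the whole batch; pad to the longest text and truncate overlong ones.
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    with torch.no_grad():
        logits = model(**batch).logits

    results = []
    for row in logits.tolist():
        probs = softmax(row)
        best = max(range(len(probs)), key=probs.__getitem__)
        # Label names come from the model config, not from a hard-coded mapping.
        results.append((model.config.id2label[best], probs[best]))
    return results
```

Once `MODEL_ID` is set, a call such as `classify(["Have a nice day!", "Que tengas un buen día"])` should return one label and its probability per input text. Texts in any of the languages listed above can be mixed in a single batch, since the underlying xlm-roberta-large tokenizer is shared across languages.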

Citation

The model was prepared for the TextDetox 2025 Shared Task evaluation.

The citation will be added soon.