textdetox/xlmr-large-toxicity-classifier-v2

Text Classification


dardem committed on Mar 20

Commit 023e4e8 · verified · 1 Parent(s): dff4c4f

Update README.md

Files changed (1)

README.md +17 -15

@@ -29,22 +29,24 @@ base_model:
 This is an instance of [xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) that was fine-tuned on binary toxicity classification task based on our updated (2025) dataset [textdetox/multilingual_toxicity_dataset](https://huggingface.co/datasets/textdetox/multilingual_toxicity_dataset).
 Now, the models covers 15 languages from various language families:
-* English (en); F1: 0.9225
-* Russian (ru); F1: 0.9525
-* Ukrainian (uk); F1: 0.96
-* German (de); F1: 0.7325
-* Spanish (es); F1: 0.7125
-* Arabic (ar); F1: 0.6625
-* Amharic (am); F1: 0.5575
-* Hindi (hi); F1: 0.9725
-* Chinese (zh); F1: 0.9175
-* Italian (it); F1: 0.5864
-* French (fr); F1: 0.9235
-* Hinglish (hin); F1: 0.61
-* Hebrew (he); F1: 0.8775
-* Japanese (ja); F1: 0.8773
-* Tatar (tt); F1: 0.5744
+| Language  | Code | F1 Score |
+|-----------|------|---------|
+| English   | en   | 0.9225  |
+| Russian   | ru   | 0.9525  |
+| Ukrainian | uk   | 0.96    |
+| German    | de   | 0.7325  |
+| Spanish   | es   | 0.7125  |
+| Arabic    | ar   | 0.6625  |
+| Amharic   | am   | 0.5575  |
+| Hindi     | hi   | 0.9725  |
+| Chinese   | zh   | 0.9175  |
+| Italian   | it   | 0.5864  |
+| French    | fr   | 0.9235  |
+| Hinglish  | hin  | 0.61    |
+| Hebrew    | he   | 0.8775  |
+| Japanese  | ja   | 0.8773  |
+| Tatar     | tt   | 0.5744  |
 ## Citation
 The model is prepared for [TextDetox 2025 Shared Task](https://pan.webis.de/clef25/pan25-web/text-detoxification.html) evaluation.
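
The list-to-table change in this hunk is mechanical, so it could be scripted rather than edited by hand. The following is a minimal sketch, assuming every bullet follows the `* Name (code); F1: value` shape seen in the removed lines. The names `BULLET` and `bullets_to_table` are hypothetical and not part of the repository, and the column padding is computed from the cell widths, so it may differ slightly from the committed table.

```python
import re

# One bullet of the old README format, e.g. "* English (en); F1: 0.9225".
# The pattern is an assumption based on the removed lines in this diff.
BULLET = re.compile(
    r"^\*\s+(?P<name>.+?)\s+\((?P<code>[a-z]+)\);\s+F1:\s+(?P<f1>[0-9.]+)$"
)


def bullets_to_table(lines):
    """Convert bullet lines into the lines of a Markdown table (hypothetical helper)."""
    rows = []
    for line in lines:
        match = BULLET.match(line.strip())
        if match is None:
            raise ValueError(f"unrecognized bullet: {line!r}")
        # Keep the score as text so values such as "0.96" are not reformatted.
        rows.append((match["name"], match["code"], match["f1"]))

    header = ("Language", "Code", "F1 Score")
    # Column width is the widest cell in that column, header included.
    widths = [max(len(cell) for cell in column) for column in zip(header, *rows)]

    def fmt(cells):
        padded = (cell.ljust(width) for cell, width in zip(cells, widths))
        return "| " + " | ".join(padded) + " |"

    separator = "|" + "|".join("-" * (width + 2) for width in widths) + "|"
    return [fmt(header), separator] + [fmt(row) for row in rows]


# Three of the removed bullets, copied from the diff above.
old = [
    "* English (en); F1: 0.9225",
    "* Ukrainian (uk); F1: 0.96",
    "* Hinglish (hin); F1: 0.61",
]
table = bullets_to_table(old)
print("\n".join(table))
```

Keeping the scores as strings avoids any rounding or re-padding of the reported F1 values during the conversion.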