Update README.md
Browse files
README.md
CHANGED
@@ -21,4 +21,43 @@ language:
|
|
21 |
- en
|
22 |
- fr
|
23 |
- es
|
24 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
21 |
- en
|
22 |
- fr
|
23 |
- es
|
24 |
+
---
|
25 |
+
|
26 |
+
# German, English, French and Spanish Language Detector
|
27 |
+
|
28 |
+
The ImranzamanML/GEFS-language-detector is a fined tuned model by using the dataset of papluca [Language Identification](https://huggingface.co/datasets/papluca/language-identification#additional-information) and the base model [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) .
|
29 |
+
|
30 |
+
## Supported languages
|
31 |
+
Currently this model support 4 languages for [Theum AG](https://theum.com/en/index.htm?t=)
|
32 |
+

|
33 |
+
|
34 |
+
Following languages supported by the model:
|
35 |
+
- german (de)
|
36 |
+
- english (en)
|
37 |
+
- spanish (es)
|
38 |
+
- french (fr)
|
39 |
+
|
40 |
+
## Training Results
|
41 |
+
|
42 |
+
Epoch Training Loss Validation Loss
|
43 |
+
1 0.002600 0.000148
|
44 |
+
2 0.001000 0.000015
|
45 |
+
3 0.000000 0.000011
|
46 |
+
4 0.001800 0.000009
|
47 |
+
5 0.002700 0.000016
|
48 |
+
6 0.001600 0.000012
|
49 |
+
7 0.001300 0.000009
|
50 |
+
8 0.001200 0.000008
|
51 |
+
9 0.000900 0.000007
|
52 |
+
10 0.000900 0.000007
|
53 |
+
|
54 |
+
|
55 |
+
## Testing Results
|
56 |
+
|
57 |
+
Language Precision Recall F1 Accuracy
|
58 |
+
de 0.9997 0.9998 0.9998 0.9999
|
59 |
+
en 1.0000 1.0000 1.0000 1.0000
|
60 |
+
fr 0.9995 0.9996 0.9996 0.9996
|
61 |
+
es 0.9994 0.9996 0.9995 0.9996
|
62 |
+
|
63 |
+
|