End of training
Browse files- README.md +9 -9
- pytorch_model.bin +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -22,7 +22,7 @@ model-index:
|
|
22 |
metrics:
|
23 |
- name: Sacrebleu
|
24 |
type: sacrebleu
|
25 |
-
value: 38.
|
26 |
---
|
27 |
|
28 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
32 |
|
33 |
This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the nusatranslation_mt dataset.
|
34 |
It achieves the following results on the evaluation set:
|
35 |
-
- Loss: 1.
|
36 |
-
- Sacrebleu: 38.
|
37 |
-
- Gen Len: 37.
|
38 |
|
39 |
## Model description
|
40 |
|
@@ -66,11 +66,11 @@ The following hyperparameters were used during training:
|
|
66 |
|
67 |
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
|
68 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
|
69 |
-
| 1.
|
70 |
-
| 1.
|
71 |
-
| 0.
|
72 |
-
| 0.
|
73 |
-
| 0.
|
74 |
|
75 |
|
76 |
### Framework versions
|
|
|
22 |
metrics:
|
23 |
- name: Sacrebleu
|
24 |
type: sacrebleu
|
25 |
+
value: 38.1114
|
26 |
---
|
27 |
|
28 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
32 |
|
33 |
This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the nusatranslation_mt dataset.
|
34 |
It achieves the following results on the evaluation set:
|
35 |
+
- Loss: 1.1353
|
36 |
+
- Sacrebleu: 38.1114
|
37 |
+
- Gen Len: 37.3265
|
38 |
|
39 |
## Model description
|
40 |
|
|
|
66 |
|
67 |
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
|
68 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
|
69 |
+
| 1.36 | 1.0 | 1650 | 1.2432 | 34.4418 | 37.63 |
|
70 |
+
| 1.0823 | 2.0 | 3300 | 1.1404 | 36.8766 | 37.312 |
|
71 |
+
| 0.9761 | 3.0 | 4950 | 1.1281 | 37.9027 | 37.2895 |
|
72 |
+
| 0.8241 | 4.0 | 6600 | 1.1267 | 38.1879 | 37.1395 |
|
73 |
+
| 0.7579 | 5.0 | 8250 | 1.1353 | 38.1114 | 37.3265 |
|
74 |
|
75 |
|
76 |
### Framework versions
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 2460469182
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b8443084b90404cc61bd92a3e5c4aca76dd7bd00a53af5b7b3d81d5b50a63920
|
3 |
size 2460469182
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4728
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d1f01947661adcc5cdd4ed067af6a1acd166eb4229710734ab2ae5e1d3fa4523
|
3 |
size 4728
|