kepinsam committed on
Commit
0563ff0
1 Parent(s): 7d106a9

End of training

README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Sacrebleu
  type: sacrebleu
- value: 37.5332
+ value: 37.7411
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the nusatranslation_mt dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.1380
- - Sacrebleu: 37.5332
- - Gen Len: 37.17
+ - Loss: 1.1318
+ - Sacrebleu: 37.7411
+ - Gen Len: 37.2965
 
  ## Model description
 
@@ -67,16 +67,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
- | 4.6139 | 1.0 | 825 | 1.2915 | 32.7445 | 37.5685 |
- | 1.1761 | 2.0 | 1650 | 1.1701 | 35.9991 | 37.3645 |
- | 1.0383 | 3.0 | 2475 | 1.1361 | 36.9321 | 37.035 |
- | 0.9094 | 4.0 | 3300 | 1.1333 | 37.4774 | 37.039 |
- | 0.8158 | 5.0 | 4125 | 1.1380 | 37.5332 | 37.17 |
+ | 3.3872 | 1.0 | 825 | 1.2738 | 33.3267 | 37.8435 |
+ | 1.1626 | 2.0 | 1650 | 1.1593 | 36.2145 | 37.436 |
+ | 0.9846 | 3.0 | 2475 | 1.1289 | 37.2589 | 37.0315 |
+ | 0.8883 | 4.0 | 3300 | 1.1254 | 37.8219 | 37.216 |
+ | 0.832 | 5.0 | 4125 | 1.1318 | 37.7411 | 37.2965 |
 
 
  ### Framework versions
 
  - Transformers 4.41.2
  - Pytorch 2.3.0+cu121
- - Datasets 2.13.1
+ - Datasets 2.14.6
  - Tokenizers 0.19.1
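The updated results table can be checked with a few lines of Python. The sketch below is illustrative only: the rows are copied by hand from the new README table, and the variable names are assumptions, not part of the training code. It shows that the reported final checkpoint (epoch 5) is not the best epoch by either SacreBLEU or validation loss.

```python
# Per-epoch evaluation rows copied from the updated README table:
# (epoch, validation_loss, sacrebleu)
rows = [
    (1, 1.2738, 33.3267),
    (2, 1.1593, 36.2145),
    (3, 1.1289, 37.2589),
    (4, 1.1254, 37.8219),
    (5, 1.1318, 37.7411),
]

best_bleu = max(rows, key=lambda r: r[2])   # row with the highest SacreBLEU
best_loss = min(rows, key=lambda r: r[1])   # row with the lowest validation loss
final = rows[-1]                            # the checkpoint reported in the model card
gap = round(best_bleu[2] - final[2], 4)     # how far the final epoch trails the best

print(f"best SacreBLEU: epoch {best_bleu[0]} ({best_bleu[2]})")
print(f"lowest validation loss: epoch {best_loss[0]} ({best_loss[1]})")
print(f"final checkpoint trails best SacreBLEU by {gap}")
```

Whether the epoch 4 checkpoint was kept is not stated in this commit. The comparison only reads the numbers in the table.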
generation_config.json CHANGED
@@ -1,6 +1,8 @@
  {
  "bos_token_id": 0,
- "max_length": 64,
- "push_to_hub": true,
+ "decoder_start_token_id": 2,
+ "eos_token_id": 2,
+ "max_length": 200,
+ "pad_token_id": 1,
  "transformers_version": "4.41.2"
  }
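To summarize what changed in generation_config.json, the two versions can be compared as plain dictionaries. This is a minimal sketch using only the standard library. The JSON strings are transcribed from the diff above, and `old_cfg`, `new_cfg`, `added`, `removed`, and `changed` are names chosen for illustration.

```python
import json

# Both versions of generation_config.json, transcribed from the diff.
old_cfg = json.loads("""
{"bos_token_id": 0, "max_length": 64, "push_to_hub": true,
 "transformers_version": "4.41.2"}
""")
new_cfg = json.loads("""
{"bos_token_id": 0, "decoder_start_token_id": 2, "eos_token_id": 2,
 "max_length": 200, "pad_token_id": 1, "transformers_version": "4.41.2"}
""")

added = sorted(new_cfg.keys() - old_cfg.keys())      # keys only in the new file
removed = sorted(old_cfg.keys() - new_cfg.keys())    # keys only in the old file
changed = {                                          # shared keys with new values
    key: (old_cfg[key], new_cfg[key])
    for key in old_cfg.keys() & new_cfg.keys()
    if old_cfg[key] != new_cfg[key]
}

print("added:", added)
print("removed:", removed)
print("changed:", changed)
```

The comparison shows that the commit drops `push_to_hub`, raises `max_length` from 64 to 200, and adds explicit decoder start, end-of-sequence, and padding token ids.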
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:55d5558de6e978fdc617b20589ad042a4b2942177a733fa450833086b90400e8
+ oid sha256:e75d3616fbd087a692a97ddfe9f4a2625851334c3f9e7d2d77354a2b0625d45c
  size 2460354912
runs/Jul14_08-59-13_fa0f6b075495/events.out.tfevents.1720947556.fa0f6b075495.195.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0e3014ff2cff22518d27597819c667e934ea2855855046ed0ce6d272577816e5
- size 8142
+ oid sha256:3ba0d2e1ffa3eb200fcd2023f01ab18552b02b1b3854dfbcf44e8c8b12a47ef4
+ size 8496
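The model.safetensors and events.out.tfevents entries are Git LFS pointer files, so the diff shows only their `version`, `oid`, and `size` lines, not the binary contents. The sketch below parses such pointers to compare sizes. `parse_lfs_pointer` is a hypothetical helper written for this example and is not part of any Git LFS tooling. The pointer text is copied from the TensorBoard event file diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    fields["size"] = int(fields["size"])  # size is recorded in bytes
    return fields


# Old and new pointers for the TensorBoard event file, copied from the diff.
old_log = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:0e3014ff2cff22518d27597819c667e934ea2855855046ed0ce6d272577816e5\n"
    "size 8142\n"
)
new_log = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:3ba0d2e1ffa3eb200fcd2023f01ab18552b02b1b3854dfbcf44e8c8b12a47ef4\n"
    "size 8496\n"
)

growth = new_log["size"] - old_log["size"]          # bytes added to the log
content_changed = new_log["oid"] != old_log["oid"]  # a new hash means new content

print(f"event log grew by {growth} bytes; content changed: {content_changed}")
```

The same comparison applied to model.safetensors would report an unchanged size (2460354912 bytes) with a different `oid`, meaning the weights changed while the file size did not.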