kalese committed on
Commit e36d861
Parent: cfec945

End of training

README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Bleu
  type: bleu
- value: 6.3359
+ value: 7.1794
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-ro](https://huggingface.co/Helsinki-NLP/opus-mt-en-ro) on the arrow dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.6318
- - Bleu: 6.3359
- - Gen Len: 58.0829
+ - Loss: 1.5644
+ - Bleu: 7.1794
+ - Gen Len: 60.0222
 
  ## Model description
 
@@ -54,8 +54,8 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 128
- - eval_batch_size: 128
+ - train_batch_size: 96
+ - eval_batch_size: 96
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -65,16 +65,16 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
- | No log | 1.0 | 140 | 3.1399 | 0.0997 | 69.5849 |
- | No log | 2.0 | 280 | 2.2017 | 1.6584 | 62.2953 |
- | No log | 3.0 | 420 | 1.9380 | 3.61 | 59.1948 |
- | 3.0959 | 4.0 | 560 | 1.8188 | 4.5827 | 57.0106 |
- | 3.0959 | 5.0 | 700 | 1.7496 | 5.1558 | 56.731 |
- | 3.0959 | 6.0 | 840 | 1.7012 | 5.6941 | 58.2515 |
- | 3.0959 | 7.0 | 980 | 1.6683 | 5.964 | 57.0337 |
- | 1.851 | 8.0 | 1120 | 1.6473 | 6.1539 | 57.0685 |
- | 1.851 | 9.0 | 1260 | 1.6357 | 6.3233 | 57.5607 |
- | 1.851 | 10.0 | 1400 | 1.6318 | 6.3359 | 58.0829 |
+ | No log | 1.0 | 186 | 2.6868 | 0.299 | 76.178 |
+ | No log | 2.0 | 372 | 2.0190 | 2.2444 | 67.167 |
+ | 3.115 | 3.0 | 558 | 1.8364 | 4.5959 | 59.7357 |
+ | 3.115 | 4.0 | 744 | 1.7372 | 5.1827 | 61.7218 |
+ | 3.115 | 5.0 | 930 | 1.6732 | 5.8295 | 59.7346 |
+ | 1.8706 | 6.0 | 1116 | 1.6301 | 6.4389 | 60.9085 |
+ | 1.8706 | 7.0 | 1302 | 1.6002 | 6.6498 | 60.4191 |
+ | 1.8706 | 8.0 | 1488 | 1.5792 | 6.8315 | 60.2721 |
+ | 1.7133 | 9.0 | 1674 | 1.5680 | 7.1239 | 60.5609 |
+ | 1.7133 | 10.0 | 1860 | 1.5644 | 7.1794 | 60.0222 |
 
 
  ### Framework versions
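
Note on the step counts in the README diff: the steps per epoch went from 140 (train_batch_size 128) to 186 (train_batch_size 96), which is what one would expect if both runs used the same training set. The sketch below bounds the training-set size implied by each run and intersects the two ranges. It rests on assumptions the visible hunks do not confirm: a single device, no gradient accumulation, and a kept final partial batch, so that steps per epoch equals ceil(N / batch_size). The helper name `dataset_size_bounds` is hypothetical.

```python
def dataset_size_bounds(steps_per_epoch, batch_size):
    """Inclusive (low, high) range of example counts N for which
    ceil(N / batch_size) == steps_per_epoch."""
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high

# (steps per epoch, train_batch_size) read from the old and new training tables.
old_run = dataset_size_bounds(140, 128)
new_run = dataset_size_bounds(186, 96)

# If both runs saw the same training set, N must lie in both ranges.
consistent = (max(old_run[0], new_run[0]), min(old_run[1], new_run[1]))

print("old run:", old_run)
print("new run:", new_run)
print("consistent range:", consistent)
```

Under those assumptions the two ranges overlap, so the change in step count is explained by the batch size alone rather than by a different dataset.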
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:64f3e3a7f5dec57f9dafad6451243f53038fb52575fd267e164b6f47921defc1
+ oid sha256:9beb3ba4954c12b0a73154b2ef5346d93e1e65385af77f804193422991c1ec09
  size 298765276
runs/Mar19_10-24-03_43de2ec8c46d/events.out.tfevents.1710843844.43de2ec8c46d.1689.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5c8f529640e898708e3aa6823987784aebdaf5567a5ffa02b62ad21086ae1e04
- size 9042
+ oid sha256:9dae2fb146582da664e48884ab34b1c4d6d1a17028fe70e195819cdd2e65a131
+ size 10136
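
Note on the binary files: `model.safetensors` and the tfevents log are stored as Git LFS pointer files, each holding three `key value` lines (`version`, `oid`, `size`). As a minimal sketch of how to read them, the snippet below parses the old and new `model.safetensors` pointers from this commit and compares them. The function name `parse_lfs_pointer` is hypothetical and only handles the three keys shown in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file (one 'key value' pair per line) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }

old_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:64f3e3a7f5dec57f9dafad6451243f53038fb52575fd267e164b6f47921defc1
size 298765276
""")

new_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:9beb3ba4954c12b0a73154b2ef5346d93e1e65385af77f804193422991c1ec09
size 298765276
""")

same_size = old_ptr["size"] == new_ptr["size"]
content_changed = old_ptr["digest"] != new_ptr["digest"]
print("same size:", same_size, "| content changed:", content_changed)
```

The identical size with a different digest fits a retrained checkpoint of the same architecture: the tensor shapes are unchanged while the weight values differ.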