meoo225 commited on
Commit
135beb5
·
verified ·
1 Parent(s): d3bee21

End of training

Browse files
README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.4030
20
- - Bleu Score: 45.1405
21
- - Gen Len: 16.8196
22
 
23
  ## Model description
24
 
@@ -43,15 +43,16 @@ The following hyperparameters were used during training:
43
  - seed: 42
44
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
- - num_epochs: 3
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
- | 2.8374 | 1.0 | 838 | 0.5595 | 41.3763 | 16.7802 |
53
- | 0.7564 | 2.0 | 1676 | 0.4503 | 44.4878 | 16.8124 |
54
- | 0.6103 | 3.0 | 2514 | 0.4030 | 45.1405 | 16.8196 |
 
55
 
56
 
57
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.3417
20
+ - Bleu Score: 47.0526
21
+ - Gen Len: 16.8315
22
 
23
  ## Model description
24
 
 
43
  - seed: 42
44
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
+ - num_epochs: 4
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
+ | 2.798 | 1.0 | 838 | 0.5495 | 41.8683 | 16.7766 |
53
+ | 0.7216 | 2.0 | 1676 | 0.4311 | 44.9002 | 16.8148 |
54
+ | 0.5551 | 3.0 | 2514 | 0.3565 | 46.5247 | 16.816 |
55
+ | 0.4951 | 4.0 | 3352 | 0.3417 | 47.0526 | 16.8315 |
56
 
57
 
58
  ### Framework versions
logs/events.out.tfevents.1731690205.45659cea2274.248.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fc44d63860e165525fc82c3272898c8f25ad36ede1026bd96cd5aac123a43fe3
3
- size 7073
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ab6c4386a3364b586bf133d0bdd1f3a535e6ff19764260121a98bcc970ba7828
3
+ size 8014