Zelyanoth committed on
Commit
227e887
1 Parent(s): e1c4a07

End of training

Files changed (1)
  1. README.md +8 -9
README.md CHANGED
@@ -20,9 +20,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/madlad400-3b-mt](https://huggingface.co/google/madlad400-3b-mt) on the generator dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.2698
- - Bleu: 0.217
- - Gen Len: 9.6092
 
  ## Model description
 
@@ -43,20 +43,19 @@ More information needed
  The following hyperparameters were used during training:
  - learning_rate: 0.0003
  - train_batch_size: 18
- - eval_batch_size: 18
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
- | 3.3025 | 1.0 | 2528 | 5.2043 | 0.2814 | 8.9524 |
- | 3.1868 | 2.0 | 5056 | 5.2746 | 0.1393 | 8.8278 |
- | 3.0907 | 3.0 | 7584 | 5.1989 | 0.2607 | 9.6087 |
- | 3.0409 | 4.0 | 10112 | 5.2698 | 0.217 | 9.6092 |
 
 
  ### Framework versions
 
 
  This model is a fine-tuned version of [google/madlad400-3b-mt](https://huggingface.co/google/madlad400-3b-mt) on the generator dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 4.4321
+ - Bleu: 3.9461
+ - Gen Len: 7.2243
 
  ## Model description
 
 
  The following hyperparameters were used during training:
  - learning_rate: 0.0003
  - train_batch_size: 18
+ - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 3
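
As a rough illustration of what `lr_scheduler_type: linear` implies for the values listed above, the sketch below computes the learning rate at a given optimizer step. It assumes no warmup, since the card lists none, and takes the total of 14991 steps from the final row of the results table. `linear_lr` is a hypothetical helper written for this note, not a function from any library.

```python
def linear_lr(step: int,
              base_lr: float = 3e-4,      # learning_rate from the card
              total_steps: int = 14991,   # final step in the results table
              warmup_steps: int = 0) -> float:  # assumption: no warmup
    """Learning rate at `step`: optional linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at the start and at each epoch boundary.
    for step in (0, 4997, 9994, 14991):
        print(step, linear_lr(step))
```

Under these assumptions the rate starts at the configured 0.0003, falls to about two thirds of that after the first epoch, and reaches zero at the last step.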
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
+ | 2.1442 | 1.0 | 4997 | 4.4262 | 3.0781 | 7.4202 |
+ | 2.0066 | 2.0 | 9994 | 4.4181 | 3.8977 | 7.2596 |
+ | 1.948 | 3.0 | 14991 | 4.4321 | 3.9461 | 7.2243 |
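
The rows above can also be compared programmatically. The following sketch transcribes the three added rows into tuples and picks the epoch with the lowest validation loss and the one with the highest BLEU. The variable names are illustrative only.

```python
# (training_loss, epoch, step, validation_loss, bleu, gen_len), copied from the table
rows = [
    (2.1442, 1.0, 4997, 4.4262, 3.0781, 7.4202),
    (2.0066, 2.0, 9994, 4.4181, 3.8977, 7.2596),
    (1.948, 3.0, 14991, 4.4321, 3.9461, 7.2243),
]

# Select rows by two different criteria.
best_by_loss = min(rows, key=lambda r: r[3])  # lowest validation loss
best_by_bleu = max(rows, key=lambda r: r[4])  # highest BLEU

print("lowest validation loss: epoch", best_by_loss[1], "loss", best_by_loss[3])
print("highest BLEU: epoch", best_by_bleu[1], "BLEU", best_by_bleu[4])
```

On these numbers the two criteria disagree: validation loss is lowest after epoch 2 (4.4181), while BLEU is highest after epoch 3 (3.9461), so which checkpoint counts as best depends on the metric chosen.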
 
 
  ### Framework versions