manapole commited on
Commit
836566b
·
verified ·
1 Parent(s): d0634e1

Training complete

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.0165
23
- - Bleu: 96.2379
24
- - Gen Len: 18.7812
25
 
26
  ## Model description
27
 
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
55
- | No log | 1.0 | 440 | 0.0173 | 96.4182 | 26.8646 |
56
- | 0.2248 | 2.0 | 880 | 0.0165 | 96.2379 | 18.7812 |
57
 
58
 
59
  ### Framework versions
60
 
61
  - Transformers 4.49.0
62
- - Pytorch 2.5.1+cu124
63
- - Datasets 3.3.2
64
- - Tokenizers 0.21.0
 
19
 
20
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 2.7815
23
+ - Bleu: 39.3114
24
+ - Gen Len: 21.8462
25
 
26
  ## Model description
27
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
55
+ | No log | 1.0 | 25 | 3.3111 | 29.4526 | 21.1692 |
56
+ | No log | 2.0 | 50 | 2.7815 | 39.3114 | 21.8462 |
57
 
58
 
59
  ### Framework versions
60
 
61
  - Transformers 4.49.0
62
+ - Pytorch 2.6.0+cu124
63
+ - Datasets 3.4.1
64
+ - Tokenizers 0.21.1