jegilj commited on
Commit
666bc07
1 Parent(s): 56c9df8

Training complete

Browse files
Files changed (5) hide show
  1. README.md +7 -7
  2. config.json +1 -1
  3. generation_config.json +1 -1
  4. model.safetensors +1 -1
  5. training_args.bin +1 -1
README.md CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.5849
22
- - Bleu: 65.68
23
- - Gen Len: 78.0384
24
 
25
  ## Model description
26
 
@@ -51,13 +51,13 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
54
- | No log | 1.0 | 137 | 0.7570 | 59.8199 | 73.8712 |
55
- | No log | 2.0 | 274 | 0.5849 | 65.68 | 78.0384 |
56
 
57
 
58
  ### Framework versions
59
 
60
- - Transformers 4.40.0
61
  - Pytorch 2.2.1+cu121
62
- - Datasets 2.19.0
63
  - Tokenizers 0.19.1
 
18
 
19
  This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.6145
22
+ - Bleu: 64.1523
23
+ - Gen Len: 83.1479
24
 
25
  ## Model description
26
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
54
+ | No log | 1.0 | 137 | 0.7661 | 61.6664 | 82.0521 |
55
+ | No log | 2.0 | 274 | 0.6145 | 64.1523 | 83.1479 |
56
 
57
 
58
  ### Framework versions
59
 
60
+ - Transformers 4.40.2
61
  - Pytorch 2.2.1+cu121
62
+ - Datasets 2.19.1
63
  - Tokenizers 0.19.1
config.json CHANGED
@@ -52,7 +52,7 @@
52
  "static_position_embeddings": false,
53
  "tokenizer_class": "MBart50Tokenizer",
54
  "torch_dtype": "float32",
55
- "transformers_version": "4.40.0",
56
  "use_cache": true,
57
  "vocab_size": 250054
58
  }
 
52
  "static_position_embeddings": false,
53
  "tokenizer_class": "MBart50Tokenizer",
54
  "torch_dtype": "float32",
55
+ "transformers_version": "4.40.2",
56
  "use_cache": true,
57
  "vocab_size": 250054
58
  }
generation_config.json CHANGED
@@ -7,5 +7,5 @@
7
  "max_length": 200,
8
  "num_beams": 5,
9
  "pad_token_id": 1,
10
- "transformers_version": "4.40.0"
11
  }
 
7
  "max_length": 200,
8
  "num_beams": 5,
9
  "pad_token_id": 1,
10
+ "transformers_version": "4.40.2"
11
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:79041e15181289acbcfc86ff49c35818eb86938e592d7a0ba4cbbe5ba551c6fd
3
  size 2444578688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f60e73f997fc0b2f23dfba844f7b2389fa27d4916ed58553c5b83517ddb1bf84
3
  size 2444578688
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ed355f064b19060ee3cfeb6aaf4d94c2f211862be2399f36c0e2049c6bf69a29
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c33b412a66385a72be3102951fdb69136de576160c625fb446fb305f87be76d
3
  size 5112