Training complete

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5849
-- Bleu: 65.68
-- Gen Len: 78.0384
 ## Model description
@@ -51,13 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
-| No log        | 1.0   | 137  | 0.7570          | 59.8199 | 73.8712 |
-| No log        | 2.0   | 274  | 0.5849          | 65.68   | 78.0384 |
 ### Framework versions
-- Transformers 4.40.0
 - Pytorch 2.2.1+cu121
-- Datasets 2.19.0
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6145
+- Bleu: 64.1523
+- Gen Len: 83.1479
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| No log        | 1.0   | 137  | 0.7661          | 61.6664 | 82.0521 |
+| No log        | 2.0   | 274  | 0.6145          | 64.1523 | 83.1479 |
 ### Framework versions
+- Transformers 4.40.2
 - Pytorch 2.2.1+cu121
+- Datasets 2.19.1
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -52,7 +52,7 @@
   "static_position_embeddings": false,
   "tokenizer_class": "MBart50Tokenizer",
   "torch_dtype": "float32",
-  "transformers_version": "4.40.0",
   "use_cache": true,
   "vocab_size": 250054
 }

   "static_position_embeddings": false,
   "tokenizer_class": "MBart50Tokenizer",
   "torch_dtype": "float32",
+  "transformers_version": "4.40.2",
   "use_cache": true,
   "vocab_size": 250054
 }

generation_config.json CHANGED Viewed

@@ -7,5 +7,5 @@
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
-  "transformers_version": "4.40.0"
 }

   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
+  "transformers_version": "4.40.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:79041e15181289acbcfc86ff49c35818eb86938e592d7a0ba4cbbe5ba551c6fd
 size 2444578688

 version https://git-lfs.github.com/spec/v1
+oid sha256:f60e73f997fc0b2f23dfba844f7b2389fa27d4916ed58553c5b83517ddb1bf84
 size 2444578688

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed355f064b19060ee3cfeb6aaf4d94c2f211862be2399f36c0e2049c6bf69a29
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:9c33b412a66385a72be3102951fdb69136de576160c625fb446fb305f87be76d
 size 5112