End of training

Files changed (4)

README.md CHANGED

@@ -1,7 +1,7 @@
 ---
-base_model: facebook/mbart-large-50
 library_name: transformers
 license: mit
 tags:
 - generated_from_trainer
 model-index:
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0105
 ## Model description
@@ -39,7 +39,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
@@ -47,13 +47,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.4406        | 1.1641 | 1000 | 0.0237          |
-| 0.0243        | 2.3283 | 2000 | 0.0105          |
 ### Framework versions
-- Transformers 4.44.2
-- Pytorch 2.4.1+cu121
-- Datasets 3.0.0
-- Tokenizers 0.19.1

 ---
 library_name: transformers
 license: mit
+base_model: facebook/mbart-large-50
 tags:
 - generated_from_trainer
 model-index:
 This model is a fine-tuned version of [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0111
 ## Model description
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 3
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.6089        | 0.6667 | 200  | 0.0635          |
+| 0.0572        | 1.3333 | 400  | 0.0297          |
+| 0.0352        | 2.0    | 600  | 0.0168          |
+| 0.0195        | 2.6667 | 800  | 0.0111          |
 ### Framework versions
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu121
+- Datasets 3.2.0
+- Tokenizers 0.21.0
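
A note on reading the two results tables: the ratio of step to epoch gives the number of optimizer steps per epoch, which hints at how much the training set changed between the two runs. The sketch below is only an estimate. It assumes one step consumes one batch of `train_batch_size` examples (no gradient accumulation, single device), which the diff does not confirm, and the helper name `steps_per_epoch` is made up for illustration.

```python
# Estimate optimizer steps per epoch from (epoch, step) pairs in the
# results tables. Assumption (not stated in the diff): one step consumes
# one batch of TRAIN_BATCH_SIZE examples, with no gradient accumulation.

TRAIN_BATCH_SIZE = 4  # from the hyperparameter list in the README


def steps_per_epoch(epoch: float, step: int) -> float:
    """Return the implied number of optimizer steps in one full epoch."""
    return step / epoch


# First logged row of each run.
old = steps_per_epoch(1.1641, 1000)  # previous README
new = steps_per_epoch(0.6667, 200)   # updated README

# Upper bound on training examples, since the last batch may be partial.
old_examples = round(old) * TRAIN_BATCH_SIZE
new_examples = round(new) * TRAIN_BATCH_SIZE

print(f"old run: ~{round(old)} steps/epoch, at most {old_examples} examples")
print(f"new run: ~{round(new)} steps/epoch, at most {new_examples} examples")
```

Under that assumption, the previous run used roughly 859 steps per epoch and the new run roughly 300, so the two validation losses (0.0105 and 0.0111) may not come from comparable training sets.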

generation_config.json CHANGED

@@ -8,5 +8,5 @@
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
-  "transformers_version": "4.44.2"
 }

   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
+  "transformers_version": "4.47.1"
 }

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe5ec5a4f5615c7c4fcb219b1bea41dc3996f83b95d924961afc02c9cb714236
 size 2444578688

 version https://git-lfs.github.com/spec/v1
+oid sha256:7afb5e680b4a5c17d3e48796dc957417a0ccf3481d18fc3a169c6d29e1734a41
 size 2444578688
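
The `model.safetensors` entry is a Git LFS pointer, not the weights themselves: the `oid` changed while `size` stayed at 2444578688 bytes, which is consistent with retrained weights on the same architecture. A minimal sketch of checking a downloaded file against such a pointer, using only the standard library; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte file is never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents for the new model.safetensors, copied from the diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7afb5e680b4a5c17d3e48796dc957417a0ccf3481d18fc3a169c6d29e1734a41
size 2444578688
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the weights, `matches_pointer("model.safetensors", pointer)` would report whether the download corresponds to this commit.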

runs/Jan12_12-23-38_353270b004df/events.out.tfevents.1736684621.353270b004df.5788.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f47d034709e38647ff8574e5af08ee1cce2f14edaad809f181dd15fee3974d89
-size 7720

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ba9ad712cf89393dadc68baa3db6a71505715faed0212b1c50a7ea8c4aa572a
+size 8074