End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Sacrebleu
       type: sacrebleu
-      value: 37.5332
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the nusatranslation_mt dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1380
-- Sacrebleu: 37.5332
-- Gen Len: 37.17
 ## Model description
@@ -67,16 +67,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
-| 4.6139        | 1.0   | 825  | 1.2915          | 32.7445   | 37.5685 |
-| 1.1761        | 2.0   | 1650 | 1.1701          | 35.9991   | 37.3645 |
-| 1.0383        | 3.0   | 2475 | 1.1361          | 36.9321   | 37.035  |
-| 0.9094        | 4.0   | 3300 | 1.1333          | 37.4774   | 37.039  |
-| 0.8158        | 5.0   | 4125 | 1.1380          | 37.5332   | 37.17   |
 ### Framework versions
 - Transformers 4.41.2
 - Pytorch 2.3.0+cu121
-- Datasets 2.13.1
 - Tokenizers 0.19.1

     metrics:
     - name: Sacrebleu
       type: sacrebleu
+      value: 37.7411
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the nusatranslation_mt dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1318
+- Sacrebleu: 37.7411
+- Gen Len: 37.2965
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
+| 3.3872        | 1.0   | 825  | 1.2738          | 33.3267   | 37.8435 |
+| 1.1626        | 2.0   | 1650 | 1.1593          | 36.2145   | 37.436  |
+| 0.9846        | 3.0   | 2475 | 1.1289          | 37.2589   | 37.0315 |
+| 0.8883        | 4.0   | 3300 | 1.1254          | 37.8219   | 37.216  |
+| 0.832         | 5.0   | 4125 | 1.1318          | 37.7411   | 37.2965 |
 ### Framework versions
 - Transformers 4.41.2
 - Pytorch 2.3.0+cu121
+- Datasets 2.14.6
 - Tokenizers 0.19.1

generation_config.json CHANGED Viewed

@@ -1,6 +1,8 @@
 {
   "bos_token_id": 0,
-  "max_length": 64,
-  "push_to_hub": true,
   "transformers_version": "4.41.2"
 }

 {
   "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "eos_token_id": 2,
+  "max_length": 200,
+  "pad_token_id": 1,
   "transformers_version": "4.41.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:55d5558de6e978fdc617b20589ad042a4b2942177a733fa450833086b90400e8
 size 2460354912

 version https://git-lfs.github.com/spec/v1
+oid sha256:e75d3616fbd087a692a97ddfe9f4a2625851334c3f9e7d2d77354a2b0625d45c
 size 2460354912

runs/Jul14_08-59-13_fa0f6b075495/events.out.tfevents.1720947556.fa0f6b075495.195.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0e3014ff2cff22518d27597819c667e934ea2855855046ed0ce6d272577816e5
-size 8142

 version https://git-lfs.github.com/spec/v1
+oid sha256:3ba0d2e1ffa3eb200fcd2023f01ab18552b02b1b3854dfbcf44e8c8b12a47ef4
+size 8496