MiuN2k3/mtl-infoxlm-base-viwiki-3-7

Generated from Trainer

MiuN2k3 committed on Feb 4

Commit cf1e456 · verified · 1 Parent(s): 7177e29

End of training

Files changed (2)

README.md +13 -13
model.safetensors +1 -1

README.md CHANGED

@@ -4,18 +4,18 @@ base_model: microsoft/infoxlm-base
 tags:
 - generated_from_trainer
 model-index:
-- name: mtl-infoxlm-base-viwiki-1-9
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mtl-infoxlm-base-viwiki-1-9
 This model is a fine-tuned version of [microsoft/infoxlm-base](https://huggingface.co/microsoft/infoxlm-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7978
 ## Model description
@@ -47,16 +47,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.3968        | 1.0   | 960  | 1.2382          |
-| 1.2884        | 2.0   | 1920 | 1.0464          |
-| 1.077         | 3.0   | 2880 | 0.9246          |
-| 0.997         | 4.0   | 3840 | 0.9064          |
-| 0.848         | 5.0   | 4800 | 0.8866          |
-| 0.9368        | 6.0   | 5760 | 0.8515          |
-| 0.7929        | 7.0   | 6720 | 0.8010          |
-| 0.7975        | 8.0   | 7680 | 0.8067          |
-| 0.6187        | 9.0   | 8640 | 0.7987          |
-| 0.5906        | 10.0  | 9600 | 0.7978          |
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: mtl-infoxlm-base-viwiki-3-7
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mtl-infoxlm-base-viwiki-3-7
 This model is a fine-tuned version of [microsoft/infoxlm-base](https://huggingface.co/microsoft/infoxlm-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6374
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.46          | 1.0   | 960  | 1.1906          |
+| 1.0221        | 2.0   | 1920 | 0.8743          |
+| 0.8335        | 3.0   | 2880 | 0.7794          |
+| 0.8466        | 4.0   | 3840 | 0.7162          |
+| 0.702         | 5.0   | 4800 | 0.6996          |
+| 0.741         | 6.0   | 5760 | 0.6525          |
+| 0.615         | 7.0   | 6720 | 0.6300          |
+| 0.6112        | 8.0   | 7680 | 0.6404          |
+| 0.5314        | 9.0   | 8640 | 0.6420          |
+| 0.3893        | 10.0  | 9600 | 0.6374          |
 ### Framework versions
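
The two validation-loss tables in this diff can be compared directly. The sketch below copies the per-epoch values from the removed rows (the README previously titled mtl-infoxlm-base-viwiki-1-9) and the added rows (mtl-infoxlm-base-viwiki-3-7). The names `old_run`, `new_run`, and `best_epoch` are illustrative and do not appear in the repository. It finds the epoch with the lowest validation loss in each run, which need not be the final epoch whose loss the README reports.

```python
# Validation loss per epoch, copied from the two tables in the README diff.
# old_run: removed rows (card titled mtl-infoxlm-base-viwiki-1-9)
# new_run: added rows (card titled mtl-infoxlm-base-viwiki-3-7)
old_run = [1.2382, 1.0464, 0.9246, 0.9064, 0.8866,
           0.8515, 0.8010, 0.8067, 0.7987, 0.7978]
new_run = [1.1906, 0.8743, 0.7794, 0.7162, 0.6996,
           0.6525, 0.6300, 0.6404, 0.6420, 0.6374]


def best_epoch(val_losses):
    """Return (epoch, loss) for the lowest validation loss; epochs are 1-based."""
    loss = min(val_losses)
    return val_losses.index(loss) + 1, loss


old_best = best_epoch(old_run)
new_best = best_epoch(new_run)

# Difference between the final-epoch losses that each README reports.
final_delta = round(new_run[-1] - old_run[-1], 4)

print("old run best (epoch, loss):", old_best)
print("new run best (epoch, loss):", new_best)
print("change in reported final loss:", final_delta)
```

In the new run the lowest validation loss occurs at epoch 7, while the reported value 0.6374 is the epoch-10 loss. Whether the uploaded weights correspond to the last or the best checkpoint is not stated in the diff.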

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2d469e18025f0bfb5b95326b507544c51b3447879cd649ebb538a7b0f62611e3
 size 1109845836

 version https://git-lfs.github.com/spec/v1
+oid sha256:1dccaa95f2eeba5e984fdc5ede362c95eb11ed2bac9c5c2c1323e7b9688cf8ad
 size 1109845836
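
The model.safetensors entry above is a Git LFS pointer: the `oid` line carries the SHA-256 of the real file and `size` its length in bytes. The size is unchanged while the hash differs, so the commit swaps in new weights of identical byte length. As a minimal sketch, assuming the weights have already been downloaded to a local path of your choosing, the pointer can be used to check the file. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a small dict."""
    fields = dict(line.strip().split(" ", 1)
                  for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and hash."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so a ~1.1 GB file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for the new side of the diff, copied from above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:1dccaa95f2eeba5e984fdc5ede362c95eb11ed2bac9c5c2c1323e7b9688cf8ad
size 1109845836
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy (the path is an assumption) would then confirm that the download corresponds to commit cf1e456 rather than its parent.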