MiuN2k3
/

mtl-xlmr-base-viwiki-v2

Transformers

Safetensors

roberta

Generated from Trainer

Model card Files Files and versions Community

MiuN2k3 commited on Jul 31, 2024

Commit

be6f3d9

verified ·

1 Parent(s): 7079ea0

End of training

Browse files

Files changed (2) hide show

README.md +19 -19
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,21 +1,21 @@
 ---
 license: mit
-base_model: xlm-roberta-large
 tags:
 - generated_from_trainer
 model-index:
-- name: mtl-xlmr-large-dsc-v2
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mtl-xlmr-large-dsc-v2
-This model is a fine-tuned version of [xlm-roberta-large](https://huggingface.co/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2257
 ## Model description
@@ -35,8 +35,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 8
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -45,18 +45,18 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 0.4901        | 1.0   | 3087  | 0.5458          |
-| 0.5308        | 2.0   | 6174  | 0.4982          |
-| 0.39          | 3.0   | 9261  | 0.5104          |
-| 0.1466        | 4.0   | 12348 | 0.6800          |
-| 0.075         | 5.0   | 15435 | 0.6820          |
-| 0.2699        | 6.0   | 18522 | 0.7831          |
-| 0.1868        | 7.0   | 21609 | 0.9032          |
-| 0.3338        | 8.0   | 24696 | 1.0255          |
-| 0.0001        | 9.0   | 27783 | 1.1477          |
-| 0.0001        | 10.0  | 30870 | 1.2257          |
 ### Framework versions

 ---
 license: mit
+base_model: xlm-roberta-base
 tags:
 - generated_from_trainer
 model-index:
+- name: mtl-xlmr-base-viwiki-v2
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mtl-xlmr-base-viwiki-v2
+This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6919
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.7756        | 1.0   | 960  | 0.7972          |
+| 0.6104        | 2.0   | 1920 | 0.6775          |
+| 0.5942        | 3.0   | 2880 | 0.6227          |
+| 0.6037        | 4.0   | 3840 | 0.6349          |
+| 0.5208        | 5.0   | 4800 | 0.5975          |
+| 0.347         | 6.0   | 5760 | 0.6008          |
+| 0.415         | 7.0   | 6720 | 0.6142          |
+| 0.3473        | 8.0   | 7680 | 0.6252          |
+| 0.3312        | 9.0   | 8640 | 0.6748          |
+| 0.2134        | 10.0  | 9600 | 0.6919          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8259812bbade2b832b60cb607a25e0d50b838f1662657b5680503366a79c26aa
 size 1109845836

 version https://git-lfs.github.com/spec/v1
+oid sha256:f96f5d661868e66d99a5707fadd7f9def9ab8dc81a8c13f72d8a52ef7b083116
 size 1109845836