End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [makhataei/qa-persian-distilbert-fa-zwnj-base](https://huggingface.co/makhataei/qa-persian-distilbert-fa-zwnj-base) on the parsinlu_reading_comprehension dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.5713
 ## Model description
@@ -36,7 +36,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2.5e-06
 - train_batch_size: 13
 - eval_batch_size: 13
 - seed: 42
@@ -48,16 +48,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.0646        | 1.0   | 47   | 4.1193          |
-| 0.0608        | 2.0   | 94   | 4.2495          |
-| 0.0517        | 3.0   | 141  | 4.2699          |
-| 0.0467        | 4.0   | 188  | 4.3474          |
-| 0.0394        | 5.0   | 235  | 4.4168          |
-| 0.0506        | 6.0   | 282  | 4.5005          |
-| 0.0487        | 7.0   | 329  | 4.5162          |
-| 0.0678        | 8.0   | 376  | 4.5540          |
-| 0.07          | 9.0   | 423  | 4.5671          |
-| 0.0922        | 10.0  | 470  | 4.5713          |
 ### Framework versions

 This model is a fine-tuned version of [makhataei/qa-persian-distilbert-fa-zwnj-base](https://huggingface.co/makhataei/qa-persian-distilbert-fa-zwnj-base) on the parsinlu_reading_comprehension dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.8598
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.25e-06
 - train_batch_size: 13
 - eval_batch_size: 13
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.0177        | 1.0   | 47   | 4.5693          |
+| 0.0193        | 2.0   | 94   | 4.6949          |
+| 0.0156        | 3.0   | 141  | 4.6756          |
+| 0.018         | 4.0   | 188  | 4.7300          |
+| 0.0149        | 5.0   | 235  | 4.7674          |
+| 0.022         | 6.0   | 282  | 4.8171          |
+| 0.024         | 7.0   | 329  | 4.8203          |
+| 0.0381        | 8.0   | 376  | 4.8388          |
+| 0.045         | 9.0   | 423  | 4.8562          |
+| 0.0697        | 10.0  | 470  | 4.8598          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:36c22955fc195d41e5649b5983a850e523d1871da5ffff563485fd7705897e04
 size 300730456

 version https://git-lfs.github.com/spec/v1
+oid sha256:51137a286350511a6757056a035c65322e96cd50ba1b3926aca09f561c773e9f
 size 300730456

runs/Feb20_12-18-22_Software-AI/events.out.tfevents.1708418902.Software-AI.146186.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:48a82711ae33c83ac856c2da393fb53543c61c37cc72c072782bd0d409669ac3
+size 8924

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60151df045d8901496710d774fa21cd16e3f075bcf8da0f34eeeca5dd1b2e12c
 size 4219

 version https://git-lfs.github.com/spec/v1
+oid sha256:ca35848f308c8547af0c62ab4db55172025106ad65f95bffa518a2d7ac743f2d
 size 4219