End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [makhataei/qa-persian-distilbert-fa-zwnj-base](https://huggingface.co/makhataei/qa-persian-distilbert-fa-zwnj-base) on the parsinlu_reading_comprehension dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.1241
 ## Model description
@@ -36,9 +36,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1.953125e-08
-- train_batch_size: 13
-- eval_batch_size: 13
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -48,16 +48,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.0024        | 1.0   | 47   | 5.1171          |
-| 0.0022        | 2.0   | 94   | 5.1147          |
-| 0.0026        | 3.0   | 141  | 5.1135          |
-| 0.0028        | 4.0   | 188  | 5.1116          |
-| 0.0034        | 5.0   | 235  | 5.1126          |
-| 0.0056        | 6.0   | 282  | 5.1166          |
-| 0.0081        | 7.0   | 329  | 5.1197          |
-| 0.0156        | 8.0   | 376  | 5.1220          |
-| 0.0253        | 9.0   | 423  | 5.1238          |
-| 0.0546        | 10.0  | 470  | 5.1241          |
 ### Framework versions

 This model is a fine-tuned version of [makhataei/qa-persian-distilbert-fa-zwnj-base](https://huggingface.co/makhataei/qa-persian-distilbert-fa-zwnj-base) on the parsinlu_reading_comprehension dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.6503
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.001
+- train_batch_size: 14
+- eval_batch_size: 14
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 5.6573        | 1.0   | 43   | 5.6503          |
+| 5.7768        | 2.0   | 86   | 5.6503          |
+| 5.7672        | 3.0   | 129  | 5.6503          |
+| 5.7933        | 4.0   | 172  | 5.6503          |
+| 5.7729        | 5.0   | 215  | 5.6503          |
+| 5.7827        | 6.0   | 258  | 5.6503          |
+| 5.7825        | 7.0   | 301  | 5.6503          |
+| 5.7841        | 8.0   | 344  | 5.6503          |
+| 5.7551        | 9.0   | 387  | 5.6503          |
+| 5.7649        | 10.0  | 430  | 5.6503          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:53c5910675125d6c3af8a93d907682c807c90d82e17bca9f49e83ee6b0bd4989
 size 300730456

 version https://git-lfs.github.com/spec/v1
+oid sha256:9406cebf1377d563e3fff3152213341387c29d256aaba5bcd4726a1108643d2f
 size 300730456

runs/Mar02_09-08-40_Software-AI/events.out.tfevents.1709357921.Software-AI.31256.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ac26b85e5888c2306fc9a455e4876c2f6c91152ecc537e897fa5041be2718f65
+size 8965

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:618396f0bc1ad4b54c24aa0316f242c8b72c28f1a65f0191556f5f343ebf71fa
 size 4219

 version https://git-lfs.github.com/spec/v1
+oid sha256:cef9b96ad58dc501195b65ee2b727ef2d55fc8edc011ad723b4cd553850600ba
 size 4219