End of training

Browse files

Files changed (4) hide show

README.md +104 -16
model.safetensors +1 -1
runs/Mar06_14-48-57_Software-AI/events.out.tfevents.1709723938.Software-AI.118212.16 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -3,8 +3,6 @@ license: apache-2.0
 base_model: makhataei/qa-persian-bert-fa-base-uncased
 tags:
 - generated_from_trainer
-datasets:
-- parsinlu_reading_comprehension
 model-index:
 - name: qa-persian-bert-fa-base-uncased
   results: []
@@ -15,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # qa-persian-bert-fa-base-uncased
-This model is a fine-tuned version of [makhataei/qa-persian-bert-fa-base-uncased](https://huggingface.co/makhataei/qa-persian-bert-fa-base-uncased) on the parsinlu_reading_comprehension dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.4907
 ## Model description
@@ -36,28 +34,118 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 14
 - eval_batch_size: 14
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.4172        | 1.0   | 86   | 5.4907          |
-| 5.3912        | 2.0   | 172  | 5.4907          |
-| 5.4122        | 3.0   | 258  | 5.4907          |
-| 5.4156        | 4.0   | 344  | 5.4907          |
-| 5.4186        | 5.0   | 430  | 5.4907          |
-| 5.3939        | 6.0   | 516  | 5.4907          |
-| 5.4237        | 7.0   | 602  | 5.4907          |
-| 5.3991        | 8.0   | 688  | 5.4907          |
-| 5.412         | 9.0   | 774  | 5.4907          |
-| 5.448         | 10.0  | 860  | 5.4907          |
 ### Framework versions

 base_model: makhataei/qa-persian-bert-fa-base-uncased
 tags:
 - generated_from_trainer
 model-index:
 - name: qa-persian-bert-fa-base-uncased
   results: []
 # qa-persian-bert-fa-base-uncased
+This model is a fine-tuned version of [makhataei/qa-persian-bert-fa-base-uncased](https://huggingface.co/makhataei/qa-persian-bert-fa-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.3355
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-07
 - train_batch_size: 14
 - eval_batch_size: 14
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 100
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 5.5994        | 1.0   | 9    | 5.3355          |
+| 5.6987        | 2.0   | 18   | 5.3355          |
+| 5.6845        | 3.0   | 27   | 5.3355          |
+| 5.6478        | 4.0   | 36   | 5.3355          |
+| 5.722         | 5.0   | 45   | 5.3355          |
+| 5.6464        | 6.0   | 54   | 5.3355          |
+| 5.5939        | 7.0   | 63   | 5.3355          |
+| 5.5771        | 8.0   | 72   | 5.3355          |
+| 5.5841        | 9.0   | 81   | 5.3355          |
+| 5.5864        | 10.0  | 90   | 5.3355          |
+| 5.5771        | 11.0  | 99   | 5.3355          |
+| 5.6131        | 12.0  | 108  | 5.3355          |
+| 5.6694        | 13.0  | 117  | 5.3355          |
+| 5.7032        | 14.0  | 126  | 5.3355          |
+| 5.6996        | 15.0  | 135  | 5.3355          |
+| 5.6724        | 16.0  | 144  | 5.3355          |
+| 5.7379        | 17.0  | 153  | 5.3355          |
+| 5.6688        | 18.0  | 162  | 5.3355          |
+| 5.7008        | 19.0  | 171  | 5.3355          |
+| 5.6231        | 20.0  | 180  | 5.3355          |
+| 5.6514        | 21.0  | 189  | 5.3355          |
+| 5.6814        | 22.0  | 198  | 5.3355          |
+| 5.6307        | 23.0  | 207  | 5.3355          |
+| 5.7506        | 24.0  | 216  | 5.3355          |
+| 5.6748        | 25.0  | 225  | 5.3355          |
+| 5.6644        | 26.0  | 234  | 5.3355          |
+| 5.6912        | 27.0  | 243  | 5.3355          |
+| 5.673         | 28.0  | 252  | 5.3355          |
+| 5.6223        | 29.0  | 261  | 5.3355          |
+| 5.6194        | 30.0  | 270  | 5.3355          |
+| 5.6944        | 31.0  | 279  | 5.3355          |
+| 5.6899        | 32.0  | 288  | 5.3355          |
+| 5.6169        | 33.0  | 297  | 5.3355          |
+| 5.6643        | 34.0  | 306  | 5.3355          |
+| 5.704         | 35.0  | 315  | 5.3355          |
+| 5.6704        | 36.0  | 324  | 5.3355          |
+| 5.6939        | 37.0  | 333  | 5.3355          |
+| 5.6055        | 38.0  | 342  | 5.3355          |
+| 5.5774        | 39.0  | 351  | 5.3355          |
+| 5.5988        | 40.0  | 360  | 5.3355          |
+| 5.6704        | 41.0  | 369  | 5.3355          |
+| 5.6441        | 42.0  | 378  | 5.3355          |
+| 5.6434        | 43.0  | 387  | 5.3355          |
+| 5.6054        | 44.0  | 396  | 5.3355          |
+| 5.6084        | 45.0  | 405  | 5.3355          |
+| 5.738         | 46.0  | 414  | 5.3355          |
+| 5.6527        | 47.0  | 423  | 5.3355          |
+| 5.6566        | 48.0  | 432  | 5.3355          |
+| 5.6381        | 49.0  | 441  | 5.3355          |
+| 5.7056        | 50.0  | 450  | 5.3355          |
+| 5.6694        | 51.0  | 459  | 5.3355          |
+| 5.6043        | 52.0  | 468  | 5.3355          |
+| 5.6552        | 53.0  | 477  | 5.3355          |
+| 5.5852        | 54.0  | 486  | 5.3355          |
+| 5.6209        | 55.0  | 495  | 5.3355          |
+| 5.6145        | 56.0  | 504  | 5.3355          |
+| 5.6426        | 57.0  | 513  | 5.3355          |
+| 5.5891        | 58.0  | 522  | 5.3355          |
+| 5.6143        | 59.0  | 531  | 5.3355          |
+| 5.6737        | 60.0  | 540  | 5.3355          |
+| 5.6741        | 61.0  | 549  | 5.3355          |
+| 5.6885        | 62.0  | 558  | 5.3355          |
+| 5.677         | 63.0  | 567  | 5.3355          |
+| 5.6158        | 64.0  | 576  | 5.3355          |
+| 5.6182        | 65.0  | 585  | 5.3355          |
+| 5.6781        | 66.0  | 594  | 5.3355          |
+| 5.686         | 67.0  | 603  | 5.3355          |
+| 5.6751        | 68.0  | 612  | 5.3355          |
+| 5.5912        | 69.0  | 621  | 5.3355          |
+| 5.66          | 70.0  | 630  | 5.3355          |
+| 5.7323        | 71.0  | 639  | 5.3355          |
+| 5.6168        | 72.0  | 648  | 5.3355          |
+| 5.6719        | 73.0  | 657  | 5.3355          |
+| 5.6933        | 74.0  | 666  | 5.3355          |
+| 5.5853        | 75.0  | 675  | 5.3355          |
+| 5.5871        | 76.0  | 684  | 5.3355          |
+| 5.652         | 77.0  | 693  | 5.3355          |
+| 5.6025        | 78.0  | 702  | 5.3355          |
+| 5.6427        | 79.0  | 711  | 5.3355          |
+| 5.639         | 80.0  | 720  | 5.3355          |
+| 5.6558        | 81.0  | 729  | 5.3355          |
+| 5.6957        | 82.0  | 738  | 5.3355          |
+| 5.6081        | 83.0  | 747  | 5.3355          |
+| 5.6185        | 84.0  | 756  | 5.3355          |
+| 5.6379        | 85.0  | 765  | 5.3355          |
+| 5.6208        | 86.0  | 774  | 5.3355          |
+| 5.7416        | 87.0  | 783  | 5.3355          |
+| 5.704         | 88.0  | 792  | 5.3355          |
+| 5.6387        | 89.0  | 801  | 5.3355          |
+| 5.6339        | 90.0  | 810  | 5.3355          |
+| 5.6447        | 91.0  | 819  | 5.3355          |
+| 5.6304        | 92.0  | 828  | 5.3355          |
+| 5.6814        | 93.0  | 837  | 5.3355          |
+| 5.6435        | 94.0  | 846  | 5.3355          |
+| 5.6821        | 95.0  | 855  | 5.3355          |
+| 5.6318        | 96.0  | 864  | 5.3355          |
+| 5.6404        | 97.0  | 873  | 5.3355          |
+| 5.6277        | 98.0  | 882  | 5.3355          |
+| 5.639         | 99.0  | 891  | 5.3355          |
+| 5.6655        | 100.0 | 900  | 5.3355          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:71156aeccdb4ae7402e818bdd574b0b82d7a81d237f470bdf6172971949a3ce8
 size 649032520

 version https://git-lfs.github.com/spec/v1
+oid sha256:1edb0f1ab0e06fd59439b00161ca94a4b17f0fd9c3d2590703570e153dac88ad
 size 649032520

runs/Mar06_14-48-57_Software-AI/events.out.tfevents.1709723938.Software-AI.118212.16 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4b81182a319a1238737827223838de15e3c88ff7a99390eb41fba164392b769a
+size 47439

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a8c4e55593a329b12ce547c69a354b6bc41f73ca6a81af678d43b46994945276
 size 4219

 version https://git-lfs.github.com/spec/v1
+oid sha256:2ee7cbdabee3a994f2d9cd4f16a1335feafa48f4fe4f75e56259fc7f2836aaa8
 size 4219