ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k1_task1_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.4905
  • Qwk: 0.4375
  • Mse: 1.4905
  • Rmse: 1.2209

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.4 2 6.9282 -0.0056 6.9282 2.6322
No log 0.8 4 4.6622 0.0455 4.6622 2.1592
No log 1.2 6 3.2342 0.0970 3.2342 1.7984
No log 1.6 8 3.0309 0.0510 3.0309 1.7410
No log 2.0 10 2.2738 0.0640 2.2738 1.5079
No log 2.4 12 1.9303 0.0885 1.9303 1.3894
No log 2.8 14 1.8525 0.1111 1.8525 1.3611
No log 3.2 16 1.7615 0.1524 1.7615 1.3272
No log 3.6 18 1.7716 0.1524 1.7716 1.3310
No log 4.0 20 1.7963 0.1132 1.7963 1.3403
No log 4.4 22 2.0293 0.1849 2.0293 1.4245
No log 4.8 24 2.1242 0.1667 2.1242 1.4575
No log 5.2 26 1.8903 0.2478 1.8903 1.3749
No log 5.6 28 1.6658 0.1869 1.6658 1.2907
No log 6.0 30 1.6141 0.1887 1.6141 1.2705
No log 6.4 32 1.6922 0.2883 1.6922 1.3009
No log 6.8 34 1.9839 0.2576 1.9839 1.4085
No log 7.2 36 2.4854 0.0795 2.4854 1.5765
No log 7.6 38 2.3763 0.0816 2.3763 1.5415
No log 8.0 40 1.7705 0.3281 1.7705 1.3306
No log 8.4 42 1.5137 0.3036 1.5137 1.2303
No log 8.8 44 1.4904 0.3036 1.4904 1.2208
No log 9.2 46 1.4533 0.2545 1.4533 1.2055
No log 9.6 48 1.4339 0.2385 1.4339 1.1974
No log 10.0 50 1.7883 0.3000 1.7883 1.3373
No log 10.4 52 2.0196 0.2308 2.0196 1.4211
No log 10.8 54 1.6670 0.2414 1.6670 1.2911
No log 11.2 56 1.4803 0.4310 1.4803 1.2167
No log 11.6 58 1.4608 0.4407 1.4608 1.2086
No log 12.0 60 1.4514 0.3966 1.4514 1.2047
No log 12.4 62 1.4587 0.4103 1.4587 1.2078
No log 12.8 64 1.4137 0.3590 1.4137 1.1890
No log 13.2 66 1.4262 0.3967 1.4262 1.1942
No log 13.6 68 1.4065 0.4032 1.4065 1.1860
No log 14.0 70 1.4716 0.352 1.4716 1.2131
No log 14.4 72 1.4749 0.3200 1.4749 1.2145
No log 14.8 74 1.4828 0.3252 1.4828 1.2177
No log 15.2 76 1.5413 0.2787 1.5413 1.2415
No log 15.6 78 1.4718 0.3390 1.4718 1.2132
No log 16.0 80 1.5350 0.3793 1.5350 1.2390
No log 16.4 82 1.5402 0.3826 1.5402 1.2410
No log 16.8 84 1.6388 0.2645 1.6388 1.2802
No log 17.2 86 1.7303 0.2951 1.7303 1.3154
No log 17.6 88 1.6450 0.2301 1.6450 1.2826
No log 18.0 90 1.6250 0.2162 1.6250 1.2748
No log 18.4 92 1.6419 0.2478 1.6419 1.2814
No log 18.8 94 1.6111 0.2832 1.6111 1.2693
No log 19.2 96 1.7177 0.2097 1.7177 1.3106
No log 19.6 98 1.8814 0.2239 1.8814 1.3717
No log 20.0 100 1.7871 0.2424 1.7871 1.3368
No log 20.4 102 1.5977 0.35 1.5977 1.2640
No log 20.8 104 1.6966 0.3387 1.6966 1.3025
No log 21.2 106 1.7717 0.2975 1.7717 1.3311
No log 21.6 108 1.5812 0.3802 1.5812 1.2575
No log 22.0 110 1.6199 0.2787 1.6199 1.2728
No log 22.4 112 1.9086 0.2774 1.9086 1.3815
No log 22.8 114 1.9595 0.2286 1.9595 1.3998
No log 23.2 116 1.7607 0.3206 1.7607 1.3269
No log 23.6 118 1.5093 0.3115 1.5093 1.2286
No log 24.0 120 1.4008 0.4068 1.4008 1.1835
No log 24.4 122 1.3788 0.3833 1.3788 1.1742
No log 24.8 124 1.4099 0.4228 1.4099 1.1874
No log 25.2 126 1.4563 0.4065 1.4563 1.2068
No log 25.6 128 1.5006 0.4032 1.5006 1.2250
No log 26.0 130 1.4934 0.4409 1.4934 1.2221
No log 26.4 132 1.4822 0.4409 1.4822 1.2174
No log 26.8 134 1.4217 0.4603 1.4217 1.1924
No log 27.2 136 1.3737 0.4762 1.3737 1.1721
No log 27.6 138 1.3813 0.4194 1.3813 1.1753
No log 28.0 140 1.4726 0.4444 1.4726 1.2135
No log 28.4 142 1.5088 0.4444 1.5088 1.2283
No log 28.8 144 1.4319 0.5156 1.4319 1.1966
No log 29.2 146 1.3732 0.4167 1.3732 1.1718
No log 29.6 148 1.4002 0.4167 1.4002 1.1833
No log 30.0 150 1.4081 0.3419 1.4081 1.1866
No log 30.4 152 1.4327 0.3419 1.4327 1.1970
No log 30.8 154 1.4183 0.3559 1.4183 1.1909
No log 31.2 156 1.4131 0.4098 1.4131 1.1887
No log 31.6 158 1.3882 0.4480 1.3882 1.1782
No log 32.0 160 1.4007 0.4882 1.4007 1.1835
No log 32.4 162 1.4469 0.4341 1.4469 1.2029
No log 32.8 164 1.4245 0.4341 1.4245 1.1935
No log 33.2 166 1.4647 0.4427 1.4647 1.2102
No log 33.6 168 1.5992 0.4122 1.5992 1.2646
No log 34.0 170 1.6327 0.3511 1.6327 1.2778
No log 34.4 172 1.5716 0.3846 1.5716 1.2536
No log 34.8 174 1.4142 0.4409 1.4142 1.1892
No log 35.2 176 1.3068 0.4839 1.3068 1.1432
No log 35.6 178 1.3181 0.4839 1.3181 1.1481
No log 36.0 180 1.3539 0.4355 1.3539 1.1636
No log 36.4 182 1.5086 0.4275 1.5086 1.2282
No log 36.8 184 1.6572 0.3030 1.6572 1.2873
No log 37.2 186 1.7095 0.3066 1.7095 1.3075
No log 37.6 188 1.5720 0.3846 1.5720 1.2538
No log 38.0 190 1.4936 0.4186 1.4936 1.2221
No log 38.4 192 1.3582 0.5156 1.3582 1.1654
No log 38.8 194 1.2922 0.4844 1.2922 1.1368
No log 39.2 196 1.3023 0.4844 1.3023 1.1412
No log 39.6 198 1.4192 0.5271 1.4192 1.1913
No log 40.0 200 1.5491 0.3817 1.5491 1.2446
No log 40.4 202 1.5855 0.3788 1.5855 1.2592
No log 40.8 204 1.4773 0.4651 1.4773 1.2154
No log 41.2 206 1.2989 0.5116 1.2989 1.1397
No log 41.6 208 1.2612 0.5469 1.2612 1.1230
No log 42.0 210 1.3278 0.5079 1.3278 1.1523
No log 42.4 212 1.4303 0.4882 1.4303 1.1960
No log 42.8 214 1.4738 0.4603 1.4738 1.2140
No log 43.2 216 1.4097 0.5079 1.4097 1.1873
No log 43.6 218 1.3778 0.4960 1.3778 1.1738
No log 44.0 220 1.3702 0.5079 1.3702 1.1705
No log 44.4 222 1.3623 0.5079 1.3623 1.1672
No log 44.8 224 1.3908 0.5079 1.3908 1.1793
No log 45.2 226 1.4195 0.5079 1.4195 1.1914
No log 45.6 228 1.4467 0.5156 1.4467 1.2028
No log 46.0 230 1.3565 0.5079 1.3565 1.1647
No log 46.4 232 1.2714 0.5039 1.2714 1.1276
No log 46.8 234 1.2824 0.5039 1.2824 1.1324
No log 47.2 236 1.3544 0.5197 1.3544 1.1638
No log 47.6 238 1.3516 0.5197 1.3516 1.1626
No log 48.0 240 1.3075 0.5197 1.3075 1.1434
No log 48.4 242 1.2218 0.5312 1.2218 1.1053
No log 48.8 244 1.1688 0.5581 1.1688 1.0811
No log 49.2 246 1.1821 0.5354 1.1821 1.0872
No log 49.6 248 1.2543 0.5312 1.2543 1.1200
No log 50.0 250 1.3755 0.5039 1.3755 1.1728
No log 50.4 252 1.4233 0.4961 1.4233 1.1930
No log 50.8 254 1.3893 0.5312 1.3893 1.1787
No log 51.2 256 1.3455 0.5312 1.3455 1.1599
No log 51.6 258 1.3762 0.5312 1.3762 1.1731
No log 52.0 260 1.4081 0.5312 1.4081 1.1866
No log 52.4 262 1.4492 0.4688 1.4492 1.2038
No log 52.8 264 1.4602 0.4603 1.4602 1.2084
No log 53.2 266 1.4558 0.4480 1.4558 1.2066
No log 53.6 268 1.3969 0.5039 1.3969 1.1819
No log 54.0 270 1.3613 0.4762 1.3613 1.1667
No log 54.4 272 1.3481 0.4762 1.3481 1.1611
No log 54.8 274 1.3963 0.4762 1.3963 1.1817
No log 55.2 276 1.4847 0.4341 1.4847 1.2185
No log 55.6 278 1.5102 0.4308 1.5102 1.2289
No log 56.0 280 1.4976 0.4308 1.4976 1.2238
No log 56.4 282 1.4125 0.4806 1.4125 1.1885
No log 56.8 284 1.3006 0.5426 1.3006 1.1404
No log 57.2 286 1.2451 0.5581 1.2451 1.1159
No log 57.6 288 1.2269 0.5312 1.2269 1.1077
No log 58.0 290 1.2395 0.5426 1.2395 1.1133
No log 58.4 292 1.2730 0.5426 1.2730 1.1283
No log 58.8 294 1.3063 0.5469 1.3063 1.1429
No log 59.2 296 1.2692 0.5426 1.2692 1.1266
No log 59.6 298 1.2395 0.5426 1.2395 1.1133
No log 60.0 300 1.2607 0.5426 1.2607 1.1228
No log 60.4 302 1.2741 0.5426 1.2741 1.1288
No log 60.8 304 1.3379 0.5156 1.3379 1.1567
No log 61.2 306 1.3836 0.4921 1.3836 1.1763
No log 61.6 308 1.3378 0.5156 1.3378 1.1566
No log 62.0 310 1.2870 0.5156 1.2870 1.1345
No log 62.4 312 1.2754 0.5156 1.2754 1.1293
No log 62.8 314 1.2907 0.5156 1.2907 1.1361
No log 63.2 316 1.3637 0.5426 1.3637 1.1678
No log 63.6 318 1.4258 0.4580 1.4258 1.1941
No log 64.0 320 1.4747 0.4308 1.4747 1.2144
No log 64.4 322 1.5432 0.4122 1.5432 1.2423
No log 64.8 324 1.5548 0.4122 1.5548 1.2469
No log 65.2 326 1.4968 0.4341 1.4968 1.2234
No log 65.6 328 1.4387 0.4688 1.4387 1.1995
No log 66.0 330 1.4038 0.4651 1.4038 1.1848
No log 66.4 332 1.4136 0.4688 1.4136 1.1890
No log 66.8 334 1.4047 0.4724 1.4047 1.1852
No log 67.2 336 1.3803 0.4603 1.3803 1.1749
No log 67.6 338 1.3524 0.4567 1.3524 1.1629
No log 68.0 340 1.3253 0.496 1.3253 1.1512
No log 68.4 342 1.3249 0.496 1.3249 1.1510
No log 68.8 344 1.3579 0.4127 1.3579 1.1653
No log 69.2 346 1.4065 0.3968 1.4065 1.1860
No log 69.6 348 1.4921 0.4031 1.4921 1.2215
No log 70.0 350 1.5552 0.3692 1.5552 1.2471
No log 70.4 352 1.5678 0.3692 1.5678 1.2521
No log 70.8 354 1.5170 0.3692 1.5170 1.2317
No log 71.2 356 1.4601 0.4127 1.4601 1.2084
No log 71.6 358 1.4357 0.4409 1.4357 1.1982
No log 72.0 360 1.4438 0.4762 1.4438 1.2016
No log 72.4 362 1.4841 0.4480 1.4841 1.2182
No log 72.8 364 1.5466 0.4252 1.5466 1.2436
No log 73.2 366 1.5920 0.3906 1.5920 1.2618
No log 73.6 368 1.5749 0.4252 1.5749 1.2550
No log 74.0 370 1.5461 0.4286 1.5461 1.2434
No log 74.4 372 1.5001 0.4286 1.5001 1.2248
No log 74.8 374 1.4959 0.4286 1.4959 1.2231
No log 75.2 376 1.5316 0.4286 1.5316 1.2376
No log 75.6 378 1.5740 0.3566 1.5740 1.2546
No log 76.0 380 1.5997 0.3566 1.5997 1.2648
No log 76.4 382 1.6098 0.3566 1.6098 1.2688
No log 76.8 384 1.6246 0.3566 1.6246 1.2746
No log 77.2 386 1.6268 0.3692 1.6268 1.2754
No log 77.6 388 1.6112 0.3566 1.6112 1.2693
No log 78.0 390 1.6188 0.3566 1.6188 1.2723
No log 78.4 392 1.6041 0.3566 1.6041 1.2665
No log 78.8 394 1.5580 0.3906 1.5580 1.2482
No log 79.2 396 1.5186 0.4219 1.5186 1.2323
No log 79.6 398 1.4820 0.4286 1.4820 1.2174
No log 80.0 400 1.4690 0.4286 1.4690 1.2120
No log 80.4 402 1.4689 0.4286 1.4689 1.2120
No log 80.8 404 1.4645 0.4286 1.4645 1.2102
No log 81.2 406 1.4615 0.4286 1.4615 1.2089
No log 81.6 408 1.4625 0.4286 1.4625 1.2093
No log 82.0 410 1.4806 0.4286 1.4806 1.2168
No log 82.4 412 1.5072 0.4286 1.5072 1.2277
No log 82.8 414 1.5155 0.4252 1.5155 1.2311
No log 83.2 416 1.5003 0.4286 1.5003 1.2249
No log 83.6 418 1.4845 0.4286 1.4845 1.2184
No log 84.0 420 1.4917 0.4286 1.4917 1.2214
No log 84.4 422 1.4978 0.4286 1.4978 1.2238
No log 84.8 424 1.5021 0.4286 1.5021 1.2256
No log 85.2 426 1.5129 0.4252 1.5129 1.2300
No log 85.6 428 1.5225 0.4252 1.5225 1.2339
No log 86.0 430 1.5216 0.4252 1.5216 1.2335
No log 86.4 432 1.5251 0.4252 1.5251 1.2350
No log 86.8 434 1.5255 0.4252 1.5255 1.2351
No log 87.2 436 1.5105 0.4219 1.5105 1.2290
No log 87.6 438 1.5008 0.4219 1.5008 1.2251
No log 88.0 440 1.4849 0.4219 1.4849 1.2186
No log 88.4 442 1.4561 0.4252 1.4561 1.2067
No log 88.8 444 1.4312 0.4724 1.4312 1.1963
No log 89.2 446 1.4146 0.4724 1.4146 1.1894
No log 89.6 448 1.4060 0.4724 1.4060 1.1858
No log 90.0 450 1.4035 0.4724 1.4035 1.1847
No log 90.4 452 1.3963 0.4724 1.3963 1.1817
No log 90.8 454 1.3937 0.4724 1.3937 1.1806
No log 91.2 456 1.3922 0.4762 1.3922 1.1799
No log 91.6 458 1.4011 0.4762 1.4011 1.1837
No log 92.0 460 1.4063 0.4762 1.4063 1.1859
No log 92.4 462 1.4200 0.4286 1.4200 1.1916
No log 92.8 464 1.4296 0.4252 1.4296 1.1957
No log 93.2 466 1.4387 0.4252 1.4387 1.1994
No log 93.6 468 1.4552 0.4531 1.4552 1.2063
No log 94.0 470 1.4797 0.4651 1.4797 1.2164
No log 94.4 472 1.5017 0.4651 1.5017 1.2254
No log 94.8 474 1.5206 0.4308 1.5206 1.2331
No log 95.2 476 1.5410 0.4308 1.5410 1.2414
No log 95.6 478 1.5531 0.4308 1.5531 1.2462
No log 96.0 480 1.5550 0.4308 1.5550 1.2470
No log 96.4 482 1.5513 0.4308 1.5513 1.2455
No log 96.8 484 1.5418 0.4308 1.5418 1.2417
No log 97.2 486 1.5292 0.4688 1.5292 1.2366
No log 97.6 488 1.5179 0.4688 1.5179 1.2320
No log 98.0 490 1.5100 0.4651 1.5100 1.2288
No log 98.4 492 1.5027 0.4651 1.5027 1.2258
No log 98.8 494 1.4963 0.4375 1.4963 1.2232
No log 99.2 496 1.4932 0.4375 1.4932 1.2220
No log 99.6 498 1.4915 0.4375 1.4915 1.2213
0.2698 100.0 500 1.4905 0.4375 1.4905 1.2209

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
4
Safetensors
Model size
135M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k1_task1_organization

Finetuned
(4205)
this model