ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k2_task1_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02. The training dataset is not specified in the card metadata. It achieves the following results on the evaluation set (the values match the last logged evaluation, at step 526):

  • Loss: 1.6570
  • Qwk: 0.3158
  • Mse: 1.6570
  • Rmse: 1.2873
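
The reported loss equals the MSE, which suggests (but does not confirm) that the model was trained as a regressor with a mean-squared-error objective. The sketch below shows how the three metrics relate, assuming Qwk stands for quadratic weighted kappa computed on integer scores. The toy labels and the number of score levels are hypothetical and are not taken from the evaluation set.

```python
import math
from collections import Counter


def quadratic_weighted_kappa(y_true, y_pred, num_labels):
    """Cohen's kappa with quadratic weights for integer labels 0..num_labels-1."""
    n = len(y_true)
    observed = Counter(zip(y_true, y_pred))   # joint counts of (true, predicted)
    true_hist = Counter(y_true)               # marginal counts of true labels
    pred_hist = Counter(y_pred)               # marginal counts of predictions
    numerator = 0.0
    denominator = 0.0
    for i in range(num_labels):
        for j in range(num_labels):
            weight = (i - j) ** 2 / (num_labels - 1) ** 2
            expected = true_hist[i] * pred_hist[j] / n  # count expected by chance
            numerator += weight * observed[(i, j)]
            denominator += weight * expected
    if denominator == 0:
        raise ValueError("kappa is undefined when all labels fall in one class")
    return 1.0 - numerator / denominator


def mse(y_true, y_pred):
    """Mean squared error between two equally long sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy scores on a 4-level scale: perfect agreement gives kappa = 1.
perfect = quadratic_weighted_kappa([0, 1, 2, 3, 2, 1], [0, 1, 2, 3, 2, 1], 4)

# Hypothetical toy scores on a 2-level scale: chance-level agreement gives kappa = 0.
chance = quadratic_weighted_kappa([0, 0, 1, 1], [0, 1, 0, 1], 2)

# RMSE is the square root of MSE. Using the rounded MSE from the card gives
# about 1.2872, consistent with the reported 1.2873 up to rounding.
rmse_from_card = math.sqrt(1.6570)

print(perfect, chance, round(rmse_from_card, 4))
```

If the model outputs continuous scores, they would need to be rounded to integer labels before computing kappa; whether the original evaluation did this is not stated in the card.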

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
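
As a rough illustration, the hyperparameters above can be written as keyword arguments using the parameter names of `transformers.TrainingArguments`. Mapping `train_batch_size` and `eval_batch_size` to the per-device arguments is an assumption, as is the absence of warmup in the linear schedule. The figure of 900 total steps is also an assumption: it takes the 9 optimizer steps per epoch seen in the training log (step 18 at epoch 2.0) and multiplies by the 100 configured epochs.

```python
# Keyword arguments mirroring the listed hyperparameters.
# Assumption: the card's train/eval batch sizes are per-device values.
training_kwargs = {
    "learning_rate": 2e-05,
    "per_device_train_batch_size": 8,
    "per_device_eval_batch_size": 8,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 100,
}


def linear_lr(step, total_steps, base_lr=2e-05, warmup_steps=0):
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Assumed schedule length: 9 steps per epoch * 100 epochs.
TOTAL_STEPS = 900

for step in (0, 450, 526, 900):
    print(step, linear_lr(step, TOTAL_STEPS))
```

The dictionary could be passed as `transformers.TrainingArguments(**training_kwargs)` together with an `output_dir` of your choosing; that call is left out here so the sketch runs without the library installed.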

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.2222 2 7.2663 -0.0211 7.2663 2.6956
No log 0.4444 4 4.4662 0.0769 4.4662 2.1133
No log 0.6667 6 3.2771 -0.0328 3.2771 1.8103
No log 0.8889 8 4.0667 -0.0478 4.0667 2.0166
No log 1.1111 10 3.1854 0.0637 3.1854 1.7848
No log 1.3333 12 1.8423 0.0367 1.8423 1.3573
No log 1.5556 14 1.7113 0.1165 1.7113 1.3081
No log 1.7778 16 1.8216 0.1165 1.8216 1.3497
No log 2.0 18 1.9718 0.1682 1.9718 1.4042
No log 2.2222 20 1.9978 0.1495 1.9978 1.4134
No log 2.4444 22 2.0375 0.125 2.0375 1.4274
No log 2.6667 24 2.2224 0.0451 2.2224 1.4908
No log 2.8889 26 2.3459 0.0 2.3459 1.5316
No log 3.1111 28 2.2831 0.0588 2.2831 1.5110
No log 3.3333 30 2.0681 0.2063 2.0681 1.4381
No log 3.5556 32 1.8135 0.1982 1.8135 1.3467
No log 3.7778 34 1.7493 0.2807 1.7493 1.3226
No log 4.0 36 1.6548 0.2807 1.6548 1.2864
No log 4.2222 38 1.7410 0.3443 1.7410 1.3195
No log 4.4444 40 1.8292 0.3175 1.8292 1.3525
No log 4.6667 42 1.8931 0.2707 1.8931 1.3759
No log 4.8889 44 1.6615 0.352 1.6615 1.2890
No log 5.1111 46 1.3972 0.3967 1.3972 1.1820
No log 5.3333 48 1.3776 0.3934 1.3776 1.1737
No log 5.5556 50 1.6284 0.4203 1.6284 1.2761
No log 5.7778 52 1.2750 0.3840 1.2750 1.1292
No log 6.0 54 1.0499 0.5968 1.0499 1.0246
No log 6.2222 56 1.0287 0.5920 1.0287 1.0143
No log 6.4444 58 1.2256 0.4160 1.2256 1.1071
No log 6.6667 60 1.1259 0.5238 1.1259 1.0611
No log 6.8889 62 1.0740 0.6614 1.0740 1.0363
No log 7.1111 64 1.3239 0.4923 1.3239 1.1506
No log 7.3333 66 1.1462 0.5538 1.1462 1.0706
No log 7.5556 68 1.2222 0.5 1.2222 1.1055
No log 7.7778 70 1.4998 0.3529 1.4998 1.2247
No log 8.0 72 1.0615 0.5538 1.0615 1.0303
No log 8.2222 74 1.3203 0.4923 1.3203 1.1491
No log 8.4444 76 1.5445 0.3529 1.5445 1.2428
No log 8.6667 78 1.2190 0.5075 1.2190 1.1041
No log 8.8889 80 1.0744 0.6119 1.0744 1.0365
No log 9.1111 82 1.0349 0.6222 1.0349 1.0173
No log 9.3333 84 1.2702 0.5075 1.2702 1.1270
No log 9.5556 86 1.4434 0.4242 1.4434 1.2014
No log 9.7778 88 1.1324 0.6567 1.1324 1.0641
No log 10.0 90 1.0380 0.5984 1.0380 1.0188
No log 10.2222 92 1.1113 0.6357 1.1113 1.0542
No log 10.4444 94 1.2308 0.6094 1.2308 1.1094
No log 10.6667 96 1.1946 0.5984 1.1946 1.0930
No log 10.8889 98 1.2685 0.6094 1.2685 1.1263
No log 11.1111 100 1.1533 0.6308 1.1533 1.0739
No log 11.3333 102 1.0641 0.6466 1.0641 1.0315
No log 11.5556 104 1.0392 0.6617 1.0392 1.0194
No log 11.7778 106 1.2300 0.5037 1.2300 1.1091
No log 12.0 108 1.4975 0.3830 1.4975 1.2237
No log 12.2222 110 1.4588 0.3857 1.4588 1.2078
No log 12.4444 112 1.0691 0.5778 1.0691 1.0340
No log 12.6667 114 1.0303 0.5612 1.0303 1.0150
No log 12.8889 116 1.3156 0.4559 1.3156 1.1470
No log 13.1111 118 1.5062 0.3824 1.5062 1.2273
No log 13.3333 120 1.5151 0.4148 1.5151 1.2309
No log 13.5556 122 1.6788 0.2615 1.6788 1.2957
No log 13.7778 124 1.6420 0.3016 1.6420 1.2814
No log 14.0 126 1.4505 0.4806 1.4505 1.2044
No log 14.2222 128 1.2558 0.4138 1.2558 1.1206
No log 14.4444 130 1.2138 0.4746 1.2138 1.1017
No log 14.6667 132 1.3395 0.5116 1.3395 1.1574
No log 14.8889 134 1.5972 0.3433 1.5972 1.2638
No log 15.1111 136 1.5112 0.4211 1.5112 1.2293
No log 15.3333 138 1.1919 0.528 1.1919 1.0918
No log 15.5556 140 1.2476 0.4132 1.2476 1.1170
No log 15.7778 142 1.2002 0.4590 1.2002 1.0955
No log 16.0 144 1.1967 0.5079 1.1967 1.0939
No log 16.2222 146 1.4084 0.4328 1.4084 1.1868
No log 16.4444 148 1.7825 0.2837 1.7825 1.3351
No log 16.6667 150 1.6727 0.3333 1.6727 1.2933
No log 16.8889 152 1.3220 0.5039 1.3220 1.1498
No log 17.1111 154 1.2093 0.4370 1.2093 1.0997
No log 17.3333 156 1.2301 0.4500 1.2301 1.1091
No log 17.5556 158 1.1720 0.5484 1.1720 1.0826
No log 17.7778 160 1.4883 0.4179 1.4883 1.2200
No log 18.0 162 1.7323 0.3022 1.7323 1.3162
No log 18.2222 164 1.5640 0.3688 1.5640 1.2506
No log 18.4444 166 1.3301 0.4348 1.3301 1.1533
No log 18.6667 168 1.2347 0.5 1.2347 1.1112
No log 18.8889 170 1.2923 0.4925 1.2923 1.1368
No log 19.1111 172 1.2542 0.5625 1.2542 1.1199
No log 19.3333 174 1.2412 0.56 1.2412 1.1141
No log 19.5556 176 1.1971 0.4274 1.1971 1.0941
No log 19.7778 178 1.1842 0.4407 1.1842 1.0882
No log 20.0 180 1.2350 0.5323 1.2350 1.1113
No log 20.2222 182 1.3730 0.5385 1.3730 1.1718
No log 20.4444 184 1.3414 0.5344 1.3414 1.1582
No log 20.6667 186 1.1615 0.5426 1.1615 1.0777
No log 20.8889 188 1.1118 0.5669 1.1118 1.0544
No log 21.1111 190 1.0957 0.5669 1.0957 1.0467
No log 21.3333 192 1.2583 0.5152 1.2583 1.1217
No log 21.5556 194 1.5303 0.3650 1.5303 1.2370
No log 21.7778 196 1.4400 0.4427 1.4400 1.2000
No log 22.0 198 1.2831 0.5354 1.2831 1.1327
No log 22.2222 200 1.2562 0.528 1.2562 1.1208
No log 22.4444 202 1.2449 0.5484 1.2449 1.1158
No log 22.6667 204 1.2682 0.528 1.2682 1.1262
No log 22.8889 206 1.3228 0.5156 1.3228 1.1501
No log 23.1111 208 1.3074 0.5354 1.3074 1.1434
No log 23.3333 210 1.2367 0.5354 1.2367 1.1121
No log 23.5556 212 1.1699 0.5484 1.1699 1.0816
No log 23.7778 214 1.2005 0.5397 1.2005 1.0957
No log 24.0 216 1.2816 0.5354 1.2816 1.1321
No log 24.2222 218 1.3371 0.5512 1.3371 1.1563
No log 24.4444 220 1.4726 0.4769 1.4726 1.2135
No log 24.6667 222 1.6334 0.3212 1.6334 1.2781
No log 24.8889 224 1.5481 0.4361 1.5481 1.2442
No log 25.1111 226 1.3060 0.5469 1.3060 1.1428
No log 25.3333 228 1.2015 0.4034 1.2015 1.0961
No log 25.5556 230 1.2057 0.3866 1.2057 1.0981
No log 25.7778 232 1.1725 0.544 1.1725 1.0828
No log 26.0 234 1.3169 0.5426 1.3169 1.1476
No log 26.2222 236 1.6427 0.3043 1.6427 1.2817
No log 26.4444 238 1.6821 0.3043 1.6821 1.2970
No log 26.6667 240 1.4023 0.4627 1.4023 1.1842
No log 26.8889 242 1.1479 0.5600 1.1479 1.0714
No log 27.1111 244 1.1645 0.4878 1.1645 1.0791
No log 27.3333 246 1.1991 0.5082 1.1991 1.0950
No log 27.5556 248 1.3546 0.4882 1.3546 1.1639
No log 27.7778 250 1.5494 0.3556 1.5494 1.2448
No log 28.0 252 1.5038 0.3969 1.5038 1.2263
No log 28.2222 254 1.3506 0.5079 1.3506 1.1621
No log 28.4444 256 1.2954 0.4500 1.2954 1.1382
No log 28.6667 258 1.3097 0.4628 1.3097 1.1444
No log 28.8889 260 1.4130 0.48 1.4130 1.1887
No log 29.1111 262 1.5679 0.3759 1.5679 1.2522
No log 29.3333 264 1.5841 0.3485 1.5841 1.2586
No log 29.5556 266 1.3947 0.4094 1.3947 1.1810
No log 29.7778 268 1.2505 0.4677 1.2505 1.1183
No log 30.0 270 1.2195 0.4839 1.2195 1.1043
No log 30.2222 272 1.2564 0.5426 1.2564 1.1209
No log 30.4444 274 1.4604 0.3969 1.4604 1.2085
No log 30.6667 276 1.7069 0.2774 1.7069 1.3065
No log 30.8889 278 1.6632 0.2774 1.6632 1.2897
No log 31.1111 280 1.4539 0.3969 1.4539 1.2058
No log 31.3333 282 1.2501 0.5312 1.2501 1.1181
No log 31.5556 284 1.2343 0.5556 1.2343 1.1110
No log 31.7778 286 1.2425 0.5556 1.2425 1.1147
No log 32.0 288 1.3510 0.4651 1.3510 1.1623
No log 32.2222 290 1.4202 0.4 1.4202 1.1917
No log 32.4444 292 1.3874 0.4275 1.3874 1.1779
No log 32.6667 294 1.3622 0.4275 1.3622 1.1671
No log 32.8889 296 1.4183 0.4394 1.4183 1.1909
No log 33.1111 298 1.4904 0.3582 1.4904 1.2208
No log 33.3333 300 1.4757 0.3910 1.4757 1.2148
No log 33.5556 302 1.3957 0.4733 1.3957 1.1814
No log 33.7778 304 1.3850 0.4769 1.3850 1.1769
No log 34.0 306 1.4696 0.4179 1.4696 1.2123
No log 34.2222 308 1.5155 0.3910 1.5155 1.2311
No log 34.4444 310 1.4434 0.4427 1.4434 1.2014
No log 34.6667 312 1.3019 0.5041 1.3019 1.1410
No log 34.8889 314 1.2779 0.4793 1.2779 1.1305
No log 35.1111 316 1.2471 0.5203 1.2471 1.1167
No log 35.3333 318 1.2837 0.5197 1.2837 1.1330
No log 35.5556 320 1.4231 0.4361 1.4231 1.1929
No log 35.7778 322 1.6236 0.3382 1.6236 1.2742
No log 36.0 324 1.7741 0.3165 1.7741 1.3319
No log 36.2222 326 1.6918 0.3188 1.6918 1.3007
No log 36.4444 328 1.3893 0.4662 1.3893 1.1787
No log 36.6667 330 1.2711 0.5426 1.2711 1.1274
No log 36.8889 332 1.2983 0.5039 1.2983 1.1394
No log 37.1111 334 1.3804 0.4375 1.3804 1.1749
No log 37.3333 336 1.4864 0.4122 1.4864 1.2192
No log 37.5556 338 1.4711 0.4122 1.4711 1.2129
No log 37.7778 340 1.3454 0.4375 1.3454 1.1599
No log 38.0 342 1.3039 0.4567 1.3039 1.1419
No log 38.2222 344 1.2989 0.4651 1.2989 1.1397
No log 38.4444 346 1.4032 0.4122 1.4032 1.1846
No log 38.6667 348 1.4156 0.4122 1.4156 1.1898
No log 38.8889 350 1.4382 0.4122 1.4382 1.1992
No log 39.1111 352 1.4578 0.4462 1.4578 1.2074
No log 39.3333 354 1.4024 0.4252 1.4024 1.1842
No log 39.5556 356 1.4255 0.4252 1.4255 1.1939
No log 39.7778 358 1.4244 0.4252 1.4244 1.1935
No log 40.0 360 1.4956 0.3937 1.4956 1.2229
No log 40.2222 362 1.4750 0.3937 1.4750 1.2145
No log 40.4444 364 1.3750 0.4262 1.3750 1.1726
No log 40.6667 366 1.3154 0.5041 1.3154 1.1469
No log 40.8889 368 1.3254 0.4640 1.3254 1.1512
No log 41.1111 370 1.4015 0.4496 1.4015 1.1839
No log 41.3333 372 1.4270 0.4154 1.4270 1.1946
No log 41.5556 374 1.5012 0.3636 1.5012 1.2252
No log 41.7778 376 1.4983 0.3636 1.4983 1.2240
No log 42.0 378 1.3988 0.4341 1.3988 1.1827
No log 42.2222 380 1.2762 0.5041 1.2762 1.1297
No log 42.4444 382 1.2611 0.5323 1.2611 1.1230
No log 42.6667 384 1.3095 0.4640 1.3095 1.1443
No log 42.8889 386 1.4453 0.4341 1.4453 1.2022
No log 43.1111 388 1.5632 0.3609 1.5632 1.2503
No log 43.3333 390 1.6043 0.3609 1.6043 1.2666
No log 43.5556 392 1.5322 0.4154 1.5322 1.2378
No log 43.7778 394 1.4660 0.4032 1.4660 1.2108
No log 44.0 396 1.4375 0.4667 1.4375 1.1990
No log 44.2222 398 1.4249 0.4538 1.4249 1.1937
No log 44.4444 400 1.4092 0.4667 1.4092 1.1871
No log 44.6667 402 1.4043 0.4603 1.4043 1.1850
No log 44.8889 404 1.4267 0.4094 1.4267 1.1945
No log 45.1111 406 1.4248 0.4375 1.4248 1.1937
No log 45.3333 408 1.4365 0.4662 1.4365 1.1985
No log 45.5556 410 1.4309 0.4662 1.4309 1.1962
No log 45.7778 412 1.4952 0.3824 1.4952 1.2228
No log 46.0 414 1.6416 0.3111 1.6416 1.2812
No log 46.2222 416 1.6835 0.3088 1.6835 1.2975
No log 46.4444 418 1.5846 0.3382 1.5846 1.2588
No log 46.6667 420 1.5310 0.3582 1.5310 1.2373
No log 46.8889 422 1.4590 0.3906 1.4590 1.2079
No log 47.1111 424 1.4364 0.4333 1.4364 1.1985
No log 47.3333 426 1.4443 0.4333 1.4443 1.2018
No log 47.5556 428 1.4738 0.3802 1.4738 1.2140
No log 47.7778 430 1.5312 0.3906 1.5312 1.2374
No log 48.0 432 1.6427 0.3459 1.6427 1.2817
No log 48.2222 434 1.7490 0.3088 1.7490 1.3225
No log 48.4444 436 1.7551 0.3088 1.7551 1.3248
No log 48.6667 438 1.7044 0.3088 1.7044 1.3055
No log 48.8889 440 1.5715 0.3459 1.5715 1.2536
No log 49.1111 442 1.4098 0.4219 1.4098 1.1874
No log 49.3333 444 1.3063 0.4844 1.3063 1.1429
No log 49.5556 446 1.3003 0.5079 1.3003 1.1403
No log 49.7778 448 1.3326 0.4444 1.3326 1.1544
No log 50.0 450 1.3825 0.4127 1.3825 1.1758
No log 50.2222 452 1.4454 0.3651 1.4454 1.2022
No log 50.4444 454 1.5430 0.3969 1.5430 1.2422
No log 50.6667 456 1.6176 0.3459 1.6176 1.2718
No log 50.8889 458 1.6669 0.3459 1.6669 1.2911
No log 51.1111 460 1.5961 0.3636 1.5961 1.2634
No log 51.3333 462 1.4534 0.4031 1.4534 1.2056
No log 51.5556 464 1.3950 0.3937 1.3950 1.1811
No log 51.7778 466 1.3953 0.4160 1.3953 1.1812
No log 52.0 468 1.3988 0.4160 1.3988 1.1827
No log 52.2222 470 1.4109 0.4160 1.4109 1.1878
No log 52.4444 472 1.4463 0.4094 1.4463 1.2026
No log 52.6667 474 1.4753 0.4031 1.4753 1.2146
No log 52.8889 476 1.4753 0.4031 1.4753 1.2146
No log 53.1111 478 1.4259 0.4375 1.4259 1.1941
No log 53.3333 480 1.4060 0.4252 1.4060 1.1858
No log 53.5556 482 1.3696 0.4444 1.3696 1.1703
No log 53.7778 484 1.3928 0.4252 1.3928 1.1802
No log 54.0 486 1.4395 0.4252 1.4395 1.1998
No log 54.2222 488 1.5594 0.3846 1.5594 1.2487
No log 54.4444 490 1.6779 0.3459 1.6779 1.2953
No log 54.6667 492 1.6465 0.3788 1.6465 1.2832
No log 54.8889 494 1.5387 0.3846 1.5387 1.2404
No log 55.1111 496 1.4511 0.3740 1.4511 1.2046
No log 55.3333 498 1.4214 0.3103 1.4214 1.1922
0.3126 55.5556 500 1.4072 0.3729 1.4072 1.1863
0.3126 55.7778 502 1.4355 0.3871 1.4355 1.1981
0.3126 56.0 504 1.4603 0.3651 1.4603 1.2084
0.3126 56.2222 506 1.4646 0.3906 1.4646 1.2102
0.3126 56.4444 508 1.4545 0.3622 1.4545 1.2060
0.3126 56.6667 510 1.4532 0.3622 1.4532 1.2055
0.3126 56.8889 512 1.4093 0.3651 1.4093 1.1871
0.3126 57.1111 514 1.3866 0.3651 1.3866 1.1775
0.3126 57.3333 516 1.3843 0.3968 1.3843 1.1766
0.3126 57.5556 518 1.4150 0.368 1.4150 1.1895
0.3126 57.7778 520 1.4795 0.3360 1.4795 1.2164
0.3126 58.0 522 1.5610 0.3411 1.5610 1.2494
0.3126 58.2222 524 1.6509 0.3158 1.6509 1.2849
0.3126 58.4444 526 1.6570 0.3158 1.6570 1.2873
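
The rows above are whitespace-separated, and the first cell is either a number or the two words "No log" (the training loss appears only from step 500 onward, presumably because of the logging interval). The following sketch parses a handful of rows copied verbatim from the table and picks out the evaluation with the highest Qwk and the one with the lowest validation loss. The helper name `parse_row` is hypothetical, and the excerpt is not the full log.

```python
# A few rows copied verbatim from the table above (not the full log).
LOG_EXCERPT = """\
No log 0.2222 2 7.2663 -0.0211 7.2663 2.6956
No log 2.0 18 1.9718 0.1682 1.9718 1.4042
No log 6.2222 56 1.0287 0.5920 1.0287 1.0143
No log 11.5556 104 1.0392 0.6617 1.0392 1.0194
0.3126 55.5556 500 1.4072 0.3729 1.4072 1.1863
0.3126 58.4444 526 1.6570 0.3158 1.6570 1.2873
"""


def parse_row(line):
    """Split one log row into a dict; a 'No log' training loss becomes None."""
    tokens = line.split()
    # The last six tokens are always numeric. Whatever precedes them is the
    # training-loss cell: either one number or the two words "No log".
    epoch, step, val_loss, qwk, mse, rmse = tokens[-6:]
    train_cell = " ".join(tokens[:-6])
    return {
        "train_loss": None if train_cell == "No log" else float(train_cell),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "qwk": float(qwk),
        "mse": float(mse),
        "rmse": float(rmse),
    }


rows = [parse_row(line) for line in LOG_EXCERPT.splitlines() if line.strip()]

best_qwk = max(rows, key=lambda r: r["qwk"])
lowest_loss = min(rows, key=lambda r: r["val_loss"])

# Step 18 falls exactly at epoch 2.0, so there are 9 optimizer steps per epoch.
steps_per_epoch = round(rows[1]["step"] / rows[1]["epoch"])

print("best Qwk:", best_qwk["qwk"], "at step", best_qwk["step"])
print("lowest validation loss:", lowest_loss["val_loss"], "at step", lowest_loss["step"])
print("steps per epoch:", steps_per_epoch)
```

In this excerpt the best Qwk (0.6617 at step 104) is well above the final-step value of 0.3158 reported at the top of the card, so the headline metrics describe the last evaluation rather than the best one.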

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
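
To check a local environment against these versions, something like the following could be used. It is a minimal sketch using only the standard library; the assumption that "Pytorch" corresponds to the `torch` distribution on PyPI is mine, and the helper name `compare_versions` is hypothetical.

```python
from importlib import metadata

# Versions listed in the card, keyed by PyPI distribution name.
# Assumption: "Pytorch" is installed under the distribution name "torch".
PINNED = {
    "transformers": "4.44.2",
    "torch": "2.4.0+cu118",
    "datasets": "2.21.0",
    "tokenizers": "0.19.1",
}


def compare_versions(pinned=PINNED):
    """Return {name: (expected, installed_or_None, matches)} for each package."""
    report = {}
    for name, expected in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[name] = (expected, installed, installed == expected)
    return report


report = compare_versions()
for name, (expected, installed, matches) in report.items():
    status = "ok" if matches else "differs"
    print(f"{name}: expected {expected}, found {installed} ({status})")
```

A mismatch does not necessarily mean the model will fail to load; the comparison only flags differences from the environment recorded in the card.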

Model size

  • 135M params (Safetensors)
  • Tensor type: F32

Model repository: MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run1_AugV5_k2_task1_organization (fine-tuned from aubmindlab/bert-base-arabertv02)