ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run3_AugV5_k11_task2_organization

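This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on an unspecified dataset (the auto-generated card lists the dataset as None). It achieves the following results on the evaluation set; the figures match the last logged evaluation (step 510) in the training results: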

  • Loss: 1.5456
  • Qwk (quadratic weighted kappa): 0.2268
  • Mse (mean squared error): 1.5456
  • Rmse (root mean squared error): 1.2432
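
The card does not say how these metrics were computed. The following is a minimal pure-Python sketch of the three definitions, assuming integer gold scores and predictions already rounded to integers (whether the author rounded regression outputs this way is not stated). The function names and the toy score lists are hypothetical. Note that the reported Loss equals the Mse, which suggests a mean-squared-error training objective, and that the Rmse is the square root of the Mse.

```python
import math
from collections import Counter


def mse(y_true, y_pred):
    """Mean squared error between gold and predicted scores."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


def quadratic_weighted_kappa(y_true, y_pred):
    """Cohen's kappa with quadratic weights for integer scores."""
    total = len(y_true)
    true_hist = Counter(y_true)
    pred_hist = Counter(y_pred)
    # Observed disagreement: squared distance of each (gold, predicted) pair.
    observed = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Expected disagreement if gold and predicted scores were independent.
    expected = sum(
        true_hist[i] * pred_hist[j] * (i - j) ** 2
        for i in true_hist
        for j in pred_hist
    ) / total
    if expected == 0:
        return float("nan")  # undefined when both sides use one identical score
    return 1.0 - observed / expected


# Hypothetical integer scores, for illustration only.
gold = [0, 1, 2, 3]
print(quadratic_weighted_kappa(gold, [0, 1, 2, 3]))  # perfect agreement
print(quadratic_weighted_kappa(gold, [3, 2, 1, 0]))  # fully reversed ranking
print(round(math.sqrt(1.5456), 4))  # RMSE implied by the reported MSE
```

A kappa near 0 means agreement no better than chance, so the reported 0.2268 indicates only weak agreement with the gold scores.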

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100
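
As a rough sketch, the listed values could be mapped onto `transformers.TrainingArguments` keywords as shown here. The mapping is an assumption: the card reports `train_batch_size` and `num_epochs`, while the keyword names `per_device_train_batch_size` and `num_train_epochs` are the ones the library accepts. The helper names, the `outputs` directory, and a warmup of 0 steps are hypothetical. The figure of 39 steps per epoch is read from the training results (step 78 falls at epoch 2.0).

```python
# Values copied from the hyperparameter list; the keyword names are an assumed
# mapping onto transformers.TrainingArguments.
HYPERPARAMETERS = {
    "learning_rate": 2e-05,
    "per_device_train_batch_size": 8,  # card: train_batch_size
    "per_device_eval_batch_size": 8,   # card: eval_batch_size
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 100,           # card: num_epochs
}


def build_training_arguments(output_dir="outputs"):
    """Build TrainingArguments lazily so the dict is usable without transformers."""
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **HYPERPARAMETERS)


def linear_lr(step, total_steps, base_lr=2e-05, warmup_steps=0):
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# 39 steps per epoch (from the results table) x 100 epochs = 3900 planned steps.
PLANNED_STEPS = 39 * 100
print(linear_lr(510, PLANNED_STEPS))  # rate at the last logged step, if warmup is 0
```

Under these assumptions the learning rate would still be well above zero at step 510, where the log ends, so the run appears to have stopped long before the 100 planned epochs.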

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0513 2 4.4834 -0.0041 4.4834 2.1174
No log 0.1026 4 2.5992 0.0025 2.5992 1.6122
No log 0.1538 6 2.3325 -0.0796 2.3325 1.5273
No log 0.2051 8 2.3149 -0.0409 2.3149 1.5215
No log 0.2564 10 1.6127 0.0372 1.6127 1.2699
No log 0.3077 12 1.6593 0.0 1.6593 1.2881
No log 0.3590 14 1.8722 0.0 1.8722 1.3683
No log 0.4103 16 1.6836 -0.0149 1.6836 1.2975
No log 0.4615 18 1.5884 0.0169 1.5884 1.2603
No log 0.5128 20 1.4459 0.1165 1.4459 1.2025
No log 0.5641 22 1.3985 0.1045 1.3985 1.1826
No log 0.6154 24 1.2420 0.1848 1.2420 1.1145
No log 0.6667 26 1.2900 0.1753 1.2900 1.1358
No log 0.7179 28 1.6759 0.1196 1.6759 1.2946
No log 0.7692 30 2.0507 -0.0108 2.0507 1.4320
No log 0.8205 32 1.6363 0.0575 1.6363 1.2792
No log 0.8718 34 1.3468 0.1045 1.3468 1.1605
No log 0.9231 36 1.3142 0.1045 1.3142 1.1464
No log 0.9744 38 1.3150 0.1045 1.3150 1.1467
No log 1.0256 40 1.3140 0.1045 1.3140 1.1463
No log 1.0769 42 1.4598 0.1051 1.4598 1.2082
No log 1.1282 44 1.6539 0.0 1.6539 1.2860
No log 1.1795 46 1.4092 0.1168 1.4092 1.1871
No log 1.2308 48 1.3205 0.1743 1.3205 1.1491
No log 1.2821 50 1.3828 0.1168 1.3828 1.1759
No log 1.3333 52 1.5109 0.1280 1.5109 1.2292
No log 1.3846 54 1.3441 0.1379 1.3441 1.1594
No log 1.4359 56 1.1830 0.2333 1.1830 1.0876
No log 1.4872 58 1.2262 0.2095 1.2262 1.1073
No log 1.5385 60 1.3939 0.1530 1.3939 1.1806
No log 1.5897 62 1.5760 0.1585 1.5760 1.2554
No log 1.6410 64 1.4008 0.1346 1.4008 1.1835
No log 1.6923 66 1.3121 0.1622 1.3121 1.1455
No log 1.7436 68 1.3129 0.1622 1.3129 1.1458
No log 1.7949 70 1.2420 0.2150 1.2420 1.1145
No log 1.8462 72 1.1527 0.2815 1.1527 1.0736
No log 1.8974 74 1.1991 0.3056 1.1991 1.0950
No log 1.9487 76 1.3763 0.3192 1.3763 1.1731
No log 2.0 78 1.8519 0.1517 1.8519 1.3608
No log 2.0513 80 1.7547 0.1639 1.7547 1.3246
No log 2.1026 82 1.5410 0.1698 1.5410 1.2414
No log 2.1538 84 1.4642 0.1670 1.4642 1.2100
No log 2.2051 86 1.2937 0.1743 1.2937 1.1374
No log 2.2564 88 1.1954 0.1753 1.1954 1.0933
No log 2.3077 90 1.4026 0.1670 1.4026 1.1843
No log 2.3590 92 1.4471 0.1670 1.4471 1.2030
No log 2.4103 94 1.4017 0.2417 1.4017 1.1839
No log 2.4615 96 1.5560 0.1768 1.5560 1.2474
No log 2.5128 98 1.7935 0.1735 1.7935 1.3392
No log 2.5641 100 1.4381 0.2103 1.4381 1.1992
No log 2.6154 102 1.1854 0.1456 1.1854 1.0888
No log 2.6667 104 1.1824 0.2502 1.1824 1.0874
No log 2.7179 106 1.2552 0.2521 1.2552 1.1203
No log 2.7692 108 1.8835 0.0219 1.8835 1.3724
No log 2.8205 110 2.3422 0.0212 2.3422 1.5304
No log 2.8718 112 2.0646 0.0412 2.0645 1.4369
No log 2.9231 114 1.4184 0.2644 1.4184 1.1910
No log 2.9744 116 1.2671 0.2511 1.2671 1.1256
No log 3.0256 118 1.3464 0.2779 1.3464 1.1603
No log 3.0769 120 1.6317 0.2033 1.6317 1.2774
No log 3.1282 122 1.9082 0.1248 1.9082 1.3814
No log 3.1795 124 1.7585 0.1386 1.7585 1.3261
No log 3.2308 126 1.5192 0.2004 1.5192 1.2326
No log 3.2821 128 1.6310 0.2217 1.6310 1.2771
No log 3.3333 130 1.6010 0.1545 1.6010 1.2653
No log 3.3846 132 1.4323 0.1149 1.4323 1.1968
No log 3.4359 134 1.4298 0.1146 1.4298 1.1957
No log 3.4872 136 1.7133 0.0974 1.7133 1.3089
No log 3.5385 138 1.7214 0.1079 1.7214 1.3120
No log 3.5897 140 1.4878 0.2091 1.4878 1.2197
No log 3.6410 142 1.5352 0.2181 1.5352 1.2390
No log 3.6923 144 1.7085 0.1756 1.7085 1.3071
No log 3.7436 146 1.7806 0.1297 1.7806 1.3344
No log 3.7949 148 1.4185 0.2022 1.4185 1.1910
No log 3.8462 150 1.2768 0.3160 1.2768 1.1299
No log 3.8974 152 1.2546 0.1185 1.2546 1.1201
No log 3.9487 154 1.3478 0.1711 1.3478 1.1610
No log 4.0 156 1.7560 0.1275 1.7560 1.3251
No log 4.0513 158 1.8124 0.0904 1.8124 1.3463
No log 4.1026 160 1.5499 0.1756 1.5499 1.2449
No log 4.1538 162 1.3315 0.2473 1.3315 1.1539
No log 4.2051 164 1.1781 0.2068 1.1781 1.0854
No log 4.2564 166 1.1560 0.1904 1.1560 1.0752
No log 4.3077 168 1.2204 0.2324 1.2204 1.1047
No log 4.3590 170 1.3249 0.1868 1.3249 1.1510
No log 4.4103 172 1.4607 0.2145 1.4607 1.2086
No log 4.4615 174 1.3980 0.2746 1.3980 1.1824
No log 4.5128 176 1.4023 0.2746 1.4023 1.1842
No log 4.5641 178 1.3062 0.3213 1.3062 1.1429
No log 4.6154 180 1.1706 0.2389 1.1706 1.0820
No log 4.6667 182 1.1881 0.2389 1.1881 1.0900
No log 4.7179 184 1.3309 0.2851 1.3309 1.1536
No log 4.7692 186 1.5873 0.1253 1.5873 1.2599
No log 4.8205 188 1.4719 0.2404 1.4719 1.2132
No log 4.8718 190 1.2392 0.2065 1.2392 1.1132
No log 4.9231 192 1.1972 0.2640 1.1972 1.0941
No log 4.9744 194 1.1849 0.2188 1.1849 1.0885
No log 5.0256 196 1.2685 0.2180 1.2685 1.1263
No log 5.0769 198 1.6104 0.1984 1.6104 1.2690
No log 5.1282 200 1.6692 0.1984 1.6692 1.2920
No log 5.1795 202 1.4764 0.1459 1.4764 1.2151
No log 5.2308 204 1.3485 0.2361 1.3485 1.1612
No log 5.2821 206 1.3078 0.3617 1.3078 1.1436
No log 5.3333 208 1.4150 0.3238 1.4150 1.1896
No log 5.3846 210 1.5538 0.2105 1.5538 1.2465
No log 5.4359 212 1.5156 0.2868 1.5156 1.2311
No log 5.4872 214 1.3937 0.3184 1.3937 1.1806
No log 5.5385 216 1.3687 0.2762 1.3687 1.1699
No log 5.5897 218 1.5092 0.1983 1.5092 1.2285
No log 5.6410 220 1.4754 0.2272 1.4754 1.2147
No log 5.6923 222 1.2371 0.2138 1.2371 1.1122
No log 5.7436 224 1.1311 0.1863 1.1311 1.0635
No log 5.7949 226 1.1455 0.2614 1.1455 1.0703
No log 5.8462 228 1.2681 0.1858 1.2681 1.1261
No log 5.8974 230 1.5683 0.1670 1.5683 1.2523
No log 5.9487 232 1.8691 0.0428 1.8691 1.3671
No log 6.0 234 1.8591 0.0943 1.8591 1.3635
No log 6.0513 236 1.5470 0.1694 1.5470 1.2438
No log 6.1026 238 1.4769 0.2191 1.4769 1.2153
No log 6.1538 240 1.5052 0.2053 1.5052 1.2269
No log 6.2051 242 1.5241 0.1686 1.5241 1.2346
No log 6.2564 244 1.4960 0.1581 1.4960 1.2231
No log 6.3077 246 1.4263 0.1290 1.4263 1.1943
No log 6.3590 248 1.3287 0.2035 1.3287 1.1527
No log 6.4103 250 1.3101 0.2128 1.3101 1.1446
No log 6.4615 252 1.2777 0.1595 1.2777 1.1304
No log 6.5128 254 1.3283 0.1201 1.3283 1.1525
No log 6.5641 256 1.5363 0.1228 1.5363 1.2395
No log 6.6154 258 1.6406 0.1583 1.6406 1.2808
No log 6.6667 260 1.7134 0.1446 1.7134 1.3090
No log 6.7179 262 1.6149 0.2060 1.6149 1.2708
No log 6.7692 264 1.6106 0.2060 1.6106 1.2691
No log 6.8205 266 1.5481 0.1738 1.5481 1.2442
No log 6.8718 268 1.4843 0.1599 1.4843 1.2183
No log 6.9231 270 1.4544 0.1492 1.4544 1.2060
No log 6.9744 272 1.4421 0.1547 1.4421 1.2009
No log 7.0256 274 1.2887 0.1436 1.2887 1.1352
No log 7.0769 276 1.2216 0.1264 1.2216 1.1053
No log 7.1282 278 1.3113 0.1809 1.3113 1.1451
No log 7.1795 280 1.3361 0.2181 1.3361 1.1559
No log 7.2308 282 1.3251 0.2094 1.3251 1.1511
No log 7.2821 284 1.2455 0.1845 1.2455 1.1160
No log 7.3333 286 1.2370 0.2405 1.2370 1.1122
No log 7.3846 288 1.2625 0.2384 1.2625 1.1236
No log 7.4359 290 1.3974 0.2564 1.3974 1.1821
No log 7.4872 292 1.4408 0.2564 1.4408 1.2003
No log 7.5385 294 1.2983 0.2851 1.2983 1.1394
No log 7.5897 296 1.1710 0.1928 1.1710 1.0821
No log 7.6410 298 1.1713 0.2021 1.1713 1.0823
No log 7.6923 300 1.2418 0.1982 1.2418 1.1144
No log 7.7436 302 1.2932 0.2467 1.2932 1.1372
No log 7.7949 304 1.3140 0.2049 1.3140 1.1463
No log 7.8462 306 1.2768 0.2049 1.2768 1.1300
No log 7.8974 308 1.2169 0.2035 1.2169 1.1031
No log 7.9487 310 1.1882 0.2315 1.1882 1.0900
No log 8.0 312 1.2440 0.2049 1.2440 1.1153
No log 8.0513 314 1.2311 0.2374 1.2311 1.1096
No log 8.1026 316 1.1800 0.2273 1.1800 1.0863
No log 8.1538 318 1.1862 0.2424 1.1862 1.0891
No log 8.2051 320 1.2733 0.2779 1.2733 1.1284
No log 8.2564 322 1.3240 0.2762 1.3240 1.1506
No log 8.3077 324 1.2031 0.2022 1.2031 1.0968
No log 8.3590 326 1.1474 0.2108 1.1474 1.0711
No log 8.4103 328 1.1369 0.2108 1.1369 1.0663
No log 8.4615 330 1.1275 0.1811 1.1275 1.0618
No log 8.5128 332 1.1223 0.2474 1.1223 1.0594
No log 8.5641 334 1.1483 0.2126 1.1483 1.0716
No log 8.6154 336 1.2382 0.2285 1.2382 1.1127
No log 8.6667 338 1.2130 0.2424 1.2130 1.1014
No log 8.7179 340 1.1333 0.2021 1.1333 1.0645
No log 8.7692 342 1.1209 0.2115 1.1209 1.0587
No log 8.8205 344 1.1325 0.2115 1.1325 1.0642
No log 8.8718 346 1.1528 0.2353 1.1528 1.0737
No log 8.9231 348 1.1680 0.2667 1.1680 1.0807
No log 8.9744 350 1.2489 0.3231 1.2489 1.1175
No log 9.0256 352 1.3251 0.3316 1.3251 1.1511
No log 9.0769 354 1.2770 0.3418 1.2770 1.1301
No log 9.1282 356 1.1410 0.2159 1.1410 1.0682
No log 9.1795 358 1.1279 0.2740 1.1279 1.0620
No log 9.2308 360 1.2106 0.2640 1.2106 1.1003
No log 9.2821 362 1.2406 0.2640 1.2406 1.1138
No log 9.3333 364 1.1422 0.2871 1.1422 1.0687
No log 9.3846 366 1.1201 0.2555 1.1201 1.0583
No log 9.4359 368 1.1632 0.2521 1.1632 1.0785
No log 9.4872 370 1.3067 0.1868 1.3067 1.1431
No log 9.5385 372 1.4079 0.2321 1.4079 1.1865
No log 9.5897 374 1.3729 0.1886 1.3729 1.1717
No log 9.6410 376 1.3546 0.1836 1.3546 1.1639
No log 9.6923 378 1.3956 0.1836 1.3956 1.1814
No log 9.7436 380 1.5066 0.2270 1.5066 1.2275
No log 9.7949 382 1.4680 0.1929 1.4680 1.2116
No log 9.8462 384 1.2966 0.1290 1.2966 1.1387
No log 9.8974 386 1.2121 0.1873 1.2121 1.1009
No log 9.9487 388 1.1853 0.1723 1.1853 1.0887
No log 10.0 390 1.2198 0.2030 1.2198 1.1045
No log 10.0513 392 1.3618 0.1512 1.3618 1.1670
No log 10.1026 394 1.4617 0.1256 1.4617 1.2090
No log 10.1538 396 1.3702 0.1599 1.3702 1.1706
No log 10.2051 398 1.2402 0.2495 1.2402 1.1136
No log 10.2564 400 1.1845 0.2198 1.1845 1.0883
No log 10.3077 402 1.1534 0.1912 1.1534 1.0740
No log 10.3590 404 1.1395 0.2263 1.1395 1.0675
No log 10.4103 406 1.1408 0.2126 1.1408 1.0681
No log 10.4615 408 1.2241 0.1905 1.2241 1.1064
No log 10.5128 410 1.2577 0.2010 1.2577 1.1215
No log 10.5641 412 1.1668 0.1841 1.1668 1.0802
No log 10.6154 414 1.1062 0.2745 1.1062 1.0518
No log 10.6667 416 1.1084 0.2296 1.1084 1.0528
No log 10.7179 418 1.1052 0.2296 1.1052 1.0513
No log 10.7692 420 1.1260 0.2263 1.1260 1.0611
No log 10.8205 422 1.2173 0.2560 1.2173 1.1033
No log 10.8718 424 1.2725 0.2140 1.2725 1.1281
No log 10.9231 426 1.2645 0.2140 1.2645 1.1245
No log 10.9744 428 1.2013 0.1982 1.2013 1.0961
No log 11.0256 430 1.1681 0.1967 1.1681 1.0808
No log 11.0769 432 1.1600 0.2306 1.1600 1.0770
No log 11.1282 434 1.1948 0.2030 1.1948 1.0931
No log 11.1795 436 1.3379 0.1675 1.3379 1.1567
No log 11.2308 438 1.4533 0.2153 1.4533 1.2055
No log 11.2821 440 1.5280 0.2366 1.5280 1.2361
No log 11.3333 442 1.5451 0.2436 1.5451 1.2430
No log 11.3846 444 1.4822 0.2110 1.4822 1.2175
No log 11.4359 446 1.3591 0.1862 1.3591 1.1658
No log 11.4872 448 1.3067 0.1706 1.3067 1.1431
No log 11.5385 450 1.2752 0.1889 1.2752 1.1292
No log 11.5897 452 1.2274 0.2315 1.2274 1.1079
No log 11.6410 454 1.2471 0.2130 1.2471 1.1167
No log 11.6923 456 1.3102 0.1738 1.3102 1.1446
No log 11.7436 458 1.3590 0.1886 1.3590 1.1658
No log 11.7949 460 1.2931 0.1084 1.2931 1.1371
No log 11.8462 462 1.2363 0.1651 1.2363 1.1119
No log 11.8974 464 1.2275 0.1928 1.2275 1.1079
No log 11.9487 466 1.2345 0.1595 1.2345 1.1111
No log 12.0 468 1.2998 0.0900 1.2998 1.1401
No log 12.0513 470 1.4114 0.1315 1.4114 1.1880
No log 12.1026 472 1.3942 0.1021 1.3942 1.1808
No log 12.1538 474 1.2747 0.1171 1.2747 1.1290
No log 12.2051 476 1.2520 0.1691 1.2520 1.1189
No log 12.2564 478 1.2942 0.1322 1.2942 1.1376
No log 12.3077 480 1.2257 0.1742 1.2257 1.1071
No log 12.3590 482 1.1630 0.2021 1.1630 1.0784
No log 12.4103 484 1.1301 0.2647 1.1301 1.0631
No log 12.4615 486 1.1219 0.2358 1.1219 1.0592
No log 12.5128 488 1.1935 0.2030 1.1935 1.0925
No log 12.5641 490 1.2108 0.2328 1.2108 1.1004
No log 12.6154 492 1.1621 0.1935 1.1621 1.0780
No log 12.6667 494 1.0918 0.2126 1.0918 1.0449
No log 12.7179 496 1.0340 0.3041 1.0340 1.0168
No log 12.7692 498 1.0306 0.3933 1.0306 1.0152
0.3207 12.8205 500 1.0345 0.3639 1.0345 1.0171
0.3207 12.8718 502 1.0550 0.2665 1.0550 1.0271
0.3207 12.9231 504 1.0980 0.2126 1.0980 1.0479
0.3207 12.9744 506 1.2369 0.3008 1.2369 1.1122
0.3207 13.0256 508 1.4696 0.2268 1.4696 1.2123
0.3207 13.0769 510 1.5456 0.2268 1.5456 1.2432
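
The rows above are whitespace-separated, with the training loss shown as "No log" until step 500 (consistent with a logging interval of 500 steps, though the card does not state it). A small sketch of how the rows could be parsed is given below; the `EvalRow` and `parse_row` names are hypothetical. It reads the six numeric columns from the right so that the two-word "No log" entry does not shift the fields. In the full table the lowest validation loss is 1.0306 at step 498 (Qwk 0.3933), whereas the headline results correspond to the last row (step 510).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EvalRow:
    train_loss: Optional[float]  # None where the log shows "No log"
    epoch: float
    step: int
    val_loss: float
    qwk: float
    mse: float
    rmse: float


def parse_row(line: str) -> EvalRow:
    """Parse one flattened table row, reading six numeric columns from the right."""
    tokens = line.split()
    head, tail = tokens[:-6], tokens[-6:]
    train_loss = None if head == ["No", "log"] else float(head[0])
    epoch, step, val_loss, qwk, mse, rmse = tail
    return EvalRow(train_loss, float(epoch), int(step), float(val_loss),
                   float(qwk), float(mse), float(rmse))


# A few rows copied verbatim from the table above.
ROWS = [
    "No log 2.0 78 1.8519 0.1517 1.8519 1.3608",
    "No log 12.7179 496 1.0340 0.3041 1.0340 1.0168",
    "No log 12.7692 498 1.0306 0.3933 1.0306 1.0152",
    "0.3207 12.8205 500 1.0345 0.3639 1.0345 1.0171",
    "0.3207 13.0769 510 1.5456 0.2268 1.5456 1.2432",
]
rows = [parse_row(r) for r in ROWS]

# Row with the lowest validation loss among the parsed rows.
best = min(rows, key=lambda r: r.val_loss)
print(best.step, best.val_loss, best.qwk)

# Steps per epoch from a row with an exact epoch count (step 78 at epoch 2.0).
steps_per_epoch = round(rows[0].step / rows[0].epoch)
print(steps_per_epoch)
```

With 39 steps per epoch and a batch size of 8, the training set would hold between 305 and 312 examples, assuming a single device and no gradient accumulation.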

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
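
To compare a local environment against the versions listed above, something like the following standard-library check may help. It is only a sketch: the `EXPECTED` mapping and both helper names are hypothetical, and it assumes the packages are installed under their usual PyPI distribution names (`torch` for Pytorch).

```python
from importlib import metadata

# Versions listed on the card; "torch" is the PyPI distribution name for Pytorch.
EXPECTED = {
    "transformers": "4.44.2",
    "torch": "2.4.0+cu118",
    "datasets": "2.21.0",
    "tokenizers": "0.19.1",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def version_report(expected=EXPECTED):
    """Map each package to a tuple of (expected, installed, matches)."""
    report = {}
    for package, wanted in expected.items():
        found = installed_version(package)
        report[package] = (wanted, found, found == wanted)
    return report


for package, (wanted, found, ok) in version_report().items():
    print(f"{package}: expected {wanted}, installed {found}, match={ok}")
```

A mismatch does not necessarily mean the model will fail to load; it only flags where the environment differs from the one recorded on the card.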

Model size: 135M params (Safetensors, tensor type F32)