ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run3_AugV5_k15_task2_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2455
  • Qwk: 0.2180
  • Mse: 1.2455
  • Rmse: 1.1160

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.0370 2 4.4932 0.0010 4.4932 2.1197
No log 0.0741 4 2.4830 0.0202 2.4830 1.5757
No log 0.1111 6 1.7011 0.0372 1.7011 1.3043
No log 0.1481 8 1.5350 0.0372 1.5350 1.2390
No log 0.1852 10 1.3655 0.0538 1.3655 1.1685
No log 0.2222 12 1.4489 0.0275 1.4489 1.2037
No log 0.2593 14 1.6663 -0.0066 1.6663 1.2908
No log 0.2963 16 1.5613 0.0104 1.5613 1.2495
No log 0.3333 18 1.4278 0.0838 1.4278 1.1949
No log 0.3704 20 1.3178 0.1314 1.3178 1.1480
No log 0.4074 22 1.1705 0.2245 1.1705 1.0819
No log 0.4444 24 1.0553 0.2835 1.0553 1.0273
No log 0.4815 26 1.0612 0.3195 1.0612 1.0302
No log 0.5185 28 1.1418 0.2589 1.1418 1.0685
No log 0.5556 30 1.4688 0.0575 1.4688 1.2119
No log 0.5926 32 1.8373 0.0041 1.8373 1.3555
No log 0.6296 34 1.8268 0.0198 1.8268 1.3516
No log 0.6667 36 1.6245 0.0222 1.6245 1.2746
No log 0.7037 38 1.3375 0.1165 1.3375 1.1565
No log 0.7407 40 1.3566 0.1346 1.3566 1.1647
No log 0.7778 42 1.3857 0.1076 1.3857 1.1771
No log 0.8148 44 1.4055 0.0512 1.4055 1.1855
No log 0.8519 46 1.3704 0.1404 1.3704 1.1707
No log 0.8889 48 1.3082 0.1715 1.3082 1.1438
No log 0.9259 50 1.3988 0.1288 1.3988 1.1827
No log 0.9630 52 1.4760 0.1199 1.4760 1.2149
No log 1.0 54 1.4272 0.1282 1.4272 1.1946
No log 1.0370 56 1.3496 0.1790 1.3496 1.1617
No log 1.0741 58 1.2668 0.2482 1.2668 1.1255
No log 1.1111 60 1.2826 0.1966 1.2826 1.1325
No log 1.1481 62 1.4471 0.1280 1.4471 1.2030
No log 1.1852 64 1.8253 0.0727 1.8253 1.3510
No log 1.2222 66 1.5437 0.1387 1.5437 1.2425
No log 1.2593 68 1.2616 0.1730 1.2616 1.1232
No log 1.2963 70 1.2382 0.1715 1.2382 1.1128
No log 1.3333 72 1.1897 0.2589 1.1897 1.0907
No log 1.3704 74 1.2599 0.1667 1.2599 1.1225
No log 1.4074 76 1.3474 0.1583 1.3474 1.1608
No log 1.4444 78 1.4153 0.1417 1.4153 1.1897
No log 1.4815 80 1.1897 0.3476 1.1897 1.0907
No log 1.5185 82 1.1266 0.2988 1.1266 1.0614
No log 1.5556 84 1.1061 0.3650 1.1061 1.0517
No log 1.5926 86 1.1427 0.2664 1.1427 1.0690
No log 1.6296 88 1.3118 0.1815 1.3118 1.1454
No log 1.6667 90 1.6315 0.1068 1.6315 1.2773
No log 1.7037 92 1.6484 0.1379 1.6484 1.2839
No log 1.7407 94 1.5906 0.1469 1.5906 1.2612
No log 1.7778 96 1.4747 0.1851 1.4747 1.2144
No log 1.8148 98 1.2170 0.2035 1.2170 1.1032
No log 1.8519 100 1.1917 0.2276 1.1917 1.0917
No log 1.8889 102 1.3398 0.1821 1.3398 1.1575
No log 1.9259 104 1.2799 0.2337 1.2799 1.1313
No log 1.9630 106 1.3236 0.2585 1.3236 1.1505
No log 2.0 108 1.4107 0.2437 1.4107 1.1877
No log 2.0370 110 1.3048 0.2412 1.3048 1.1423
No log 2.0741 112 1.2692 0.2613 1.2692 1.1266
No log 2.1111 114 1.1898 0.3445 1.1898 1.0908
No log 2.1481 116 1.2156 0.2953 1.2156 1.1025
No log 2.1852 118 1.2748 0.3033 1.2748 1.1291
No log 2.2222 120 1.2299 0.3191 1.2299 1.1090
No log 2.2593 122 1.3327 0.3218 1.3327 1.1544
No log 2.2963 124 1.3438 0.2569 1.3438 1.1592
No log 2.3333 126 1.2850 0.3472 1.2850 1.1336
No log 2.3704 128 1.1841 0.2956 1.1841 1.0882
No log 2.4074 130 1.2246 0.3159 1.2246 1.1066
No log 2.4444 132 1.2334 0.3076 1.2334 1.1106
No log 2.4815 134 1.2582 0.2835 1.2582 1.1217
No log 2.5185 136 1.1888 0.3882 1.1888 1.0903
No log 2.5556 138 1.1836 0.3882 1.1836 1.0879
No log 2.5926 140 1.3475 0.2552 1.3475 1.1608
No log 2.6296 142 1.3903 0.3590 1.3903 1.1791
No log 2.6667 144 1.3920 0.3108 1.3920 1.1798
No log 2.7037 146 1.2383 0.2515 1.2383 1.1128
No log 2.7407 148 1.2645 0.3016 1.2645 1.1245
No log 2.7778 150 1.3132 0.2728 1.3132 1.1459
No log 2.8148 152 1.3492 0.2665 1.3492 1.1616
No log 2.8519 154 1.4301 0.2779 1.4301 1.1959
No log 2.8889 156 1.9282 0.1492 1.9282 1.3886
No log 2.9259 158 2.1795 0.1028 2.1795 1.4763
No log 2.9630 160 1.7259 0.1386 1.7259 1.3137
No log 3.0 162 1.2743 0.2827 1.2743 1.1288
No log 3.0370 164 1.2381 0.2200 1.2381 1.1127
No log 3.0741 166 1.2039 0.2221 1.2039 1.0972
No log 3.1111 168 1.2496 0.2827 1.2496 1.1179
No log 3.1481 170 1.3331 0.2570 1.3331 1.1546
No log 3.1852 172 1.2357 0.2851 1.2357 1.1116
No log 3.2222 174 1.1448 0.2792 1.1448 1.0699
No log 3.2593 176 1.1428 0.3534 1.1428 1.0690
No log 3.2963 178 1.2828 0.3096 1.2828 1.1326
No log 3.3333 180 1.4519 0.2197 1.4519 1.2049
No log 3.3704 182 1.3711 0.2405 1.3711 1.1709
No log 3.4074 184 1.2817 0.1880 1.2817 1.1321
No log 3.4444 186 1.2355 0.3100 1.2355 1.1115
No log 3.4815 188 1.2402 0.1821 1.2402 1.1136
No log 3.5185 190 1.2467 0.1872 1.2467 1.1165
No log 3.5556 192 1.3428 0.2384 1.3428 1.1588
No log 3.5926 194 1.4217 0.2035 1.4217 1.1923
No log 3.6296 196 1.4919 0.1734 1.4919 1.2215
No log 3.6667 198 1.5308 0.2094 1.5308 1.2373
No log 3.7037 200 1.4824 0.1725 1.4824 1.2175
No log 3.7407 202 1.3258 0.1980 1.3258 1.1514
No log 3.7778 204 1.3344 0.1872 1.3344 1.1552
No log 3.8148 206 1.5382 0.0709 1.5382 1.2402
No log 3.8519 208 1.6394 0.0719 1.6394 1.2804
No log 3.8889 210 1.6854 0.0961 1.6854 1.2982
No log 3.9259 212 1.7265 0.1127 1.7265 1.3140
No log 3.9630 214 1.5348 0.0693 1.5348 1.2389
No log 4.0 216 1.5837 0.0903 1.5837 1.2584
No log 4.0370 218 1.6459 0.0938 1.6459 1.2829
No log 4.0741 220 1.4969 0.0741 1.4969 1.2235
No log 4.1111 222 1.3604 0.1354 1.3604 1.1663
No log 4.1481 224 1.4557 0.0655 1.4557 1.2065
No log 4.1852 226 1.6375 0.1758 1.6375 1.2797
No log 4.2222 228 1.4754 0.0655 1.4754 1.2147
No log 4.2593 230 1.2580 0.1758 1.2580 1.1216
No log 4.2963 232 1.2255 0.2451 1.2255 1.1070
No log 4.3333 234 1.2083 0.2254 1.2083 1.0992
No log 4.3704 236 1.3159 0.2019 1.3159 1.1471
No log 4.4074 238 1.4886 0.2264 1.4886 1.2201
No log 4.4444 240 1.4396 0.2301 1.4396 1.1998
No log 4.4815 242 1.4273 0.2545 1.4273 1.1947
No log 4.5185 244 1.2777 0.2405 1.2777 1.1303
No log 4.5556 246 1.1942 0.2110 1.1942 1.0928
No log 4.5926 248 1.2177 0.2338 1.2177 1.1035
No log 4.6296 250 1.3480 0.2496 1.3480 1.1610
No log 4.6667 252 1.4387 0.2133 1.4387 1.1995
No log 4.7037 254 1.3488 0.2270 1.3488 1.1614
No log 4.7407 256 1.2725 0.2576 1.2725 1.1280
No log 4.7778 258 1.2826 0.2379 1.2826 1.1325
No log 4.8148 260 1.4321 0.2373 1.4321 1.1967
No log 4.8519 262 1.6507 0.1844 1.6507 1.2848
No log 4.8889 264 1.5438 0.2523 1.5438 1.2425
No log 4.9259 266 1.2927 0.2149 1.2927 1.1370
No log 4.9630 268 1.1655 0.2647 1.1655 1.0796
No log 5.0 270 1.1470 0.2845 1.1470 1.0710
No log 5.0370 272 1.1207 0.2598 1.1207 1.0587
No log 5.0741 274 1.1113 0.3062 1.1113 1.0542
No log 5.1111 276 1.1863 0.3056 1.1863 1.0892
No log 5.1481 278 1.3035 0.3074 1.3035 1.1417
No log 5.1852 280 1.2183 0.2180 1.2183 1.1038
No log 5.2222 282 1.1369 0.2803 1.1369 1.0663
No log 5.2593 284 1.1567 0.2721 1.1567 1.0755
No log 5.2963 286 1.2137 0.3446 1.2137 1.1017
No log 5.3333 288 1.2488 0.2995 1.2488 1.1175
No log 5.3704 290 1.3176 0.2141 1.3176 1.1479
No log 5.4074 292 1.3930 0.2356 1.3930 1.1802
No log 5.4444 294 1.3582 0.2331 1.3582 1.1654
No log 5.4815 296 1.3227 0.2022 1.3227 1.1501
No log 5.5185 298 1.2885 0.2148 1.2885 1.1351
No log 5.5556 300 1.3195 0.1702 1.3195 1.1487
No log 5.5926 302 1.4057 0.1596 1.4057 1.1856
No log 5.6296 304 1.3287 0.1348 1.3287 1.1527
No log 5.6667 306 1.2366 0.3128 1.2366 1.1120
No log 5.7037 308 1.2179 0.2342 1.2179 1.1036
No log 5.7407 310 1.1989 0.2877 1.1989 1.0949
No log 5.7778 312 1.3198 0.1260 1.3198 1.1488
No log 5.8148 314 1.4402 0.1836 1.4402 1.2001
No log 5.8519 316 1.3616 0.1957 1.3616 1.1669
No log 5.8889 318 1.2248 0.2127 1.2248 1.1067
No log 5.9259 320 1.2239 0.3168 1.2239 1.1063
No log 5.9630 322 1.2063 0.2614 1.2063 1.0983
No log 6.0 324 1.2520 0.1839 1.2520 1.1189
No log 6.0370 326 1.3316 0.2081 1.3316 1.1540
No log 6.0741 328 1.2823 0.1891 1.2823 1.1324
No log 6.1111 330 1.2196 0.2226 1.2196 1.1043
No log 6.1481 332 1.2437 0.2531 1.2437 1.1152
No log 6.1852 334 1.2937 0.2011 1.2937 1.1374
No log 6.2222 336 1.3619 0.2301 1.3619 1.1670
No log 6.2593 338 1.5440 0.2647 1.5440 1.2426
No log 6.2963 340 1.5798 0.2015 1.5798 1.2569
No log 6.3333 342 1.3482 0.2335 1.3482 1.1611
No log 6.3704 344 1.2129 0.2108 1.2129 1.1013
No log 6.4074 346 1.1481 0.2227 1.1481 1.0715
No log 6.4444 348 1.1432 0.2971 1.1432 1.0692
No log 6.4815 350 1.1397 0.2551 1.1397 1.0676
No log 6.5185 352 1.1629 0.2148 1.1629 1.0784
No log 6.5556 354 1.2562 0.2431 1.2562 1.1208
No log 6.5926 356 1.3115 0.2779 1.3115 1.1452
No log 6.6296 358 1.2884 0.3011 1.2884 1.1351
No log 6.6667 360 1.2797 0.3011 1.2797 1.1313
No log 6.7037 362 1.2319 0.2746 1.2319 1.1099
No log 6.7407 364 1.2730 0.2863 1.2730 1.1283
No log 6.7778 366 1.3479 0.2989 1.3479 1.1610
No log 6.8148 368 1.2799 0.2863 1.2799 1.1313
No log 6.8519 370 1.1671 0.3639 1.1671 1.0803
No log 6.8889 372 1.1399 0.2995 1.1399 1.0677
No log 6.9259 374 1.1228 0.2877 1.1228 1.0596
No log 6.9630 376 1.1336 0.2389 1.1336 1.0647
No log 7.0 378 1.1918 0.1950 1.1918 1.0917
No log 7.0370 380 1.2386 0.2507 1.2386 1.1129
No log 7.0741 382 1.2004 0.1950 1.2004 1.0956
No log 7.1111 384 1.1323 0.1573 1.1323 1.0641
No log 7.1481 386 1.1482 0.2459 1.1482 1.0716
No log 7.1852 388 1.2194 0.2452 1.2194 1.1043
No log 7.2222 390 1.4297 0.2570 1.4297 1.1957
No log 7.2593 392 1.5473 0.1692 1.5473 1.2439
No log 7.2963 394 1.5581 0.1454 1.5581 1.2482
No log 7.3333 396 1.4405 0.2443 1.4405 1.2002
No log 7.3704 398 1.2996 0.1753 1.2996 1.1400
No log 7.4074 400 1.2620 0.1837 1.2620 1.1234
No log 7.4444 402 1.2117 0.2098 1.2117 1.1008
No log 7.4815 404 1.2070 0.1781 1.2070 1.0986
No log 7.5185 406 1.2404 0.2159 1.2404 1.1137
No log 7.5556 408 1.2644 0.2072 1.2644 1.1244
No log 7.5926 410 1.2913 0.2378 1.2913 1.1364
No log 7.6296 412 1.2525 0.2666 1.2525 1.1192
No log 7.6667 414 1.2027 0.1921 1.2027 1.0967
No log 7.7037 416 1.2188 0.2108 1.2188 1.1040
No log 7.7407 418 1.2404 0.2296 1.2404 1.1137
No log 7.7778 420 1.2230 0.1882 1.2230 1.1059
No log 7.8148 422 1.2299 0.1882 1.2299 1.1090
No log 7.8519 424 1.2457 0.1797 1.2457 1.1161
No log 7.8889 426 1.2664 0.1862 1.2664 1.1253
No log 7.9259 428 1.2062 0.2018 1.2062 1.0983
No log 7.9630 430 1.1991 0.1595 1.1991 1.0950
No log 8.0 432 1.2696 0.1912 1.2696 1.1268
No log 8.0370 434 1.3719 0.1254 1.3719 1.1713
No log 8.0741 436 1.3185 0.2100 1.3185 1.1483
No log 8.1111 438 1.2166 0.2031 1.2166 1.1030
No log 8.1481 440 1.1969 0.2323 1.1969 1.0940
No log 8.1852 442 1.2614 0.2022 1.2614 1.1231
No log 8.2222 444 1.3868 0.2143 1.3868 1.1776
No log 8.2593 446 1.3426 0.1858 1.3426 1.1587
No log 8.2963 448 1.2481 0.2514 1.2481 1.1172
No log 8.3333 450 1.1948 0.2777 1.1948 1.0931
No log 8.3704 452 1.1726 0.2769 1.1726 1.0829
No log 8.4074 454 1.1527 0.2931 1.1527 1.0737
No log 8.4444 456 1.1843 0.2340 1.1843 1.0882
No log 8.4815 458 1.2866 0.2470 1.2866 1.1343
No log 8.5185 460 1.2885 0.2218 1.2885 1.1351
No log 8.5556 462 1.2077 0.2945 1.2077 1.0990
No log 8.5926 464 1.1754 0.3034 1.1754 1.0841
No log 8.6296 466 1.1683 0.3190 1.1683 1.0809
No log 8.6667 468 1.2314 0.2384 1.2314 1.1097
No log 8.7037 470 1.3783 0.2048 1.3783 1.1740
No log 8.7407 472 1.3663 0.1734 1.3663 1.1689
No log 8.7778 474 1.2904 0.1734 1.2904 1.1359
No log 8.8148 476 1.2312 0.2209 1.2312 1.1096
No log 8.8519 478 1.2279 0.2299 1.2279 1.1081
No log 8.8889 480 1.2040 0.2056 1.2040 1.0973
No log 8.9259 482 1.2092 0.2056 1.2092 1.0996
No log 8.9630 484 1.2127 0.1592 1.2127 1.1012
No log 9.0 486 1.1937 0.1873 1.1937 1.0926
No log 9.0370 488 1.1954 0.1595 1.1954 1.0934
No log 9.0741 490 1.2073 0.1540 1.2073 1.0988
No log 9.1111 492 1.1955 0.1882 1.1955 1.0934
No log 9.1481 494 1.1851 0.3336 1.1851 1.0886
No log 9.1852 496 1.1790 0.3618 1.1790 1.0858
No log 9.2222 498 1.1747 0.3379 1.1747 1.0838
0.3189 9.2593 500 1.1836 0.2299 1.1836 1.0879
0.3189 9.2963 502 1.2155 0.2577 1.2155 1.1025
0.3189 9.3333 504 1.2336 0.2270 1.2336 1.1107
0.3189 9.3704 506 1.2276 0.1632 1.2276 1.1080
0.3189 9.4074 508 1.2205 0.1632 1.2205 1.1048
0.3189 9.4444 510 1.1495 0.2624 1.1495 1.0722
0.3189 9.4815 512 1.1129 0.3085 1.1129 1.0550
0.3189 9.5185 514 1.1063 0.2661 1.1063 1.0518
0.3189 9.5556 516 1.1129 0.2539 1.1129 1.0549
0.3189 9.5926 518 1.2081 0.1996 1.2081 1.0991
0.3189 9.6296 520 1.2893 0.2141 1.2893 1.1355
0.3189 9.6667 522 1.2422 0.2091 1.2422 1.1146
0.3189 9.7037 524 1.1391 0.2550 1.1391 1.0673
0.3189 9.7407 526 1.1217 0.3468 1.1217 1.0591
0.3189 9.7778 528 1.1277 0.2843 1.1277 1.0619
0.3189 9.8148 530 1.1561 0.1728 1.1561 1.0752
0.3189 9.8519 532 1.2293 0.1996 1.2293 1.1088
0.3189 9.8889 534 1.2821 0.1815 1.2821 1.1323
0.3189 9.9259 536 1.3166 0.1725 1.3166 1.1474
0.3189 9.9630 538 1.2455 0.2180 1.2455 1.1160

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
182
Safetensors
Model size
135M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for MayBashendy/ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run3_AugV5_k15_task2_organization

Finetuned
(4222)
this model