---
library_name: transformers
base_model: aubmindlab/bert-base-arabertv02
tags:
- generated_from_trainer
model-index:
- name: ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k20_task1_organization
  results: []
---

# ArabicNewSplits7_usingALLEssays_FineTuningAraBERT_run2_AugV5_k20_task1_organization

This model is a fine-tuned version of [aubmindlab/bert-base-arabertv02](https://huggingface.co/aubmindlab/bert-base-arabertv02) on a dataset that the card does not name (recorded as `None`).
It achieves the following results on the evaluation set:
- Loss: 0.9054
- Qwk: 0.6565
- Mse: 0.9054
- Rmse: 0.9515

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 100

### Training results

| Training Loss | Epoch | Step | Validation Loss | Qwk | Mse | Rmse |
|:-------------:|:------:|:----:|:---------------:|:-------:|:------:|:------:|
| No log | 0.0211 | 2 | 6.9147 | 0.0057 | 6.9147 | 2.6296 |
| No log | 0.0421 | 4 | 4.6767 | 0.0658 | 4.6767 | 2.1626 |
| No log | 0.0632 | 6 | 3.5127 | 0.0 | 3.5127 | 1.8742 |
| No log | 0.0842 | 8 | 3.9518 | -0.0302 | 3.9518 | 1.9879 |
| No log | 0.1053 | 10 | 2.8542 | 0.0694 | 2.8542 | 1.6894 |
| No log | 0.1263 | 12 | 1.8263 | 0.2182 | 1.8263 | 1.3514 |
| No log | 0.1474 | 14 | 2.0466 | 0.1368 | 2.0466 | 1.4306 |
| No log | 0.1684 | 16 | 3.3066 | 0.0235 | 3.3066 | 1.8184 |
| No log | 0.1895 | 18 | 3.3134 | 0.0235 | 3.3134 | 1.8203 |
| No log | 0.2105 | 20 | 2.5913 | -0.0274 | 2.5913 | 1.6097 |
| No log | 0.2316 | 22 | 2.2169 | 0.0769 | 2.2169 | 1.4889 |
| No log | 0.2526 | 24 | 2.0737 | 0.2034 | 2.0737 | 1.4400 |
| No log | 0.2737 | 26 | 1.9056 | 0.2037 | 1.9056 | 1.3805 |
| No log | 0.2947 | 28 | 1.8019 | 0.1333 | 1.8019 | 1.3423 |
| No log | 0.3158 | 30 | 1.8078 | 0.1333 | 1.8078 | 1.3445 |
| No log | 0.3368 | 32 | 2.0400 | 0.2314 | 2.0400 | 1.4283 |
| No log | 0.3579 | 34 | 2.6458 | 0.0 | 2.6458 | 1.6266 |
| No log | 0.3789 | 36 | 3.5686 | 0.0123 | 3.5686 | 1.8891 |
| No log | 0.4 | 38 | 3.4749 | 0.0370 | 3.4749 | 1.8641 |
| No log | 0.4211 | 40 | 3.1173 | 0.0 | 3.1173 | 1.7656 |
| No log | 0.4421 | 42 | 2.4951 | 0.0272 | 2.4951 | 1.5796 |
| No log | 0.4632 | 44 | 1.8426 | 0.3651 | 1.8426 | 1.3574 |
| No log | 0.4842 | 46 | 1.6128 | 0.3130 | 1.6128 | 1.2700 |
| No log | 0.5053 | 48 | 1.5038 | 0.2703 | 1.5038 | 1.2263 |
| No log | 0.5263 | 50 | 1.4433 | 0.2703 | 1.4433 | 1.2014 |
| No log | 0.5474 | 52 | 1.5475 | 0.3448 | 1.5475 | 1.2440 |
| No log | 0.5684 | 54 | 1.9270 | 0.3333 | 1.9270 | 1.3882 |
| No log | 0.5895 | 56 | 2.0740 | 0.2302 | 2.0740 | 1.4402 |
| No log | 0.6105 | 58 | 1.8771 | 0.3385 | 1.8771 | 1.3701 |
| No log | 0.6316 | 60 | 1.4436 | 0.3559 | 1.4436 | 1.2015 |
| No log | 0.6526 | 62 | 1.3216 | 0.3158 | 1.3216 | 1.1496 |
| No log | 0.6737 | 64 | 1.3039 | 0.3130 | 1.3039 | 1.1419 |
| No log | 0.6947 | 66 | 1.2831 | 0.3866 | 1.2831 | 1.1327 |
| No log | 0.7158 | 68 | 1.2356 | 0.4793 | 1.2356 | 1.1116 |
| No log | 0.7368 | 70 | 1.2627 | 0.4878 | 1.2627 | 1.1237 |
| No log | 0.7579 | 72 | 1.7097 | 0.3846 | 1.7097 | 1.3076 |
| No log | 0.7789 | 74 | 2.4479 | 0.1333 | 2.4479 | 1.5646 |
| No log | 0.8 | 76 | 2.7969 | 0.0750 | 2.7969 | 1.6724 |
| No log | 0.8211 | 78 | 2.6151 | 0.0530 | 2.6151 | 1.6171 |
| No log | 0.8421 | 80 | 2.3699 | 0.0544 | 2.3699 | 1.5394 |
| No log | 0.8632 | 82 | 2.0413 | 0.2941 | 2.0413 | 1.4287 |
| No log | 0.8842 | 84 | 1.6477 | 0.3492 | 1.6477 | 1.2836 |
| No log | 0.9053 | 86 | 1.3389 | 0.3934 | 1.3389 | 1.1571 |
| No log | 0.9263 | 88 | 1.4392 | 0.3770 | 1.4392 | 1.1997 |
| No log | 0.9474 | 90 | 1.6772 | 0.3492 | 1.6772 | 1.2951 |
| No log | 0.9684 | 92 | 1.7334 | 0.3188 | 1.7334 | 1.3166 |
| No log | 0.9895 | 94 | 1.5769 | 0.3453 | 1.5769 | 1.2557 |
| No log | 1.0105 | 96 | 1.2468 | 0.5180 | 1.2468 | 1.1166 |
| No log | 1.0316 | 98 | 1.0736 | 0.6099 | 1.0736 | 1.0361 |
| No log | 1.0526 | 100 | 1.0502 | 0.6143 | 1.0502 | 1.0248 |
| No log | 1.0737 | 102 | 1.3220 | 0.4966 | 1.3220 | 1.1498 |
| No log | 1.0947 | 104 | 1.5547 | 0.4966 | 1.5547 | 1.2469 |
| No log | 1.1158 | 106 | 1.5209 | 0.4937 | 1.5209 | 1.2333 |
| No log | 1.1368 | 108 | 1.2254 | 0.6125 | 1.2254 | 1.1070 |
| No log | 1.1579 | 110 | 1.2745 | 0.6049 | 1.2745 | 1.1290 |
| No log | 1.1789 | 112 | 1.3467 | 0.6049 | 1.3467 | 1.1605 |
| No log | 1.2 | 114 | 1.4391 | 0.5952 | 1.4391 | 1.1996 |
| No log | 1.2211 | 116 | 1.4934 | 0.5952 | 1.4934 | 1.2220 |
| No log | 1.2421 | 118 | 1.6305 | 0.4848 | 1.6305 | 1.2769 |
| No log | 1.2632 | 120 | 1.7518 | 0.4524 | 1.7518 | 1.3235 |
| No log | 1.2842 | 122 | 1.6777 | 0.4540 | 1.6777 | 1.2952 |
| No log | 1.3053 | 124 | 1.6582 | 0.4375 | 1.6582 | 1.2877 |
| No log | 1.3263 | 126 | 1.5145 | 0.4654 | 1.5145 | 1.2306 |
| No log | 1.3474 | 128 | 1.4462 | 0.5157 | 1.4462 | 1.2026 |
| No log | 1.3684 | 130 | 1.1904 | 0.5467 | 1.1904 | 1.0911 |
| No log | 1.3895 | 132 | 0.9410 | 0.6667 | 0.9410 | 0.9700 |
| No log | 1.4105 | 134 | 0.9373 | 0.6423 | 0.9373 | 0.9681 |
| No log | 1.4316 | 136 | 0.9491 | 0.6 | 0.9491 | 0.9742 |
| No log | 1.4526 | 138 | 0.8702 | 0.7324 | 0.8702 | 0.9328 |
| No log | 1.4737 | 140 | 0.8608 | 0.7273 | 0.8608 | 0.9278 |
| No log | 1.4947 | 142 | 0.9341 | 0.6338 | 0.9341 | 0.9665 |
| No log | 1.5158 | 144 | 1.0152 | 0.5714 | 1.0152 | 1.0076 |
| No log | 1.5368 | 146 | 1.0549 | 0.5972 | 1.0549 | 1.0271 |
| No log | 1.5579 | 148 | 1.0469 | 0.6241 | 1.0469 | 1.0232 |
| No log | 1.5789 | 150 | 0.9911 | 0.6269 | 0.9911 | 0.9956 |
| No log | 1.6 | 152 | 0.9760 | 0.6667 | 0.9760 | 0.9880 |
| No log | 1.6211 | 154 | 0.9524 | 0.6906 | 0.9524 | 0.9759 |
| No log | 1.6421 | 156 | 0.9406 | 0.6950 | 0.9406 | 0.9698 |
| No log | 1.6632 | 158 | 0.9463 | 0.6714 | 0.9463 | 0.9728 |
| No log | 1.6842 | 160 | 0.9491 | 0.6619 | 0.9491 | 0.9742 |
| No log | 1.7053 | 162 | 0.9385 | 0.6331 | 0.9385 | 0.9688 |
| No log | 1.7263 | 164 | 0.8984 | 0.6803 | 0.8984 | 0.9478 |
| No log | 1.7474 | 166 | 0.8712 | 0.6803 | 0.8712 | 0.9334 |
| No log | 1.7684 | 168 | 0.8899 | 0.6761 | 0.8899 | 0.9433 |
| No log | 1.7895 | 170 | 1.0000 | 0.6111 | 1.0000 | 1.0000 |
| No log | 1.8105 | 172 | 1.0447 | 0.5775 | 1.0447 | 1.0221 |
| No log | 1.8316 | 174 | 1.0635 | 0.5522 | 1.0635 | 1.0313 |
| No log | 1.8526 | 176 | 1.0397 | 0.5385 | 1.0397 | 1.0196 |
| No log | 1.8737 | 178 | 1.0406 | 0.5397 | 1.0406 | 1.0201 |
| No log | 1.8947 | 180 | 0.9838 | 0.4959 | 0.9838 | 0.9919 |
| No log | 1.9158 | 182 | 0.9065 | 0.6565 | 0.9065 | 0.9521 |
| No log | 1.9368 | 184 | 0.8501 | 0.6667 | 0.8501 | 0.9220 |
| No log | 1.9579 | 186 | 0.8315 | 0.6815 | 0.8315 | 0.9119 |
| No log | 1.9789 | 188 | 0.8385 | 0.7111 | 0.8385 | 0.9157 |
| No log | 2.0 | 190 | 0.9007 | 0.6950 | 0.9007 | 0.9490 |
| No log | 2.0211 | 192 | 1.0128 | 0.6111 | 1.0128 | 1.0064 |
| No log | 2.0421 | 194 | 0.9623 | 0.5846 | 0.9623 | 0.9810 |
| No log | 2.0632 | 196 | 0.8741 | 0.7059 | 0.8741 | 0.9350 |
| No log | 2.0842 | 198 | 0.8473 | 0.7299 | 0.8473 | 0.9205 |
| No log | 2.1053 | 200 | 0.8685 | 0.7015 | 0.8685 | 0.9319 |
| No log | 2.1263 | 202 | 0.8730 | 0.6866 | 0.8730 | 0.9344 |
| No log | 2.1474 | 204 | 0.9180 | 0.6866 | 0.9180 | 0.9581 |
| No log | 2.1684 | 206 | 1.0557 | 0.6043 | 1.0557 | 1.0275 |
| No log | 2.1895 | 208 | 1.1874 | 0.5441 | 1.1874 | 1.0897 |
| No log | 2.2105 | 210 | 1.3732 | 0.5342 | 1.3732 | 1.1718 |
| No log | 2.2316 | 212 | 1.3254 | 0.5342 | 1.3254 | 1.1512 |
| No log | 2.2526 | 214 | 1.1241 | 0.5481 | 1.1241 | 1.0602 |
| No log | 2.2737 | 216 | 0.9579 | 0.6714 | 0.9579 | 0.9787 |
| No log | 2.2947 | 218 | 0.8948 | 0.7059 | 0.8948 | 0.9459 |
| No log | 2.3158 | 220 | 0.8576 | 0.7050 | 0.8576 | 0.9261 |
| No log | 2.3368 | 222 | 0.8369 | 0.7347 | 0.8369 | 0.9148 |
| No log | 2.3579 | 224 | 0.8554 | 0.6986 | 0.8554 | 0.9249 |
| No log | 2.3789 | 226 | 0.9670 | 0.6447 | 0.9670 | 0.9834 |
| No log | 2.4 | 228 | 1.0809 | 0.5594 | 1.0809 | 1.0397 |
| No log | 2.4211 | 230 | 1.0430 | 0.5652 | 1.0430 | 1.0213 |
| No log | 2.4421 | 232 | 1.0147 | 0.5652 | 1.0147 | 1.0073 |
| No log | 2.4632 | 234 | 1.0125 | 0.5612 | 1.0125 | 1.0062 |
| No log | 2.4842 | 236 | 1.0915 | 0.5772 | 1.0915 | 1.0447 |
| No log | 2.5053 | 238 | 1.0495 | 0.5946 | 1.0495 | 1.0245 |
| No log | 2.5263 | 240 | 0.9157 | 0.5857 | 0.9157 | 0.9569 |
| No log | 2.5474 | 242 | 0.8149 | 0.6269 | 0.8149 | 0.9027 |
| No log | 2.5684 | 244 | 0.7958 | 0.6957 | 0.7958 | 0.8921 |
| No log | 2.5895 | 246 | 0.7943 | 0.7050 | 0.7943 | 0.8912 |
| No log | 2.6105 | 248 | 0.8308 | 0.6993 | 0.8308 | 0.9115 |
| No log | 2.6316 | 250 | 0.9834 | 0.6667 | 0.9834 | 0.9917 |
| No log | 2.6526 | 252 | 1.3131 | 0.5680 | 1.3131 | 1.1459 |
| No log | 2.6737 | 254 | 1.4336 | 0.5650 | 1.4336 | 1.1973 |
| No log | 2.6947 | 256 | 1.1698 | 0.5786 | 1.1698 | 1.0816 |
| No log | 2.7158 | 258 | 0.8472 | 0.7075 | 0.8472 | 0.9204 |
| No log | 2.7368 | 260 | 0.8291 | 0.7092 | 0.8291 | 0.9106 |
| No log | 2.7579 | 262 | 0.9441 | 0.6423 | 0.9441 | 0.9716 |
| No log | 2.7789 | 264 | 0.9637 | 0.6176 | 0.9637 | 0.9817 |
| No log | 2.8 | 266 | 0.9317 | 0.6119 | 0.9317 | 0.9653 |
| No log | 2.8211 | 268 | 0.9029 | 0.7153 | 0.9029 | 0.9502 |
| No log | 2.8421 | 270 | 0.8909 | 0.6763 | 0.8909 | 0.9439 |
| No log | 2.8632 | 272 | 0.8627 | 0.7310 | 0.8627 | 0.9288 |
| No log | 2.8842 | 274 | 0.8517 | 0.7222 | 0.8517 | 0.9229 |
| No log | 2.9053 | 276 | 0.8626 | 0.6980 | 0.8626 | 0.9288 |
| No log | 2.9263 | 278 | 0.9225 | 0.6623 | 0.9225 | 0.9605 |
| No log | 2.9474 | 280 | 0.8919 | 0.6622 | 0.8919 | 0.9444 |
| No log | 2.9684 | 282 | 0.8077 | 0.7042 | 0.8077 | 0.8987 |
| No log | 2.9895 | 284 | 0.8139 | 0.7246 | 0.8139 | 0.9022 |
| No log | 3.0105 | 286 | 0.8100 | 0.7246 | 0.8100 | 0.9000 |
| No log | 3.0316 | 288 | 0.8139 | 0.7007 | 0.8139 | 0.9022 |
| No log | 3.0526 | 290 | 0.8140 | 0.7353 | 0.8140 | 0.9022 |
| No log | 3.0737 | 292 | 0.8294 | 0.7218 | 0.8294 | 0.9107 |
| No log | 3.0947 | 294 | 0.8336 | 0.7164 | 0.8336 | 0.9130 |
| No log | 3.1158 | 296 | 0.8242 | 0.6957 | 0.8242 | 0.9079 |
| No log | 3.1368 | 298 | 0.8370 | 0.7403 | 0.8370 | 0.9149 |
| No log | 3.1579 | 300 | 0.8097 | 0.75 | 0.8097 | 0.8998 |
| No log | 3.1789 | 302 | 0.8030 | 0.7425 | 0.8030 | 0.8961 |
| No log | 3.2 | 304 | 0.7775 | 0.7784 | 0.7775 | 0.8817 |
| No log | 3.2211 | 306 | 0.7873 | 0.7529 | 0.7873 | 0.8873 |
| No log | 3.2421 | 308 | 0.7734 | 0.7683 | 0.7734 | 0.8794 |
| No log | 3.2632 | 310 | 0.7647 | 0.7436 | 0.7647 | 0.8745 |
| No log | 3.2842 | 312 | 0.8014 | 0.7162 | 0.8014 | 0.8952 |
| No log | 3.3053 | 314 | 0.8398 | 0.6912 | 0.8398 | 0.9164 |
| No log | 3.3263 | 316 | 0.9002 | 0.6667 | 0.9002 | 0.9488 |
| No log | 3.3474 | 318 | 0.8539 | 0.6815 | 0.8539 | 0.9241 |
| No log | 3.3684 | 320 | 0.8135 | 0.6950 | 0.8135 | 0.9020 |
| No log | 3.3895 | 322 | 0.7865 | 0.7413 | 0.7865 | 0.8868 |
| No log | 3.4105 | 324 | 0.7881 | 0.7413 | 0.7881 | 0.8878 |
| No log | 3.4316 | 326 | 0.8029 | 0.7114 | 0.8029 | 0.8961 |
| No log | 3.4526 | 328 | 0.8282 | 0.7237 | 0.8282 | 0.9101 |
| No log | 3.4737 | 330 | 0.8940 | 0.7711 | 0.8940 | 0.9455 |
| No log | 3.4947 | 332 | 1.0440 | 0.7006 | 1.0440 | 1.0218 |
| No log | 3.5158 | 334 | 1.1158 | 0.6552 | 1.1158 | 1.0563 |
| No log | 3.5368 | 336 | 0.9944 | 0.6971 | 0.9944 | 0.9972 |
| No log | 3.5579 | 338 | 0.9132 | 0.7432 | 0.9132 | 0.9556 |
| No log | 3.5789 | 340 | 0.9585 | 0.6755 | 0.9585 | 0.9790 |
| No log | 3.6 | 342 | 1.1431 | 0.5926 | 1.1431 | 1.0692 |
| No log | 3.6211 | 344 | 1.2288 | 0.5890 | 1.2288 | 1.1085 |
| No log | 3.6421 | 346 | 1.1618 | 0.5890 | 1.1618 | 1.0779 |
| No log | 3.6632 | 348 | 0.8822 | 0.6883 | 0.8822 | 0.9392 |
| No log | 3.6842 | 350 | 0.7543 | 0.7133 | 0.7543 | 0.8685 |
| No log | 3.7053 | 352 | 0.7774 | 0.7050 | 0.7774 | 0.8817 |
| No log | 3.7263 | 354 | 0.7965 | 0.7007 | 0.7965 | 0.8925 |
| No log | 3.7474 | 356 | 0.8196 | 0.6812 | 0.8196 | 0.9053 |
| No log | 3.7684 | 358 | 0.9387 | 0.5865 | 0.9387 | 0.9689 |
| No log | 3.7895 | 360 | 1.1042 | 0.5156 | 1.1042 | 1.0508 |
| No log | 3.8105 | 362 | 1.1047 | 0.5238 | 1.1047 | 1.0510 |
| No log | 3.8316 | 364 | 1.0618 | 0.5161 | 1.0618 | 1.0304 |
| No log | 3.8526 | 366 | 0.9941 | 0.5397 | 0.9941 | 0.9970 |
| No log | 3.8737 | 368 | 0.9151 | 0.6 | 0.9151 | 0.9566 |
| No log | 3.8947 | 370 | 0.8493 | 0.7259 | 0.8493 | 0.9216 |
| No log | 3.9158 | 372 | 0.7988 | 0.7465 | 0.7988 | 0.8937 |
| No log | 3.9368 | 374 | 0.7695 | 0.75 | 0.7695 | 0.8772 |
| No log | 3.9579 | 376 | 0.7925 | 0.7383 | 0.7925 | 0.8902 |
| No log | 3.9789 | 378 | 0.8450 | 0.7516 | 0.8450 | 0.9192 |
| No log | 4.0 | 380 | 0.8272 | 0.7578 | 0.8272 | 0.9095 |
| No log | 4.0211 | 382 | 0.8119 | 0.7532 | 0.8119 | 0.9010 |
| No log | 4.0421 | 384 | 0.8183 | 0.7248 | 0.8183 | 0.9046 |
| No log | 4.0632 | 386 | 0.8635 | 0.7297 | 0.8635 | 0.9293 |
| No log | 4.0842 | 388 | 0.9073 | 0.6621 | 0.9073 | 0.9525 |
| No log | 4.1053 | 390 | 0.9430 | 0.6443 | 0.9430 | 0.9711 |
| No log | 4.1263 | 392 | 0.9180 | 0.6711 | 0.9180 | 0.9581 |
| No log | 4.1474 | 394 | 0.8735 | 0.7439 | 0.8735 | 0.9346 |
| No log | 4.1684 | 396 | 0.8047 | 0.7453 | 0.8047 | 0.8971 |
| No log | 4.1895 | 398 | 0.7587 | 0.7673 | 0.7587 | 0.8710 |
| No log | 4.2105 | 400 | 0.7473 | 0.7927 | 0.7473 | 0.8645 |
| No log | 4.2316 | 402 | 0.7532 | 0.7738 | 0.7532 | 0.8679 |
| No log | 4.2526 | 404 | 0.7732 | 0.7453 | 0.7732 | 0.8793 |
| No log | 4.2737 | 406 | 0.8088 | 0.725 | 0.8088 | 0.8993 |
| No log | 4.2947 | 408 | 0.8168 | 0.7248 | 0.8168 | 0.9038 |
| No log | 4.3158 | 410 | 0.8296 | 0.7467 | 0.8296 | 0.9108 |
| No log | 4.3368 | 412 | 0.8137 | 0.7517 | 0.8137 | 0.9021 |
| No log | 4.3579 | 414 | 0.8159 | 0.7517 | 0.8159 | 0.9033 |
| No log | 4.3789 | 416 | 0.8062 | 0.7297 | 0.8062 | 0.8979 |
| No log | 4.4 | 418 | 0.7740 | 0.7361 | 0.7740 | 0.8798 |
| No log | 4.4211 | 420 | 0.7488 | 0.76 | 0.7488 | 0.8653 |
| No log | 4.4421 | 422 | 0.7392 | 0.76 | 0.7392 | 0.8598 |
| No log | 4.4632 | 424 | 0.7521 | 0.75 | 0.7521 | 0.8672 |
| No log | 4.4842 | 426 | 0.7665 | 0.7296 | 0.7665 | 0.8755 |
| No log | 4.5053 | 428 | 0.7865 | 0.7089 | 0.7865 | 0.8868 |
| No log | 4.5263 | 430 | 0.7931 | 0.7089 | 0.7931 | 0.8905 |
| No log | 4.5474 | 432 | 0.8488 | 0.7030 | 0.8488 | 0.9213 |
| No log | 4.5684 | 434 | 0.9802 | 0.6460 | 0.9802 | 0.9900 |
| No log | 4.5895 | 436 | 1.0860 | 0.6026 | 1.0860 | 1.0421 |
| No log | 4.6105 | 438 | 1.0336 | 0.5753 | 1.0336 | 1.0167 |
| No log | 4.6316 | 440 | 0.9525 | 0.6 | 0.9525 | 0.9759 |
| No log | 4.6526 | 442 | 0.9252 | 0.6364 | 0.9252 | 0.9619 |
| No log | 4.6737 | 444 | 0.9774 | 0.6107 | 0.9774 | 0.9886 |
| No log | 4.6947 | 446 | 0.9680 | 0.6061 | 0.9680 | 0.9839 |
| No log | 4.7158 | 448 | 0.8967 | 0.6515 | 0.8967 | 0.9469 |
| No log | 4.7368 | 450 | 0.8432 | 0.7083 | 0.8432 | 0.9183 |
| No log | 4.7579 | 452 | 0.8539 | 0.7320 | 0.8539 | 0.9241 |
| No log | 4.7789 | 454 | 0.8888 | 0.6490 | 0.8888 | 0.9428 |
| No log | 4.8 | 456 | 0.8369 | 0.7320 | 0.8369 | 0.9148 |
| No log | 4.8211 | 458 | 0.7733 | 0.7285 | 0.7733 | 0.8794 |
| No log | 4.8421 | 460 | 0.7664 | 0.7383 | 0.7664 | 0.8754 |
| No log | 4.8632 | 462 | 0.7862 | 0.7162 | 0.7862 | 0.8867 |
| No log | 4.8842 | 464 | 0.8462 | 0.6986 | 0.8462 | 0.9199 |
| No log | 4.9053 | 466 | 0.8606 | 0.6986 | 0.8606 | 0.9277 |
| No log | 4.9263 | 468 | 0.8436 | 0.7123 | 0.8436 | 0.9185 |
| No log | 4.9474 | 470 | 0.8436 | 0.6939 | 0.8436 | 0.9185 |
| No log | 4.9684 | 472 | 0.8377 | 0.7125 | 0.8377 | 0.9153 |
| No log | 4.9895 | 474 | 0.8862 | 0.7117 | 0.8862 | 0.9414 |
| No log | 5.0105 | 476 | 0.9180 | 0.6707 | 0.9180 | 0.9581 |
| No log | 5.0316 | 478 | 0.8822 | 0.7117 | 0.8822 | 0.9392 |
| No log | 5.0526 | 480 | 0.9290 | 0.6748 | 0.9290 | 0.9639 |
| No log | 5.0737 | 482 | 1.0426 | 0.6588 | 1.0426 | 1.0211 |
| No log | 5.0947 | 484 | 1.0471 | 0.6588 | 1.0471 | 1.0233 |
| No log | 5.1158 | 486 | 0.9415 | 0.6627 | 0.9415 | 0.9703 |
| No log | 5.1368 | 488 | 0.8213 | 0.7389 | 0.8213 | 0.9062 |
| No log | 5.1579 | 490 | 0.7950 | 0.7310 | 0.7950 | 0.8916 |
| No log | 5.1789 | 492 | 0.8209 | 0.75 | 0.8209 | 0.9060 |
| No log | 5.2 | 494 | 0.8626 | 0.6906 | 0.8626 | 0.9288 |
| No log | 5.2211 | 496 | 0.8957 | 0.6043 | 0.8957 | 0.9464 |
| No log | 5.2421 | 498 | 0.8968 | 0.6187 | 0.8968 | 0.9470 |
| 0.4811 | 5.2632 | 500 | 0.8757 | 0.6861 | 0.8757 | 0.9358 |
| 0.4811 | 5.2842 | 502 | 0.8680 | 0.6716 | 0.8680 | 0.9317 |
| 0.4811 | 5.3053 | 504 | 0.8531 | 0.6963 | 0.8531 | 0.9236 |
| 0.4811 | 5.3263 | 506 | 0.8212 | 0.6812 | 0.8212 | 0.9062 |
| 0.4811 | 5.3474 | 508 | 0.8163 | 0.7123 | 0.8163 | 0.9035 |
| 0.4811 | 5.3684 | 510 | 0.8057 | 0.6906 | 0.8057 | 0.8976 |
| 0.4811 | 5.3895 | 512 | 0.7959 | 0.6767 | 0.7959 | 0.8921 |
| 0.4811 | 5.4105 | 514 | 0.8258 | 0.7164 | 0.8258 | 0.9088 |
| 0.4811 | 5.4316 | 516 | 0.8374 | 0.6912 | 0.8374 | 0.9151 |
| 0.4811 | 5.4526 | 518 | 0.8287 | 0.6617 | 0.8287 | 0.9103 |
| 0.4811 | 5.4737 | 520 | 0.8049 | 0.6963 | 0.8049 | 0.8972 |
| 0.4811 | 5.4947 | 522 | 0.8358 | 0.6986 | 0.8358 | 0.9142 |
| 0.4811 | 5.5158 | 524 | 0.8658 | 0.6897 | 0.8658 | 0.9305 |
| 0.4811 | 5.5368 | 526 | 0.8271 | 0.6944 | 0.8271 | 0.9094 |
| 0.4811 | 5.5579 | 528 | 0.7890 | 0.7092 | 0.7890 | 0.8883 |
| 0.4811 | 5.5789 | 530 | 0.7809 | 0.6912 | 0.7809 | 0.8837 |
| 0.4811 | 5.6 | 532 | 0.8006 | 0.6617 | 0.8006 | 0.8948 |
| 0.4811 | 5.6211 | 534 | 0.8093 | 0.6617 | 0.8093 | 0.8996 |
| 0.4811 | 5.6421 | 536 | 0.8195 | 0.7133 | 0.8195 | 0.9053 |
| 0.4811 | 5.6632 | 538 | 0.8445 | 0.7042 | 0.8445 | 0.9190 |
| 0.4811 | 5.6842 | 540 | 0.8640 | 0.7042 | 0.8640 | 0.9295 |
| 0.4811 | 5.7053 | 542 | 0.8855 | 0.6957 | 0.8855 | 0.9410 |
| 0.4811 | 5.7263 | 544 | 0.8816 | 0.6861 | 0.8816 | 0.9390 |
| 0.4811 | 5.7474 | 546 | 0.8418 | 0.7092 | 0.8418 | 0.9175 |
| 0.4811 | 5.7684 | 548 | 0.8154 | 0.7083 | 0.8154 | 0.9030 |
| 0.4811 | 5.7895 | 550 | 0.8855 | 0.6962 | 0.8855 | 0.9410 |
| 0.4811 | 5.8105 | 552 | 0.9094 | 0.7037 | 0.9094 | 0.9536 |
| 0.4811 | 5.8316 | 554 | 0.8306 | 0.7284 | 0.8306 | 0.9114 |
| 0.4811 | 5.8526 | 556 | 0.7711 | 0.7607 | 0.7711 | 0.8781 |
| 0.4811 | 5.8737 | 558 | 0.7568 | 0.7632 | 0.7568 | 0.8699 |
| 0.4811 | 5.8947 | 560 | 0.7710 | 0.7534 | 0.7710 | 0.8781 |
| 0.4811 | 5.9158 | 562 | 0.7697 | 0.7448 | 0.7697 | 0.8773 |
| 0.4811 | 5.9368 | 564 | 0.7716 | 0.7361 | 0.7716 | 0.8784 |
| 0.4811 | 5.9579 | 566 | 0.8040 | 0.6993 | 0.8040 | 0.8967 |
| 0.4811 | 5.9789 | 568 | 0.8327 | 0.6901 | 0.8327 | 0.9125 |
| 0.4811 | 6.0 | 570 | 0.8404 | 0.6901 | 0.8404 | 0.9167 |
| 0.4811 | 6.0211 | 572 | 0.8492 | 0.6818 | 0.8492 | 0.9215 |
| 0.4811 | 6.0421 | 574 | 0.8724 | 0.7153 | 0.8724 | 0.9340 |
| 0.4811 | 6.0632 | 576 | 0.8440 | 0.7153 | 0.8440 | 0.9187 |
| 0.4811 | 6.0842 | 578 | 0.7903 | 0.7361 | 0.7903 | 0.8890 |
| 0.4811 | 6.1053 | 580 | 0.7683 | 0.7568 | 0.7683 | 0.8765 |
| 0.4811 | 6.1263 | 582 | 0.8036 | 0.7089 | 0.8036 | 0.8964 |
| 0.4811 | 6.1474 | 584 | 0.8803 | 0.6918 | 0.8803 | 0.9383 |
| 0.4811 | 6.1684 | 586 | 0.8904 | 0.6918 | 0.8904 | 0.9436 |
| 0.4811 | 6.1895 | 588 | 0.8325 | 0.7089 | 0.8325 | 0.9124 |
| 0.4811 | 6.2105 | 590 | 0.7784 | 0.7421 | 0.7784 | 0.8823 |
| 0.4811 | 6.2316 | 592 | 0.7479 | 0.75 | 0.7479 | 0.8648 |
| 0.4811 | 6.2526 | 594 | 0.7431 | 0.7468 | 0.7431 | 0.8620 |
| 0.4811 | 6.2737 | 596 | 0.7519 | 0.7421 | 0.7519 | 0.8671 |
| 0.4811 | 6.2947 | 598 | 0.7830 | 0.7342 | 0.7830 | 0.8849 |
| 0.4811 | 6.3158 | 600 | 0.8071 | 0.7342 | 0.8071 | 0.8984 |
| 0.4811 | 6.3368 | 602 | 0.8136 | 0.725 | 0.8136 | 0.9020 |
| 0.4811 | 6.3579 | 604 | 0.8557 | 0.7059 | 0.8557 | 0.9250 |
| 0.4811 | 6.3789 | 606 | 0.8590 | 0.6914 | 0.8590 | 0.9268 |
| 0.4811 | 6.4 | 608 | 0.8931 | 0.6667 | 0.8931 | 0.9450 |
| 0.4811 | 6.4211 | 610 | 0.9056 | 0.6795 | 0.9056 | 0.9516 |
| 0.4811 | 6.4421 | 612 | 0.8982 | 0.6575 | 0.8982 | 0.9477 |
| 0.4811 | 6.4632 | 614 | 0.8548 | 0.6906 | 0.8548 | 0.9246 |
| 0.4811 | 6.4842 | 616 | 0.8133 | 0.7286 | 0.8133 | 0.9018 |
| 0.4811 | 6.5053 | 618 | 0.7825 | 0.7286 | 0.7825 | 0.8846 |
| 0.4811 | 6.5263 | 620 | 0.7528 | 0.7286 | 0.7528 | 0.8676 |
| 0.4811 | 6.5474 | 622 | 0.7508 | 0.7361 | 0.7508 | 0.8665 |
| 0.4811 | 6.5684 | 624 | 0.7626 | 0.7234 | 0.7626 | 0.8732 |
| 0.4811 | 6.5895 | 626 | 0.7882 | 0.7234 | 0.7882 | 0.8878 |
| 0.4811 | 6.6105 | 628 | 0.8064 | 0.7092 | 0.8064 | 0.8980 |
| 0.4811 | 6.6316 | 630 | 0.8403 | 0.6809 | 0.8403 | 0.9167 |
| 0.4811 | 6.6526 | 632 | 0.9155 | 0.64 | 0.9155 | 0.9568 |
| 0.4811 | 6.6737 | 634 | 0.9231 | 0.6364 | 0.9231 | 0.9608 |
| 0.4811 | 6.6947 | 636 | 0.8656 | 0.6957 | 0.8656 | 0.9304 |
| 0.4811 | 6.7158 | 638 | 0.8013 | 0.7407 | 0.8013 | 0.8951 |
| 0.4811 | 6.7368 | 640 | 0.7869 | 0.7531 | 0.7869 | 0.8871 |
| 0.4811 | 6.7579 | 642 | 0.7966 | 0.7421 | 0.7966 | 0.8925 |
| 0.4811 | 6.7789 | 644 | 0.8163 | 0.75 | 0.8163 | 0.9035 |
| 0.4811 | 6.8 | 646 | 0.8503 | 0.6994 | 0.8503 | 0.9221 |
| 0.4811 | 6.8211 | 648 | 0.8598 | 0.7 | 0.8598 | 0.9273 |
| 0.4811 | 6.8421 | 650 | 0.8585 | 0.7044 | 0.8585 | 0.9265 |
| 0.4811 | 6.8632 | 652 | 0.8502 | 0.7006 | 0.8502 | 0.9221 |
| 0.4811 | 6.8842 | 654 | 0.8243 | 0.7097 | 0.8243 | 0.9079 |
| 0.4811 | 6.9053 | 656 | 0.8054 | 0.6923 | 0.8054 | 0.8974 |
| 0.4811 | 6.9263 | 658 | 0.7997 | 0.7179 | 0.7997 | 0.8943 |
| 0.4811 | 6.9474 | 660 | 0.8076 | 0.7215 | 0.8076 | 0.8987 |
| 0.4811 | 6.9684 | 662 | 0.8214 | 0.7215 | 0.8214 | 0.9063 |
| 0.4811 | 6.9895 | 664 | 0.8254 | 0.7013 | 0.8254 | 0.9085 |
| 0.4811 | 7.0105 | 666 | 0.8169 | 0.6980 | 0.8169 | 0.9038 |
| 0.4811 | 7.0316 | 668 | 0.8225 | 0.6897 | 0.8225 | 0.9069 |
| 0.4811 | 7.0526 | 670 | 0.8333 | 0.6901 | 0.8333 | 0.9128 |
| 0.4811 | 7.0737 | 672 | 0.8235 | 0.6853 | 0.8235 | 0.9075 |
| 0.4811 | 7.0947 | 674 | 0.8004 | 0.7183 | 0.8004 | 0.8947 |
| 0.4811 | 7.1158 | 676 | 0.7771 | 0.6993 | 0.7771 | 0.8815 |
| 0.4811 | 7.1368 | 678 | 0.7835 | 0.7042 | 0.7835 | 0.8852 |
| 0.4811 | 7.1579 | 680 | 0.7997 | 0.7042 | 0.7997 | 0.8942 |
| 0.4811 | 7.1789 | 682 | 0.8404 | 0.6765 | 0.8404 | 0.9167 |
| 0.4811 | 7.2 | 684 | 0.9466 | 0.6277 | 0.9466 | 0.9729 |
| 0.4811 | 7.2211 | 686 | 0.9833 | 0.5970 | 0.9833 | 0.9916 |
| 0.4811 | 7.2421 | 688 | 0.9646 | 0.6212 | 0.9646 | 0.9821 |
| 0.4811 | 7.2632 | 690 | 0.9054 | 0.6565 | 0.9054 | 0.9515 |

### Framework versions

- Transformers 4.44.2
- Pytorch 2.4.0+cu118
- Datasets 2.21.0
- Tokenizers 0.19.1
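
The Qwk, Mse, and Rmse columns reported above can be reproduced with a few lines of plain Python. The sketch below is not the evaluation code used for this model, which the card does not include; the function names are hypothetical. It assumes Qwk means quadratic weighted kappa computed on integer scores, and that continuous predictions are rounded to the nearest integer before computing kappa. The card states neither point, so treat both as assumptions.

```python
import math
from typing import Sequence


def mse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean squared error between gold scores and predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root mean squared error, i.e. the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


def quadratic_weighted_kappa(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Cohen's kappa with quadratic weights over integer ratings."""
    lo = min(min(y_true), min(y_pred))
    hi = max(max(y_true), max(y_pred))
    k = hi - lo + 1  # number of rating categories spanned by the data
    if k == 1:
        # Degenerate case: one label everywhere. Returning 1.0 is a
        # convention chosen for this sketch, not a property of the metric.
        return 1.0
    n = len(y_true)

    # Observed confusion matrix: rows are gold ratings, columns predictions.
    observed = [[0] * k for _ in range(k)]
    for t, p in zip(y_true, y_pred):
        observed[t - lo][p - lo] += 1

    true_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    num = 0.0  # weighted observed disagreement
    den = 0.0  # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2
            num += weight * observed[i][j]
            den += weight * true_hist[i] * pred_hist[j] / n
    return 1.0 - num / den


# Hypothetical scores, for illustration only (not taken from the dataset).
gold = [0, 1, 2, 3]
raw_predictions = [0.2, 1.4, 1.8, 2.9]
rounded = [int(round(p)) for p in raw_predictions]  # assumed rounding step
print(mse(gold, raw_predictions), rmse(gold, raw_predictions))
print(quadratic_weighted_kappa(gold, rounded))
```

Two things in the tables are consistent with this reading. Loss and Mse are identical in every row, which suggests the model was trained as a regressor with a mean-squared-error loss, and each Rmse value is the square root of the Mse beside it (for example, the square root of 0.9054 is about 0.9515). A Qwk of 0.0 is what this formula gives when every prediction falls in a single class.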