tanoManzo commited on
Commit
94664da
·
verified ·
1 Parent(s): 5cc862f

End of training

Browse files
Files changed (2) hide show
  1. README.md +31 -31
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [AIRI-Institute/gena-lm-bigbird-base-t2t](https://huggingface.co/AIRI-Institute/gena-lm-bigbird-base-t2t) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.4554
22
- - F1 Score: 0.8794
23
- - Precision: 0.8386
24
- - Recall: 0.9243
25
- - Accuracy: 0.8677
26
- - Auc: 0.9401
27
- - Prc: 0.9344
28
 
29
  ## Model description
30
 
@@ -56,30 +56,30 @@ The following hyperparameters were used during training:
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | F1 Score | Precision | Recall | Accuracy | Auc | Prc |
58
  |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:--------:|:------:|:------:|
59
- | 0.5309 | 0.0840 | 500 | 0.4610 | 0.8214 | 0.7592 | 0.8947 | 0.7969 | 0.8757 | 0.8625 |
60
- | 0.4688 | 0.1681 | 1000 | 0.4717 | 0.8298 | 0.7467 | 0.9336 | 0.8001 | 0.8862 | 0.8753 |
61
- | 0.4569 | 0.2521 | 1500 | 0.4326 | 0.8304 | 0.7461 | 0.9362 | 0.8005 | 0.8889 | 0.8752 |
62
- | 0.4361 | 0.3361 | 2000 | 0.4337 | 0.8186 | 0.8527 | 0.7870 | 0.8180 | 0.9035 | 0.8988 |
63
- | 0.4222 | 0.4202 | 2500 | 0.4968 | 0.8434 | 0.7628 | 0.9430 | 0.8173 | 0.9095 | 0.8979 |
64
- | 0.4233 | 0.5042 | 3000 | 0.3891 | 0.8396 | 0.8674 | 0.8135 | 0.8378 | 0.9207 | 0.9177 |
65
- | 0.4031 | 0.5882 | 3500 | 0.3743 | 0.8564 | 0.8687 | 0.8444 | 0.8522 | 0.9262 | 0.9231 |
66
- | 0.3739 | 0.6723 | 4000 | 0.3970 | 0.8520 | 0.8662 | 0.8383 | 0.8480 | 0.9275 | 0.9269 |
67
- | 0.3891 | 0.7563 | 4500 | 0.4361 | 0.7852 | 0.9181 | 0.6859 | 0.8042 | 0.9277 | 0.9273 |
68
- | 0.3856 | 0.8403 | 5000 | 0.3882 | 0.8518 | 0.8904 | 0.8164 | 0.8517 | 0.9309 | 0.9306 |
69
- | 0.3926 | 0.9244 | 5500 | 0.3291 | 0.8693 | 0.8600 | 0.8789 | 0.8622 | 0.9328 | 0.9320 |
70
- | 0.3737 | 1.0084 | 6000 | 0.3546 | 0.8571 | 0.8783 | 0.8370 | 0.8544 | 0.9331 | 0.9329 |
71
- | 0.346 | 1.0924 | 6500 | 0.4352 | 0.8719 | 0.8378 | 0.9088 | 0.8606 | 0.9345 | 0.9317 |
72
- | 0.3355 | 1.1765 | 7000 | 0.3880 | 0.8665 | 0.8560 | 0.8773 | 0.8590 | 0.9362 | 0.9350 |
73
- | 0.3452 | 1.2605 | 7500 | 0.3991 | 0.8737 | 0.8279 | 0.9249 | 0.8605 | 0.9376 | 0.9368 |
74
- | 0.3618 | 1.3445 | 8000 | 0.3564 | 0.8645 | 0.8804 | 0.8492 | 0.8612 | 0.9381 | 0.9382 |
75
- | 0.3335 | 1.4286 | 8500 | 0.4719 | 0.8376 | 0.9110 | 0.7751 | 0.8432 | 0.9381 | 0.9381 |
76
- | 0.3671 | 1.5126 | 9000 | 0.3808 | 0.8748 | 0.8607 | 0.8895 | 0.8672 | 0.9404 | 0.9405 |
77
- | 0.3505 | 1.5966 | 9500 | 0.4061 | 0.8801 | 0.8690 | 0.8914 | 0.8733 | 0.9403 | 0.9405 |
78
- | 0.3602 | 1.6807 | 10000 | 0.4968 | 0.8485 | 0.9112 | 0.7938 | 0.8521 | 0.9409 | 0.9412 |
79
- | 0.348 | 1.7647 | 10500 | 0.4114 | 0.8781 | 0.8319 | 0.9298 | 0.8654 | 0.9403 | 0.9369 |
80
- | 0.3435 | 1.8487 | 11000 | 0.3816 | 0.8745 | 0.8828 | 0.8663 | 0.8702 | 0.9415 | 0.9399 |
81
- | 0.3701 | 1.9328 | 11500 | 0.3733 | 0.8641 | 0.8865 | 0.8428 | 0.8617 | 0.9395 | 0.9369 |
82
- | 0.3281 | 2.0168 | 12000 | 0.4554 | 0.8794 | 0.8386 | 0.9243 | 0.8677 | 0.9401 | 0.9344 |
83
 
84
 
85
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [AIRI-Institute/gena-lm-bigbird-base-t2t](https://huggingface.co/AIRI-Institute/gena-lm-bigbird-base-t2t) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.4979
22
+ - F1 Score: 0.8766
23
+ - Precision: 0.8781
24
+ - Recall: 0.8750
25
+ - Accuracy: 0.8683
26
+ - Auc: 0.9406
27
+ - Prc: 0.9418
28
 
29
  ## Model description
30
 
 
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | F1 Score | Precision | Recall | Accuracy | Auc | Prc |
58
  |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:--------:|:------:|:------:|
59
+ | 0.5349 | 0.0841 | 500 | 0.4552 | 0.8265 | 0.7778 | 0.8816 | 0.8022 | 0.8727 | 0.8639 |
60
+ | 0.4552 | 0.1682 | 1000 | 0.4734 | 0.8272 | 0.7263 | 0.9607 | 0.7856 | 0.8927 | 0.8877 |
61
+ | 0.4577 | 0.2523 | 1500 | 0.4191 | 0.8381 | 0.7512 | 0.9477 | 0.8044 | 0.9022 | 0.9005 |
62
+ | 0.4282 | 0.3364 | 2000 | 0.4104 | 0.8528 | 0.7777 | 0.9440 | 0.8259 | 0.9128 | 0.8970 |
63
+ | 0.4127 | 0.4205 | 2500 | 0.3636 | 0.8611 | 0.8367 | 0.8870 | 0.8471 | 0.9213 | 0.9216 |
64
+ | 0.4226 | 0.5045 | 3000 | 0.3621 | 0.8623 | 0.8096 | 0.9223 | 0.8426 | 0.9248 | 0.9255 |
65
+ | 0.4231 | 0.5886 | 3500 | 0.3553 | 0.8629 | 0.7931 | 0.9462 | 0.8394 | 0.9317 | 0.9329 |
66
+ | 0.3945 | 0.6727 | 4000 | 0.3843 | 0.8631 | 0.7856 | 0.9575 | 0.8377 | 0.9345 | 0.9341 |
67
+ | 0.3911 | 0.7568 | 4500 | 0.4173 | 0.8681 | 0.8571 | 0.8794 | 0.8572 | 0.9315 | 0.9330 |
68
+ | 0.4233 | 0.8409 | 5000 | 0.3419 | 0.8741 | 0.8249 | 0.9295 | 0.8569 | 0.9355 | 0.9376 |
69
+ | 0.3787 | 0.9250 | 5500 | 0.3880 | 0.8650 | 0.7891 | 0.9572 | 0.8404 | 0.9346 | 0.9357 |
70
+ | 0.3849 | 1.0091 | 6000 | 0.3629 | 0.8766 | 0.8512 | 0.9037 | 0.8641 | 0.9353 | 0.9359 |
71
+ | 0.3522 | 1.0932 | 6500 | 0.3683 | 0.8803 | 0.8558 | 0.9062 | 0.8683 | 0.9381 | 0.9381 |
72
+ | 0.3376 | 1.1773 | 7000 | 0.4292 | 0.8640 | 0.7824 | 0.9644 | 0.8377 | 0.9392 | 0.9373 |
73
+ | 0.365 | 1.2614 | 7500 | 0.4852 | 0.8667 | 0.7858 | 0.9663 | 0.8412 | 0.9403 | 0.9371 |
74
+ | 0.3569 | 1.3454 | 8000 | 0.5700 | 0.8720 | 0.8112 | 0.9427 | 0.8522 | 0.9352 | 0.9287 |
75
+ | 0.3822 | 1.4295 | 8500 | 0.3894 | 0.8817 | 0.8720 | 0.8917 | 0.8722 | 0.9406 | 0.9418 |
76
+ | 0.3391 | 1.5136 | 9000 | 0.4167 | 0.8696 | 0.8863 | 0.8536 | 0.8633 | 0.9413 | 0.9434 |
77
+ | 0.3591 | 1.5977 | 9500 | 0.3554 | 0.8853 | 0.8631 | 0.9087 | 0.8742 | 0.9432 | 0.9436 |
78
+ | 0.3699 | 1.6818 | 10000 | 0.4540 | 0.8812 | 0.8868 | 0.8757 | 0.8739 | 0.9440 | 0.9441 |
79
+ | 0.3777 | 1.7659 | 10500 | 0.4137 | 0.8849 | 0.8583 | 0.9131 | 0.8730 | 0.9421 | 0.9423 |
80
+ | 0.3602 | 1.8500 | 11000 | 0.3798 | 0.8736 | 0.8835 | 0.8640 | 0.8665 | 0.9414 | 0.9444 |
81
+ | 0.3583 | 1.9341 | 11500 | 0.4461 | 0.8840 | 0.8405 | 0.9323 | 0.8693 | 0.9438 | 0.9458 |
82
+ | 0.3573 | 2.0182 | 12000 | 0.4979 | 0.8766 | 0.8781 | 0.8750 | 0.8683 | 0.9406 | 0.9418 |
83
 
84
 
85
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dc29a07a5e440f2dd5926c46d9c9de90442b35a121f36a4ed0160c01c095f092
3
  size 455871688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd9e42cdae6622d728acdd1da70111d6e498fce0c367989c46d9f662d8bd9668
3
  size 455871688