thusken
/

nb-bert-large-user-needs-v2

Safetensors

bert

Generated from Trainer

Model card Files Files and versions Community

thusken commited on Nov 26, 2024

Commit

42c48f0

verified ·

1 Parent(s): 1c5c95c

End of training

Browse files

Files changed (2) hide show

README.md +31 -16
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
-license: cc-by-4.0
-base_model: NbAiLab/nb-bert-large
 tags:
 - generated_from_trainer
 metrics:
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
 # nb-bert-large-user-needs-v2
-This model is a fine-tuned version of [NbAiLab/nb-bert-large](https://huggingface.co/NbAiLab/nb-bert-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6998
-- Accuracy: 0.7947
-- F1: 0.7920
-- Precision: 0.7912
-- Recall: 0.7947
 ## Model description
@@ -44,24 +44,39 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 8
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| No log        | 1.0   | 188  | 0.6480          | 0.7413   | 0.7120 | 0.6885    | 0.7413 |
-| No log        | 2.0   | 376  | 0.5416          | 0.792    | 0.7794 | 0.7902    | 0.792  |
-| 0.6813        | 3.0   | 564  | 0.5644          | 0.7627   | 0.7564 | 0.7594    | 0.7627 |
-| 0.6813        | 4.0   | 752  | 0.5669          | 0.7733   | 0.7719 | 0.7817    | 0.7733 |
-| 0.6813        | 5.0   | 940  | 0.6998          | 0.7947   | 0.7920 | 0.7912    | 0.7947 |
 ### Framework versions

 ---
+license: apache-2.0
+base_model: ltg/norbert3-large
 tags:
 - generated_from_trainer
 metrics:
 # nb-bert-large-user-needs-v2
+This model is a fine-tuned version of [ltg/norbert3-large](https://huggingface.co/ltg/norbert3-large) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.0724
+- Accuracy: 0.6853
+- F1: 0.6674
+- Precision: 0.6636
+- Recall: 0.6853
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 4
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| No log        | 1.0   | 375  | 0.8229          | 0.6453   | 0.6076 | 0.6107    | 0.6453 |
+| 0.9043        | 2.0   | 750  | 0.8588          | 0.6667   | 0.6416 | 0.6207    | 0.6667 |
+| 0.744         | 3.0   | 1125 | 0.8531          | 0.656    | 0.6300 | 0.6063    | 0.656  |
+| 0.5288        | 4.0   | 1500 | 1.5591          | 0.6827   | 0.6513 | 0.6425    | 0.6827 |
+| 0.5288        | 5.0   | 1875 | 2.0799          | 0.6507   | 0.6495 | 0.6493    | 0.6507 |
+| 0.1999        | 6.0   | 2250 | 2.7613          | 0.664    | 0.6465 | 0.6476    | 0.664  |
+| 0.0406        | 7.0   | 2625 | 3.0547          | 0.6267   | 0.6044 | 0.6314    | 0.6267 |
+| 0.0402        | 8.0   | 3000 | 2.7554          | 0.672    | 0.6613 | 0.6515    | 0.672  |
+| 0.0402        | 9.0   | 3375 | 2.9426          | 0.6587   | 0.6529 | 0.6478    | 0.6587 |
+| 0.0229        | 10.0  | 3750 | 3.0100          | 0.664    | 0.6505 | 0.6462    | 0.664  |
+| 0.0075        | 11.0  | 4125 | 2.9582          | 0.656    | 0.6375 | 0.6266    | 0.656  |
+| 0.0112        | 12.0  | 4500 | 2.9971          | 0.656    | 0.6424 | 0.6335    | 0.656  |
+| 0.0112        | 13.0  | 4875 | 3.1416          | 0.6587   | 0.6422 | 0.6297    | 0.6587 |
+| 0.0054        | 14.0  | 5250 | 3.1233          | 0.68     | 0.6597 | 0.6495    | 0.68   |
+| 0.003         | 15.0  | 5625 | 3.1611          | 0.6773   | 0.6632 | 0.6657    | 0.6773 |
+| 0.0071        | 16.0  | 6000 | 3.0724          | 0.6853   | 0.6674 | 0.6636    | 0.6853 |
+| 0.0071        | 17.0  | 6375 | 3.0737          | 0.6667   | 0.6538 | 0.6468    | 0.6667 |
+| 0.0029        | 18.0  | 6750 | 3.1641          | 0.6747   | 0.6659 | 0.6599    | 0.6747 |
+| 0.0024        | 19.0  | 7125 | 3.2062          | 0.6693   | 0.6597 | 0.6542    | 0.6693 |
+| 0.0012        | 20.0  | 7500 | 3.2474          | 0.6747   | 0.6650 | 0.6595    | 0.6747 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:03bde9f6eaa14aaf5091648e8071dff71c9d6878817e3c2babee20756a18b3e5
 size 1417659372

 version https://git-lfs.github.com/spec/v1
+oid sha256:46911afbc9dce0b0d42651b7d46cdbe01cf3b28f6dd11d4769620e491995e65c
 size 1417659372