Aleksandar
/

distilbert-srb-ner

@@ -21,7 +21,7 @@ model_index:
     metric:
       name: Accuracy
       type: accuracy
-      value: 0.9570619691726958
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,11 +31,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on the wikiann dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2532
-- Precision: 0.8859
-- Recall: 0.9066
-- F1: 0.8962
-- Accuracy: 0.9571
 ## Model description
@@ -55,27 +55,21 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.2615        | 1.0   | 1250  | 0.2126          | 0.8268    | 0.8327 | 0.8297 | 0.9319   |
-| 0.1568        | 2.0   | 2500  | 0.1775          | 0.8695    | 0.8699 | 0.8697 | 0.9472   |
-| 0.1017        | 3.0   | 3750  | 0.1718          | 0.8649    | 0.8857 | 0.8752 | 0.9504   |
-| 0.066         | 4.0   | 5000  | 0.1906          | 0.8734    | 0.8930 | 0.8831 | 0.9530   |
-| 0.0413        | 5.0   | 6250  | 0.2076          | 0.8805    | 0.8992 | 0.8897 | 0.9549   |
-| 0.03          | 6.0   | 7500  | 0.2257          | 0.8758    | 0.9045 | 0.8899 | 0.9554   |
-| 0.0213        | 7.0   | 8750  | 0.2286          | 0.8864    | 0.9015 | 0.8939 | 0.9556   |
-| 0.0157        | 8.0   | 10000 | 0.2454          | 0.8874    | 0.9021 | 0.8947 | 0.9566   |
-| 0.01          | 9.0   | 11250 | 0.2486          | 0.8878    | 0.9043 | 0.8960 | 0.9573   |
-| 0.0076        | 10.0  | 12500 | 0.2532          | 0.8859    | 0.9066 | 0.8962 | 0.9571   |
 ### Framework versions

     metric:
       name: Accuracy
       type: accuracy
+      value: 0.9503250498060186
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model was trained from scratch on the wikiann dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1723
+- Precision: 0.8667
+- Recall: 0.8860
+- F1: 0.8763
+- Accuracy: 0.9503
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 32
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 4
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.3839        | 1.0   | 625  | 0.2204          | 0.8112    | 0.8367 | 0.8238 | 0.9298   |
+| 0.2004        | 2.0   | 1250 | 0.1805          | 0.8530    | 0.8676 | 0.8602 | 0.9442   |
+| 0.1475        | 3.0   | 1875 | 0.1716          | 0.8536    | 0.8778 | 0.8655 | 0.9467   |
+| 0.0943        | 4.0   | 2500 | 0.1723          | 0.8667    | 0.8860 | 0.8763 | 0.9503   |
 ### Framework versions

config.json CHANGED Viewed

@@ -9,13 +9,13 @@
   "dropout": 0.1,
   "hidden_dim": 3072,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3",
-    "4": "LABEL_4",
-    "5": "LABEL_5",
-    "6": "LABEL_6"
   },
   "initializer_range": 0.02,
   "label2id": {

   "dropout": 0.1,
   "hidden_dim": 3072,
   "id2label": {
+    "0": "O",
+    "1": "B-per",
+    "2": "I-per",
+    "3": "B-org",
+    "4": "I-org",
+    "5": "B-loc",
+    "6": "I-loc"
   },
   "initializer_range": 0.02,
   "label2id": {

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ec278e2e6a712f0e95da93b29462d1a5264e4ece5970fd9a42654c335808a512
 size 326138357

 version https://git-lfs.github.com/spec/v1
+oid sha256:a123498a940810867bed475247fd308d558c5d916cf6613e424800c6110509f0
 size 326138357

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d0287ce65dbcec4df2dfdbdd416dd8bf259fb567921a3e005e33e8a703ae04a
 size 2671

 version https://git-lfs.github.com/spec/v1
+oid sha256:a4c39897724c9cda6d391e41de70c86fd24f73bbe7fcf68bd625d0c7251be61c
 size 2671