lekhnathrijal/bert-question-ner

Token Classification

Transformers

Safetensors

distilbert

Generated from Trainer


lekhnathrijal committed on Jan 24

Commit f9b30b6 (verified) · 1 Parent(s): b3475e5

ai-research-lab/bert-question-ner


Files changed (2)

README.md +16 -21
model.safetensors +1 -1

README.md CHANGED

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1998
-- Precision: 0.7435
-- Recall: 0.8125
-- F1: 0.7765
-- Accuracy: 0.9363
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 8e-06
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
@@ -57,21 +57,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log        | 0.3311 | 100  | 1.1929          | 0.0       | 0.0    | 0.0    | 0.6526   |
-| No log        | 0.6623 | 200  | 0.7091          | 0.3717    | 0.3448 | 0.3577 | 0.7463   |
-| No log        | 0.9934 | 300  | 0.4627          | 0.4419    | 0.5827 | 0.5026 | 0.8561   |
-| No log        | 1.3245 | 400  | 0.3144          | 0.6347    | 0.7077 | 0.6692 | 0.9036   |
-| 0.8387        | 1.6556 | 500  | 0.2558          | 0.6270    | 0.7016 | 0.6622 | 0.9126   |
-| 0.8387        | 1.9868 | 600  | 0.2280          | 0.6944    | 0.7742 | 0.7321 | 0.9233   |
-| 0.8387        | 2.3179 | 700  | 0.2168          | 0.6890    | 0.7460 | 0.7164 | 0.9238   |
-| 0.8387        | 2.6490 | 800  | 0.2111          | 0.7083    | 0.7883 | 0.7462 | 0.9301   |
-| 0.8387        | 2.9801 | 900  | 0.2065          | 0.7230    | 0.7843 | 0.7524 | 0.9313   |
-| 0.2207        | 3.3113 | 1000 | 0.2069          | 0.7288    | 0.7802 | 0.7537 | 0.9318   |
-| 0.2207        | 3.6424 | 1100 | 0.1979          | 0.7274    | 0.7964 | 0.7603 | 0.9333   |
-| 0.2207        | 3.9735 | 1200 | 0.1926          | 0.7412    | 0.8024 | 0.7706 | 0.9366   |
-| 0.2207        | 4.3046 | 1300 | 0.1998          | 0.7435    | 0.8125 | 0.7765 | 0.9363   |
-| 0.2207        | 4.6358 | 1400 | 0.2014          | 0.7375    | 0.8044 | 0.7695 | 0.9386   |
-| 0.1526        | 4.9669 | 1500 | 0.1925          | 0.7467    | 0.8024 | 0.7736 | 0.9381   |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2044
+- Precision: 0.7342
+- Recall: 0.7964
+- F1: 0.7640
+- Accuracy: 0.9338
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 9e-06
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 0.3311 | 100  | 1.1156          | 0.0       | 0.0    | 0.0    | 0.6528   |
+| No log        | 0.6623 | 200  | 0.6775          | 0.3169    | 0.4012 | 0.3541 | 0.7815   |
+| No log        | 0.9934 | 300  | 0.4010          | 0.5       | 0.6310 | 0.5579 | 0.8771   |
+| No log        | 1.3245 | 400  | 0.2844          | 0.6344    | 0.6996 | 0.6654 | 0.9046   |
+| 0.7464        | 1.6556 | 500  | 0.2394          | 0.6404    | 0.7036 | 0.6705 | 0.9163   |
+| 0.7464        | 1.9868 | 600  | 0.2204          | 0.6774    | 0.7661 | 0.7190 | 0.9241   |
+| 0.7464        | 2.3179 | 700  | 0.2080          | 0.7143    | 0.7460 | 0.7298 | 0.9288   |
+| 0.7464        | 2.6490 | 800  | 0.2044          | 0.7342    | 0.7964 | 0.7640 | 0.9338   |
+| 0.7464        | 2.9801 | 900  | 0.2055          | 0.7227    | 0.7883 | 0.7541 | 0.9346   |
+| 0.2123        | 3.3113 | 1000 | 0.2030          | 0.7361    | 0.7762 | 0.7556 | 0.9353   |
 ### Framework versions
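
The F1 column in the updated table can be sanity-checked against the precision and recall columns. The sketch below assumes F1 is reported as the harmonic mean of the two (the usual definition, though the README does not say which metric library produced it). The helper names `f1_from_pr` and `max_f1_discrepancy` are illustrative, not part of the training code.

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from rows of the updated table.
ROWS = [
    (0.0, 0.0, 0.0),            # step 100
    (0.7342, 0.7964, 0.7640),   # step 800 (also the headline metrics)
    (0.7361, 0.7762, 0.7556),   # step 1000
]


def max_f1_discrepancy(rows) -> float:
    """Largest absolute gap between recomputed and reported F1."""
    return max(abs(f1_from_pr(p, r) - reported) for p, r, reported in rows)


if __name__ == "__main__":
    for p, r, reported in ROWS:
        print(f"P={p:.4f} R={r:.4f} reported={reported:.4f} "
              f"recomputed={f1_from_pr(p, r):.4f}")
```

Because the table values are rounded to four decimals, small differences in the last digit are expected when recomputing.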

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f90e418667e00e4b75a0c4ffd31ec321b96728c3370205c3fbf64c8881cff02a
 size 265485396

 version https://git-lfs.github.com/spec/v1
+oid sha256:0847b1bd372e4ad36e7487e5d93edc5c7660826cc1dad7e5e1b9e04ac9b952ce
 size 265485396
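
The `model.safetensors` entry in the repository is a Git LFS pointer: `oid` is the SHA-256 of the real file's contents and `size` is its length in bytes. A minimal sketch for checking a downloaded file against the new pointer follows; the function names and the local path `model.safetensors` are assumptions for illustration.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:0847b1bd372e4ad36e7487e5d93edc5c7660826cc1dad7e5e1b9e04ac9b952ce
size 265485396
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into version, hash algorithm, digest and size."""
    fields = dict(
        line.strip().split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash both match the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so a large weights file is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_POINTER)
    local = Path("model.safetensors")  # hypothetical local download path
    if local.exists():
        print("match:", file_matches_pointer(local, pointer))
```

The size is identical before and after the commit while the hash changes, which is what one would expect when the weights are retrained without changing the architecture.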