End of training
Browse files
README.md
CHANGED
@@ -3,11 +3,6 @@ license: apache-2.0
|
|
3 |
base_model: distilbert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
-
metrics:
|
7 |
-
- precision
|
8 |
-
- recall
|
9 |
-
- f1
|
10 |
-
- accuracy
|
11 |
model-index:
|
12 |
- name: trainer_2f
|
13 |
results: []
|
@@ -20,11 +15,16 @@ should probably proofread and complete it, then remove this comment. -->
|
|
20 |
|
21 |
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
-
-
|
24 |
-
-
|
25 |
-
-
|
26 |
-
-
|
27 |
-
-
|
|
|
|
|
|
|
|
|
|
|
28 |
|
29 |
## Model description
|
30 |
|
@@ -51,30 +51,6 @@ The following hyperparameters were used during training:
|
|
51 |
- lr_scheduler_type: linear
|
52 |
- num_epochs: 5
|
53 |
|
54 |
-
### Training results
|
55 |
-
|
56 |
-
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|
57 |
-
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
|
58 |
-
| 1.8737 | 0.27 | 30 | 1.6882 | 0.3681 | 0.3669 | 0.3246 | 0.3669 |
|
59 |
-
| 1.4895 | 0.54 | 60 | 1.2905 | 0.6124 | 0.4650 | 0.4154 | 0.4650 |
|
60 |
-
| 1.21 | 0.81 | 90 | 0.9996 | 0.6787 | 0.6779 | 0.6631 | 0.6779 |
|
61 |
-
| 0.8822 | 1.08 | 120 | 0.7850 | 0.7483 | 0.7423 | 0.7338 | 0.7423 |
|
62 |
-
| 0.6126 | 1.35 | 150 | 0.7046 | 0.7708 | 0.7647 | 0.7638 | 0.7647 |
|
63 |
-
| 0.507 | 1.62 | 180 | 0.6944 | 0.7453 | 0.7395 | 0.7374 | 0.7395 |
|
64 |
-
| 0.3995 | 1.89 | 210 | 0.6340 | 0.7906 | 0.7815 | 0.7828 | 0.7815 |
|
65 |
-
| 0.2939 | 2.16 | 240 | 0.6349 | 0.7805 | 0.7759 | 0.7742 | 0.7759 |
|
66 |
-
| 0.2387 | 2.43 | 270 | 0.6617 | 0.7982 | 0.7899 | 0.7891 | 0.7899 |
|
67 |
-
| 0.2309 | 2.7 | 300 | 0.6608 | 0.7991 | 0.7871 | 0.7858 | 0.7871 |
|
68 |
-
| 0.1775 | 2.97 | 330 | 0.6360 | 0.8173 | 0.8095 | 0.8095 | 0.8095 |
|
69 |
-
| 0.1115 | 3.24 | 360 | 0.7238 | 0.7904 | 0.7871 | 0.7863 | 0.7871 |
|
70 |
-
| 0.1048 | 3.51 | 390 | 0.6657 | 0.8103 | 0.8039 | 0.8030 | 0.8039 |
|
71 |
-
| 0.0664 | 3.78 | 420 | 0.7289 | 0.8159 | 0.8095 | 0.8089 | 0.8095 |
|
72 |
-
| 0.0658 | 4.05 | 450 | 0.7678 | 0.8120 | 0.8011 | 0.8012 | 0.8011 |
|
73 |
-
| 0.0259 | 4.32 | 480 | 0.7615 | 0.8198 | 0.8123 | 0.8110 | 0.8123 |
|
74 |
-
| 0.0475 | 4.59 | 510 | 0.7533 | 0.7984 | 0.7927 | 0.7923 | 0.7927 |
|
75 |
-
| 0.0408 | 4.86 | 540 | 0.7645 | 0.7988 | 0.7927 | 0.7920 | 0.7927 |
|
76 |
-
|
77 |
-
|
78 |
### Framework versions
|
79 |
|
80 |
- Transformers 4.39.3
|
|
|
3 |
base_model: distilbert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
|
|
6 |
model-index:
|
7 |
- name: trainer_2f
|
8 |
results: []
|
|
|
15 |
|
16 |
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- eval_loss: 0.0017
|
19 |
+
- eval_precision: 1.0
|
20 |
+
- eval_recall: 1.0
|
21 |
+
- eval_f1: 1.0
|
22 |
+
- eval_accuracy: 1.0
|
23 |
+
- eval_runtime: 132.697
|
24 |
+
- eval_samples_per_second: 2.69
|
25 |
+
- eval_steps_per_second: 0.173
|
26 |
+
- epoch: 3.24
|
27 |
+
- step: 360
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
51 |
- lr_scheduler_type: linear
|
52 |
- num_epochs: 5
|
53 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
54 |
### Framework versions
|
55 |
|
56 |
- Transformers 4.39.3
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 267847948
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d88b5be7348706374bd9faa32dbf906dc554b9471ce4c1760eff29cadc16604b
|
3 |
size 267847948
|
runs/Apr05_13-01-00_fd0989d591d3/events.out.tfevents.1712322067.fd0989d591d3.314.2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2aa9337c40654b9d5aeeae83e0f5bbf0e862dda85e0a9e3b2cb55638e1343702
|
3 |
+
size 4763
|
runs/Apr05_13-03-18_fd0989d591d3/events.out.tfevents.1712322204.fd0989d591d3.314.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4f18cc8bfbabe9023f3b21cc86d5b58ab3ba6aae6c79a0ee04a333bfd83b8155
|
3 |
+
size 12907
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4920
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4951895d9dd29528fa168f04bafb2551bd2607566f5833e7a05e7de18eb99dcf
|
3 |
size 4920
|