End of training
Browse files
README.md
CHANGED
@@ -1,6 +1,6 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
-
base_model: distilbert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
metrics:
|
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
# test_trainer3b
|
20 |
|
21 |
-
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
-
- Loss:
|
24 |
- Precision: 1.0
|
25 |
- Recall: 1.0
|
26 |
- F1: 1.0
|
@@ -44,18 +44,20 @@ More information needed
|
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
- learning_rate: 2e-05
|
47 |
-
- train_batch_size:
|
48 |
-
- eval_batch_size:
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
-
- num_epochs:
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
-
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1
|
57 |
-
|
58 |
-
|
|
|
|
|
|
59 |
|
60 |
|
61 |
### Framework versions
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
base_model: distilbert/distilbert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
metrics:
|
|
|
18 |
|
19 |
# test_trainer3b
|
20 |
|
21 |
+
This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on the None dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
+
- Loss: 1.4268
|
24 |
- Precision: 1.0
|
25 |
- Recall: 1.0
|
26 |
- F1: 1.0
|
|
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
- learning_rate: 2e-05
|
47 |
+
- train_batch_size: 64
|
48 |
+
- eval_batch_size: 64
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
+
- num_epochs: 8
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
+
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|
57 |
+
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
|
58 |
+
| 1.9001 | 2.5 | 10 | 1.8054 | 0.5581 | 0.4762 | 0.3941 | 0.4762 |
|
59 |
+
| 1.7074 | 5.0 | 20 | 1.5607 | 0.9429 | 0.9048 | 0.8929 | 0.9048 |
|
60 |
+
| 1.5246 | 7.5 | 30 | 1.4268 | 1.0 | 1.0 | 1.0 | 1.0 |
|
61 |
|
62 |
|
63 |
### Framework versions
|
config.json
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
{
|
2 |
-
"_name_or_path": "distilbert-base-uncased",
|
3 |
"activation": "gelu",
|
4 |
"architectures": [
|
5 |
"DistilBertForSequenceClassification"
|
|
|
1 |
{
|
2 |
+
"_name_or_path": "distilbert/distilbert-base-uncased",
|
3 |
"activation": "gelu",
|
4 |
"architectures": [
|
5 |
"DistilBertForSequenceClassification"
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 267847948
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5c742c4ad2978bb415fdcc9a3a86a05ed60270c4e4cfdb0607c101cf43d61f6d
|
3 |
size 267847948
|
runs/Mar11_12-18-43_ed81b744f0b3/events.out.tfevents.1710159530.ed81b744f0b3.1246.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e5d6d3403f12b0438e146419b5162dc72d42a0830fba1228666f3022e4ecead7
|
3 |
+
size 4781
|
runs/Mar11_12-26-15_ed81b744f0b3/events.out.tfevents.1710159981.ed81b744f0b3.1246.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3f3a8661214bcdf6fc165a44ab4cf94ad58b6f2c3d9d6219cba0a86d82c338eb
|
3 |
+
size 7112
|
tokenizer_config.json
CHANGED
@@ -46,7 +46,7 @@
|
|
46 |
"do_basic_tokenize": true,
|
47 |
"do_lower_case": true,
|
48 |
"mask_token": "[MASK]",
|
49 |
-
"model_max_length":
|
50 |
"never_split": null,
|
51 |
"pad_token": "[PAD]",
|
52 |
"sep_token": "[SEP]",
|
|
|
46 |
"do_basic_tokenize": true,
|
47 |
"do_lower_case": true,
|
48 |
"mask_token": "[MASK]",
|
49 |
+
"model_max_length": 1000000000000000019884624838656,
|
50 |
"never_split": null,
|
51 |
"pad_token": "[PAD]",
|
52 |
"sep_token": "[SEP]",
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4856
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dab36f7ba75fc136a8ab8e0dc8a78ec969fc38f94b57cedefe091ba249512db5
|
3 |
size 4856
|