SharonTudi commited on
Commit
bc6edac
·
verified ·
1 Parent(s): 4263e87

End of training

Browse files
README.md CHANGED
@@ -3,6 +3,11 @@ license: apache-2.0
3
  base_model: distilbert-base-cased
4
  tags:
5
  - generated_from_trainer
 
 
 
 
 
6
  model-index:
7
  - name: DIALOGUE_second_model
8
  results: []
@@ -15,16 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - eval_loss: 1.0869
19
- - eval_precision: 0.8180
20
- - eval_recall: 0.8158
21
- - eval_f1: 0.8160
22
- - eval_accuracy: 0.8158
23
- - eval_runtime: 6.0635
24
- - eval_samples_per_second: 12.534
25
- - eval_steps_per_second: 1.649
26
- - epoch: 9.17
27
- - step: 440
28
 
29
  ## Model description
30
 
@@ -43,7 +43,7 @@ More information needed
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
- - learning_rate: 6e-05
47
  - train_batch_size: 8
48
  - eval_batch_size: 8
49
  - seed: 42
@@ -51,6 +51,27 @@ The following hyperparameters were used during training:
51
  - lr_scheduler_type: linear
52
  - num_epochs: 15
53
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
54
  ### Framework versions
55
 
56
  - Transformers 4.36.2
 
3
  base_model: distilbert-base-cased
4
  tags:
5
  - generated_from_trainer
6
+ metrics:
7
+ - precision
8
+ - recall
9
+ - f1
10
+ - accuracy
11
  model-index:
12
  - name: DIALOGUE_second_model
13
  results: []
 
20
 
21
  This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.1121
24
+ - Precision: 0.9762
25
+ - Recall: 0.9737
26
+ - F1: 0.9736
27
+ - Accuracy: 0.9737
 
 
 
 
 
28
 
29
  ## Model description
30
 
 
43
  ### Training hyperparameters
44
 
45
  The following hyperparameters were used during training:
46
+ - learning_rate: 1e-05
47
  - train_batch_size: 8
48
  - eval_batch_size: 8
49
  - seed: 42
 
51
  - lr_scheduler_type: linear
52
  - num_epochs: 15
53
 
54
+ ### Training results
55
+
56
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
57
+ |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
58
+ | 1.206 | 1.0 | 48 | 0.8496 | 0.8900 | 0.8421 | 0.8322 | 0.8421 |
59
+ | 0.621 | 2.0 | 96 | 0.2971 | 1.0 | 1.0 | 1.0 | 1.0 |
60
+ | 0.2411 | 3.0 | 144 | 0.1237 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
61
+ | 0.0988 | 4.0 | 192 | 0.0829 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
62
+ | 0.0471 | 5.0 | 240 | 0.0926 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
63
+ | 0.0213 | 6.0 | 288 | 0.0924 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
64
+ | 0.0142 | 7.0 | 336 | 0.1157 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
65
+ | 0.0106 | 8.0 | 384 | 0.1067 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
66
+ | 0.0086 | 9.0 | 432 | 0.1032 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
67
+ | 0.007 | 10.0 | 480 | 0.1188 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
68
+ | 0.0064 | 11.0 | 528 | 0.1134 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
69
+ | 0.0059 | 12.0 | 576 | 0.1122 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
70
+ | 0.0055 | 13.0 | 624 | 0.1109 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
71
+ | 0.0053 | 14.0 | 672 | 0.1125 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
72
+ | 0.0052 | 15.0 | 720 | 0.1121 | 0.9762 | 0.9737 | 0.9736 | 0.9737 |
73
+
74
+
75
  ### Framework versions
76
 
77
  - Transformers 4.36.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:44bdcde6597ae303b617bee35f310af8eaa3b973406e1c84f1c5e33c5dc46e9f
3
  size 263150840
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:23a5643049aca9dca0a9756764cfb7d6f39ce033b1d08cd04a4e1b0dc4c49978
3
  size 263150840
runs/Jan21_11-29-58_1a3a957c2842/events.out.tfevents.1705836601.1a3a957c2842.8067.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec2a3b5a60f5c3fb99bd1dad0214a4df42c2ff478c03643dc08d81433685ab0f
3
+ size 14261
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:56b4b67f6c5e696a88408a4c3d08056ef5660ce1e9333c2009b8bbaf6558042d
3
- size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4a7f306b7f3ec24c7f56a4a5a046c71adc46dd964de720e562953f73a7b333a
3
+ size 4728