Senthil commited on
Commit
308fa5a
·
verified ·
1 Parent(s): 182f57a

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: senthil2002/local_distilbert_finetune_model
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # local_distilbert_finetune_model
20
 
21
- This model is a fine-tuned version of [senthil2002/local_distilbert_finetune_model](https://huggingface.co/senthil2002/local_distilbert_finetune_model) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.0717
24
- - Precision: 0.9293
25
- - Recall: 0.9394
26
- - F1: 0.9343
27
- - Accuracy: 0.9845
28
 
29
  ## Model description
30
 
@@ -49,20 +49,67 @@ The following hyperparameters were used during training:
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
- - num_epochs: 3
53
 
54
  ### Training results
55
 
56
- | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
57
- |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
58
- | 0.0293 | 1.0 | 878 | 0.0660 | 0.9257 | 0.9334 | 0.9295 | 0.9836 |
59
- | 0.013 | 2.0 | 1756 | 0.0727 | 0.9259 | 0.9374 | 0.9316 | 0.9837 |
60
- | 0.0098 | 3.0 | 2634 | 0.0717 | 0.9293 | 0.9394 | 0.9343 | 0.9845 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
61
 
62
 
63
  ### Framework versions
64
 
65
- - Transformers 4.37.2
66
  - Pytorch 2.1.0+cu121
67
  - Datasets 2.17.1
68
  - Tokenizers 0.15.2
 
1
  ---
2
  license: apache-2.0
3
+ base_model: distilbert-base-uncased
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
18
 
19
  # local_distilbert_finetune_model
20
 
21
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.2154
24
+ - Precision: 0.5
25
+ - Recall: 0.5
26
+ - F1: 0.5
27
+ - Accuracy: 0.9231
28
 
29
  ## Model description
30
 
 
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
+ - num_epochs: 50
53
 
54
  ### Training results
55
 
56
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
57
+ |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:---:|:--------:|
58
+ | No log | 1.0 | 1 | 0.8718 | 0.0 | 0.0 | 0.0 | 0.7692 |
59
+ | No log | 2.0 | 2 | 0.8088 | 0.0 | 0.0 | 0.0 | 0.7692 |
60
+ | No log | 3.0 | 3 | 0.7507 | 0.0 | 0.0 | 0.0 | 0.7692 |
61
+ | No log | 4.0 | 4 | 0.6957 | 0.0 | 0.0 | 0.0 | 0.7692 |
62
+ | No log | 5.0 | 5 | 0.6445 | 0.0 | 0.0 | 0.0 | 0.7692 |
63
+ | No log | 6.0 | 6 | 0.5982 | 0.0 | 0.0 | 0.0 | 0.7692 |
64
+ | No log | 7.0 | 7 | 0.5559 | 0.0 | 0.0 | 0.0 | 0.7692 |
65
+ | No log | 8.0 | 8 | 0.5177 | 0.0 | 0.0 | 0.0 | 0.7692 |
66
+ | No log | 9.0 | 9 | 0.4832 | 0.0 | 0.0 | 0.0 | 0.8462 |
67
+ | No log | 10.0 | 10 | 0.4523 | 0.5 | 0.5 | 0.5 | 0.9231 |
68
+ | No log | 11.0 | 11 | 0.4243 | 0.5 | 0.5 | 0.5 | 0.9231 |
69
+ | No log | 12.0 | 12 | 0.3996 | 0.5 | 0.5 | 0.5 | 0.9231 |
70
+ | No log | 13.0 | 13 | 0.3778 | 0.5 | 0.5 | 0.5 | 0.9231 |
71
+ | No log | 14.0 | 14 | 0.3592 | 0.5 | 0.5 | 0.5 | 0.9231 |
72
+ | No log | 15.0 | 15 | 0.3428 | 0.5 | 0.5 | 0.5 | 0.9231 |
73
+ | No log | 16.0 | 16 | 0.3293 | 0.5 | 0.5 | 0.5 | 0.9231 |
74
+ | No log | 17.0 | 17 | 0.3180 | 0.5 | 0.5 | 0.5 | 0.9231 |
75
+ | No log | 18.0 | 18 | 0.3087 | 0.5 | 0.5 | 0.5 | 0.9231 |
76
+ | No log | 19.0 | 19 | 0.3003 | 0.5 | 0.5 | 0.5 | 0.9231 |
77
+ | No log | 20.0 | 20 | 0.2933 | 0.5 | 0.5 | 0.5 | 0.9231 |
78
+ | No log | 21.0 | 21 | 0.2865 | 0.5 | 0.5 | 0.5 | 0.9231 |
79
+ | No log | 22.0 | 22 | 0.2807 | 0.5 | 0.5 | 0.5 | 0.9231 |
80
+ | No log | 23.0 | 23 | 0.2755 | 0.5 | 0.5 | 0.5 | 0.9231 |
81
+ | No log | 24.0 | 24 | 0.2689 | 0.5 | 0.5 | 0.5 | 0.9231 |
82
+ | No log | 25.0 | 25 | 0.2628 | 0.5 | 0.5 | 0.5 | 0.9231 |
83
+ | No log | 26.0 | 26 | 0.2573 | 0.5 | 0.5 | 0.5 | 0.9231 |
84
+ | No log | 27.0 | 27 | 0.2528 | 0.5 | 0.5 | 0.5 | 0.9231 |
85
+ | No log | 28.0 | 28 | 0.2487 | 0.5 | 0.5 | 0.5 | 0.9231 |
86
+ | No log | 29.0 | 29 | 0.2451 | 0.5 | 0.5 | 0.5 | 0.9231 |
87
+ | No log | 30.0 | 30 | 0.2420 | 0.5 | 0.5 | 0.5 | 0.9231 |
88
+ | No log | 31.0 | 31 | 0.2392 | 0.5 | 0.5 | 0.5 | 0.9231 |
89
+ | No log | 32.0 | 32 | 0.2363 | 0.5 | 0.5 | 0.5 | 0.9231 |
90
+ | No log | 33.0 | 33 | 0.2335 | 0.5 | 0.5 | 0.5 | 0.9231 |
91
+ | No log | 34.0 | 34 | 0.2310 | 0.5 | 0.5 | 0.5 | 0.9231 |
92
+ | No log | 35.0 | 35 | 0.2288 | 0.5 | 0.5 | 0.5 | 0.9231 |
93
+ | No log | 36.0 | 36 | 0.2267 | 0.5 | 0.5 | 0.5 | 0.9231 |
94
+ | No log | 37.0 | 37 | 0.2247 | 0.5 | 0.5 | 0.5 | 0.9231 |
95
+ | No log | 38.0 | 38 | 0.2230 | 0.5 | 0.5 | 0.5 | 0.9231 |
96
+ | No log | 39.0 | 39 | 0.2216 | 0.5 | 0.5 | 0.5 | 0.9231 |
97
+ | No log | 40.0 | 40 | 0.2205 | 0.5 | 0.5 | 0.5 | 0.9231 |
98
+ | No log | 41.0 | 41 | 0.2196 | 0.5 | 0.5 | 0.5 | 0.9231 |
99
+ | No log | 42.0 | 42 | 0.2187 | 0.5 | 0.5 | 0.5 | 0.9231 |
100
+ | No log | 43.0 | 43 | 0.2180 | 0.5 | 0.5 | 0.5 | 0.9231 |
101
+ | No log | 44.0 | 44 | 0.2173 | 0.5 | 0.5 | 0.5 | 0.9231 |
102
+ | No log | 45.0 | 45 | 0.2168 | 0.5 | 0.5 | 0.5 | 0.9231 |
103
+ | No log | 46.0 | 46 | 0.2163 | 0.5 | 0.5 | 0.5 | 0.9231 |
104
+ | No log | 47.0 | 47 | 0.2159 | 0.5 | 0.5 | 0.5 | 0.9231 |
105
+ | No log | 48.0 | 48 | 0.2157 | 0.5 | 0.5 | 0.5 | 0.9231 |
106
+ | No log | 49.0 | 49 | 0.2155 | 0.5 | 0.5 | 0.5 | 0.9231 |
107
+ | No log | 50.0 | 50 | 0.2154 | 0.5 | 0.5 | 0.5 | 0.9231 |
108
 
109
 
110
  ### Framework versions
111
 
112
+ - Transformers 4.38.1
113
  - Pytorch 2.1.0+cu121
114
  - Datasets 2.17.1
115
  - Tokenizers 0.15.2
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "senthil2002/local_distilbert_finetune_model",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForTokenClassification"
@@ -10,26 +10,14 @@
10
  "hidden_dim": 3072,
11
  "id2label": {
12
  "0": "O",
13
- "1": "B-PER",
14
- "2": "I-PER",
15
- "3": "B-ORG",
16
- "4": "I-ORG",
17
- "5": "B-LOC",
18
- "6": "I-LOC",
19
- "7": "B-MISC",
20
- "8": "I-MISC"
21
  },
22
  "initializer_range": 0.02,
23
  "label2id": {
24
- "B-LOC": 5,
25
- "B-MISC": 7,
26
- "B-ORG": 3,
27
- "B-PER": 1,
28
- "I-LOC": 6,
29
- "I-MISC": 8,
30
- "I-ORG": 4,
31
- "I-PER": 2,
32
- "O": 0
33
  },
34
  "max_position_embeddings": 512,
35
  "model_type": "distilbert",
@@ -41,6 +29,6 @@
41
  "sinusoidal_pos_embds": false,
42
  "tie_weights_": true,
43
  "torch_dtype": "float32",
44
- "transformers_version": "4.37.2",
45
  "vocab_size": 30522
46
  }
 
1
  {
2
+ "_name_or_path": "distilbert-base-uncased",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForTokenClassification"
 
10
  "hidden_dim": 3072,
11
  "id2label": {
12
  "0": "O",
13
+ "1": "customer_status",
14
+ "2": "date"
 
 
 
 
 
 
15
  },
16
  "initializer_range": 0.02,
17
  "label2id": {
18
+ "O": 0,
19
+ "customer_status": 1,
20
+ "date": 2
 
 
 
 
 
 
21
  },
22
  "max_position_embeddings": 512,
23
  "model_type": "distilbert",
 
29
  "sinusoidal_pos_embds": false,
30
  "tie_weights_": true,
31
  "torch_dtype": "float32",
32
+ "transformers_version": "4.38.1",
33
  "vocab_size": 30522
34
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:438250a36e703036a02174ee68b9ef2a0ab76762569ebc2e7f08632282a2bca7
3
- size 265491548
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2cb0f2598c82f7faa36cb827548bac50967ebf9b3a378fc10ca322645c804d69
3
+ size 265473092
runs/Feb28_12-57-00_39e85f2a5c68/events.out.tfevents.1709125183.39e85f2a5c68.499.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:78b2d81131b9703fc5427e493bc84b830bcf6a1775cd6c111ea72a82d92a13be
3
+ size 6377
runs/Feb28_12-57-00_39e85f2a5c68/events.out.tfevents.1709125190.39e85f2a5c68.499.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:965e42257451f3f7212b8818e39298cb43392fca6fe5ad713275271952a0794a
3
+ size 551
runs/Feb28_13-00-06_39e85f2a5c68/events.out.tfevents.1709125217.39e85f2a5c68.499.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7215ddb854f490f617f368a041bf15cb895d49d159bcb66abee49abdb9a79e2f
3
+ size 28139
runs/Feb28_13-00-06_39e85f2a5c68/events.out.tfevents.1709125232.39e85f2a5c68.499.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a59ac278eecaef2a762b84542a15f74b55903cb47f9b9613dd9fe35c80ed4726
3
+ size 551
special_tokens_map.json CHANGED
@@ -1,37 +1,7 @@
1
  {
2
- "cls_token": {
3
- "content": "[CLS]",
4
- "lstrip": false,
5
- "normalized": false,
6
- "rstrip": false,
7
- "single_word": false
8
- },
9
- "mask_token": {
10
- "content": "[MASK]",
11
- "lstrip": false,
12
- "normalized": false,
13
- "rstrip": false,
14
- "single_word": false
15
- },
16
- "pad_token": {
17
- "content": "[PAD]",
18
- "lstrip": false,
19
- "normalized": false,
20
- "rstrip": false,
21
- "single_word": false
22
- },
23
- "sep_token": {
24
- "content": "[SEP]",
25
- "lstrip": false,
26
- "normalized": false,
27
- "rstrip": false,
28
- "single_word": false
29
- },
30
- "unk_token": {
31
- "content": "[UNK]",
32
- "lstrip": false,
33
- "normalized": false,
34
- "rstrip": false,
35
- "single_word": false
36
- }
37
  }
 
1
  {
2
+ "cls_token": "[CLS]",
3
+ "mask_token": "[MASK]",
4
+ "pad_token": "[PAD]",
5
+ "sep_token": "[SEP]",
6
+ "unk_token": "[UNK]"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  }
tokenizer.json CHANGED
@@ -1,6 +1,11 @@
1
  {
2
  "version": "1.0",
3
- "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 512,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
  "padding": null,
10
  "added_tokens": [
11
  {
tokenizer_config.json CHANGED
@@ -45,15 +45,11 @@
45
  "cls_token": "[CLS]",
46
  "do_lower_case": true,
47
  "mask_token": "[MASK]",
48
- "max_length": 512,
49
  "model_max_length": 512,
50
  "pad_token": "[PAD]",
51
  "sep_token": "[SEP]",
52
- "stride": 0,
53
  "strip_accents": null,
54
  "tokenize_chinese_chars": true,
55
  "tokenizer_class": "DistilBertTokenizer",
56
- "truncation_side": "right",
57
- "truncation_strategy": "longest_first",
58
  "unk_token": "[UNK]"
59
  }
 
45
  "cls_token": "[CLS]",
46
  "do_lower_case": true,
47
  "mask_token": "[MASK]",
 
48
  "model_max_length": 512,
49
  "pad_token": "[PAD]",
50
  "sep_token": "[SEP]",
 
51
  "strip_accents": null,
52
  "tokenize_chinese_chars": true,
53
  "tokenizer_class": "DistilBertTokenizer",
 
 
54
  "unk_token": "[UNK]"
55
  }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3bf62bb2dcbae6cdfa16178cf8b7328074cfc57d338671f2f6c8c9d38c0b6d36
3
- size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ff86dd6f29b9fcb51db62d51599f5afa7f08feacc8a99d48083d9949f4cf6dd
3
+ size 4920