Souvik123 committed
Commit a2f9035 (1 parent: 6457c76)

End of training

README.md ADDED
@@ -0,0 +1,205 @@
+ ---
+ license: cc-by-4.0
+ base_model: deepset/roberta-base-squad2
+ tags:
+ - generated_from_trainer
+ model-index:
+ - name: bankstatementmodelver8
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # bankstatementmodelver8
+
+ This model is a fine-tuned version of [deepset/roberta-base-squad2](https://huggingface.co/deepset/roberta-base-squad2) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.0
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 16
+ - eval_batch_size: 11
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 150
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:------:|:---------------:|
+ | 0.1067 | 1.0 | 981 | 0.0322 |
+ | 0.0357 | 2.0 | 1962 | 0.0228 |
+ | 0.0239 | 3.0 | 2943 | 0.0172 |
+ | 0.0253 | 4.0 | 3924 | 0.0158 |
+ | 0.0206 | 5.0 | 4905 | 0.0127 |
+ | 0.0168 | 6.0 | 5886 | 0.0160 |
+ | 0.0158 | 7.0 | 6867 | 0.0154 |
+ | 0.0169 | 8.0 | 7848 | 0.0134 |
+ | 0.0162 | 9.0 | 8829 | 0.0081 |
+ | 0.0162 | 10.0 | 9810 | 0.0101 |
+ | 0.0126 | 11.0 | 10791 | 0.0082 |
+ | 0.0128 | 12.0 | 11772 | 0.0080 |
+ | 0.013 | 13.0 | 12753 | 0.0119 |
+ | 0.0117 | 14.0 | 13734 | 0.0105 |
+ | 0.0117 | 15.0 | 14715 | 0.0106 |
+ | 0.0112 | 16.0 | 15696 | 0.0100 |
+ | 0.0103 | 17.0 | 16677 | 0.0078 |
+ | 0.0075 | 18.0 | 17658 | 0.0060 |
+ | 0.0057 | 19.0 | 18639 | 0.0088 |
+ | 0.0077 | 20.0 | 19620 | 0.0076 |
+ | 0.006 | 21.0 | 20601 | 0.0149 |
+ | 0.0065 | 22.0 | 21582 | 0.0062 |
+ | 0.0093 | 23.0 | 22563 | 0.0081 |
+ | 0.0045 | 24.0 | 23544 | 0.0054 |
+ | 0.005 | 25.0 | 24525 | 0.0048 |
+ | 0.0068 | 26.0 | 25506 | 0.0122 |
+ | 0.0063 | 27.0 | 26487 | 0.0038 |
+ | 0.0043 | 28.0 | 27468 | 0.0063 |
+ | 0.0055 | 29.0 | 28449 | 0.0096 |
+ | 0.0034 | 30.0 | 29430 | 0.0045 |
+ | 0.0033 | 31.0 | 30411 | 0.0025 |
+ | 0.0027 | 32.0 | 31392 | 0.0047 |
+ | 0.002 | 33.0 | 32373 | 0.0053 |
+ | 0.0055 | 34.0 | 33354 | 0.0026 |
+ | 0.0044 | 35.0 | 34335 | 0.0010 |
+ | 0.0047 | 36.0 | 35316 | 0.0008 |
+ | 0.0019 | 37.0 | 36297 | 0.0011 |
+ | 0.0006 | 38.0 | 37278 | 0.0030 |
+ | 0.0015 | 39.0 | 38259 | 0.0010 |
+ | 0.0005 | 40.0 | 39240 | 0.0008 |
+ | 0.0018 | 41.0 | 40221 | 0.0001 |
+ | 0.0026 | 42.0 | 41202 | 0.0017 |
+ | 0.0 | 43.0 | 42183 | 0.0002 |
+ | 0.002 | 44.0 | 43164 | 0.0009 |
+ | 0.0012 | 45.0 | 44145 | 0.0000 |
+ | 0.0018 | 46.0 | 45126 | 0.0110 |
+ | 0.0006 | 47.0 | 46107 | 0.0018 |
+ | 0.0016 | 48.0 | 47088 | 0.0000 |
+ | 0.0017 | 49.0 | 48069 | 0.0000 |
+ | 0.0014 | 50.0 | 49050 | 0.0000 |
+ | 0.0001 | 51.0 | 50031 | 0.0000 |
+ | 0.0018 | 52.0 | 51012 | 0.0020 |
+ | 0.0001 | 53.0 | 51993 | 0.0001 |
+ | 0.0009 | 54.0 | 52974 | 0.0040 |
+ | 0.0021 | 55.0 | 53955 | 0.0000 |
+ | 0.0018 | 56.0 | 54936 | 0.0000 |
+ | 0.0005 | 57.0 | 55917 | 0.0000 |
+ | 0.0 | 58.0 | 56898 | 0.0000 |
+ | 0.0014 | 59.0 | 57879 | 0.0000 |
+ | 0.0008 | 60.0 | 58860 | 0.0000 |
+ | 0.0002 | 61.0 | 59841 | 0.0000 |
+ | 0.0018 | 62.0 | 60822 | 0.0000 |
+ | 0.0016 | 63.0 | 61803 | 0.0003 |
+ | 0.0 | 64.0 | 62784 | 0.0000 |
+ | 0.0001 | 65.0 | 63765 | 0.0000 |
+ | 0.0014 | 66.0 | 64746 | 0.0004 |
+ | 0.0006 | 67.0 | 65727 | 0.0000 |
+ | 0.0 | 68.0 | 66708 | 0.0000 |
+ | 0.0 | 69.0 | 67689 | 0.0000 |
+ | 0.0002 | 70.0 | 68670 | 0.0000 |
+ | 0.0001 | 71.0 | 69651 | 0.0000 |
+ | 0.0 | 72.0 | 70632 | 0.0000 |
+ | 0.0005 | 73.0 | 71613 | 0.0000 |
+ | 0.0009 | 74.0 | 72594 | 0.0000 |
+ | 0.0007 | 75.0 | 73575 | 0.0000 |
+ | 0.0 | 76.0 | 74556 | 0.0005 |
+ | 0.0 | 77.0 | 75537 | 0.0000 |
+ | 0.0 | 78.0 | 76518 | 0.0000 |
+ | 0.0004 | 79.0 | 77499 | 0.0000 |
+ | 0.0001 | 80.0 | 78480 | 0.0000 |
+ | 0.0 | 81.0 | 79461 | 0.0000 |
+ | 0.0013 | 82.0 | 80442 | 0.0000 |
+ | 0.0 | 83.0 | 81423 | 0.0000 |
+ | 0.0 | 84.0 | 82404 | 0.0000 |
+ | 0.0 | 85.0 | 83385 | 0.0000 |
+ | 0.0001 | 86.0 | 84366 | 0.0000 |
+ | 0.001 | 87.0 | 85347 | 0.0000 |
+ | 0.0 | 88.0 | 86328 | 0.0000 |
+ | 0.0001 | 89.0 | 87309 | 0.0000 |
+ | 0.0004 | 90.0 | 88290 | 0.0000 |
+ | 0.0 | 91.0 | 89271 | 0.0000 |
+ | 0.0 | 92.0 | 90252 | 0.0000 |
+ | 0.0 | 93.0 | 91233 | 0.0000 |
+ | 0.001 | 94.0 | 92214 | 0.0000 |
+ | 0.0 | 95.0 | 93195 | 0.0000 |
+ | 0.0 | 96.0 | 94176 | 0.0000 |
+ | 0.0 | 97.0 | 95157 | 0.0000 |
+ | 0.0007 | 98.0 | 96138 | 0.0000 |
+ | 0.0 | 99.0 | 97119 | 0.0000 |
+ | 0.0 | 100.0 | 98100 | 0.0000 |
+ | 0.0 | 101.0 | 99081 | 0.0000 |
+ | 0.0 | 102.0 | 100062 | 0.0000 |
+ | 0.0 | 103.0 | 101043 | 0.0 |
+ | 0.0 | 104.0 | 102024 | 0.0000 |
+ | 0.0 | 105.0 | 103005 | 0.0000 |
+ | 0.0 | 106.0 | 103986 | 0.0000 |
+ | 0.0 | 107.0 | 104967 | 0.0 |
+ | 0.0 | 108.0 | 105948 | 0.0000 |
+ | 0.0006 | 109.0 | 106929 | 0.0000 |
+ | 0.0 | 110.0 | 107910 | 0.0000 |
+ | 0.0 | 111.0 | 108891 | 0.0 |
+ | 0.0 | 112.0 | 109872 | 0.0 |
+ | 0.0 | 113.0 | 110853 | 0.0 |
+ | 0.0 | 114.0 | 111834 | 0.0 |
+ | 0.0 | 115.0 | 112815 | 0.0000 |
+ | 0.0 | 116.0 | 113796 | 0.0000 |
+ | 0.0 | 117.0 | 114777 | 0.0000 |
+ | 0.0 | 118.0 | 115758 | 0.0000 |
+ | 0.0 | 119.0 | 116739 | 0.0000 |
+ | 0.0 | 120.0 | 117720 | 0.0 |
+ | 0.0 | 121.0 | 118701 | 0.0 |
+ | 0.0 | 122.0 | 119682 | 0.0 |
+ | 0.0 | 123.0 | 120663 | 0.0 |
+ | 0.0013 | 124.0 | 121644 | 0.0000 |
+ | 0.0 | 125.0 | 122625 | 0.0000 |
+ | 0.0 | 126.0 | 123606 | 0.0000 |
+ | 0.0 | 127.0 | 124587 | 0.0000 |
+ | 0.0 | 128.0 | 125568 | 0.0000 |
+ | 0.0 | 129.0 | 126549 | 0.0000 |
+ | 0.0 | 130.0 | 127530 | 0.0 |
+ | 0.0 | 131.0 | 128511 | 0.0 |
+ | 0.0 | 132.0 | 129492 | 0.0 |
+ | 0.0 | 133.0 | 130473 | 0.0 |
+ | 0.0 | 134.0 | 131454 | 0.0 |
+ | 0.0 | 135.0 | 132435 | 0.0 |
+ | 0.0 | 136.0 | 133416 | 0.0 |
+ | 0.0 | 137.0 | 134397 | 0.0 |
+ | 0.0 | 138.0 | 135378 | 0.0 |
+ | 0.0 | 139.0 | 136359 | 0.0 |
+ | 0.0 | 140.0 | 137340 | 0.0 |
+ | 0.0 | 141.0 | 138321 | 0.0 |
+ | 0.0 | 142.0 | 139302 | 0.0 |
+ | 0.0 | 143.0 | 140283 | 0.0 |
+ | 0.0 | 144.0 | 141264 | 0.0 |
+ | 0.0 | 145.0 | 142245 | 0.0 |
+ | 0.0 | 146.0 | 143226 | 0.0 |
+ | 0.0 | 147.0 | 144207 | 0.0 |
+ | 0.0 | 148.0 | 145188 | 0.0 |
+ | 0.0 | 149.0 | 146169 | 0.0 |
+ | 0.0 | 150.0 | 147150 | 0.0 |
+
+
+ ### Framework versions
+
+ - Transformers 4.33.2
+ - Pytorch 2.0.1+cu118
+ - Tokenizers 0.13.3
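
The card above carries no usage snippet. A minimal inference sketch, assuming the checkpoint is published under the hypothetical repo id `Souvik123/bankstatementmodelver8` (substitute the real Hub id or a local checkpoint directory); like the deepset/roberta-base-squad2 base model, it is used for extractive question answering:

```python
from transformers import pipeline

# Hypothetical repo id; replace with the actual Hub id or a local path.
model_id = "Souvik123/bankstatementmodelver8"

# Same task as the base model: extractive question answering.
qa = pipeline("question-answering", model=model_id, tokenizer=model_id)

result = qa(
    question="What is the closing balance?",
    context=(
        "Statement period 01/01/2023 to 31/01/2023. "
        "Opening balance 1,200.00. Closing balance 1,540.25."
    ),
)
print(result)  # {'score': ..., 'start': ..., 'end': ..., 'answer': ...}
```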
config.json ADDED
@@ -0,0 +1,30 @@
+ {
+ "_name_or_path": "deepset/roberta-base-squad2",
+ "architectures": [
+ "RobertaForQuestionAnswering"
+ ],
+ "attention_probs_dropout_prob": 0.1,
+ "bos_token_id": 0,
+ "classifier_dropout": null,
+ "eos_token_id": 2,
+ "gradient_checkpointing": false,
+ "hidden_act": "gelu",
+ "hidden_dropout_prob": 0.1,
+ "hidden_size": 768,
+ "initializer_range": 0.02,
+ "intermediate_size": 3072,
+ "language": "english",
+ "layer_norm_eps": 1e-05,
+ "max_position_embeddings": 514,
+ "model_type": "roberta",
+ "name": "Roberta",
+ "num_attention_heads": 12,
+ "num_hidden_layers": 12,
+ "pad_token_id": 1,
+ "position_embedding_type": "absolute",
+ "torch_dtype": "float32",
+ "transformers_version": "4.33.2",
+ "type_vocab_size": 1,
+ "use_cache": true,
+ "vocab_size": 50265
+ }
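
The config above declares a `RobertaForQuestionAnswering` architecture (12 layers, 12 heads, hidden size 768, float32). A short sketch of instantiating the model from such a config, again assuming the hypothetical repo id used earlier:

```python
from transformers import AutoConfig, AutoModelForQuestionAnswering

# Hypothetical repo id; any directory containing the config.json above also works.
model_id = "Souvik123/bankstatementmodelver8"

config = AutoConfig.from_pretrained(model_id)
print(config.model_type, config.num_hidden_layers,
      config.num_attention_heads, config.hidden_size)  # roberta 12 12 768

model = AutoModelForQuestionAnswering.from_pretrained(model_id, config=config)
n_params = sum(p.numel() for p in model.parameters())
# Roughly 124M parameters in float32, consistent with the ~496 MB pytorch_model.bin below.
print(f"{n_params / 1e6:.1f}M parameters")
```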
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6d315c596261f94d97ec8c7f1ad98773a503c9126b2900c496d900fa851f4873
+ size 496294633
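
The entry above is a Git LFS pointer, not the weights themselves: `oid` is the SHA-256 of the actual ~496 MB checkpoint. A small sketch (assuming the file has already been fetched locally, e.g. with `git lfs pull` or `huggingface_hub`) for checking a downloaded copy against that digest:

```python
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream the file in 1 MiB chunks so large checkpoints fit in constant memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

# Expected digest copied from the LFS pointer above.
expected = "6d315c596261f94d97ec8c7f1ad98773a503c9126b2900c496d900fa851f4873"
assert sha256_of("pytorch_model.bin") == expected, "checkpoint does not match the LFS pointer"
```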
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
+ {
+ "bos_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "cls_token": {
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "eos_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "mask_token": {
+ "content": "<mask>",
+ "lstrip": true,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "pad_token": {
+ "content": "<pad>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "sep_token": {
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "unk_token": {
+ "content": "<unk>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,66 @@
+ {
+ "add_prefix_space": false,
+ "bos_token": {
+ "__type": "AddedToken",
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "clean_up_tokenization_spaces": true,
+ "cls_token": {
+ "__type": "AddedToken",
+ "content": "<s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "do_lower_case": false,
+ "eos_token": {
+ "__type": "AddedToken",
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "errors": "replace",
+ "full_tokenizer_file": null,
+ "mask_token": {
+ "__type": "AddedToken",
+ "content": "<mask>",
+ "lstrip": true,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "model_max_length": 512,
+ "pad_token": {
+ "__type": "AddedToken",
+ "content": "<pad>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "sep_token": {
+ "__type": "AddedToken",
+ "content": "</s>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ },
+ "tokenizer_class": "RobertaTokenizer",
+ "trim_offsets": true,
+ "unk_token": {
+ "__type": "AddedToken",
+ "content": "<unk>",
+ "lstrip": false,
+ "normalized": true,
+ "rstrip": false,
+ "single_word": false
+ }
+ }
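
The tokenizer files above configure a standard `RobertaTokenizer` (byte-level BPE, `model_max_length` of 512, `<s>`/`</s>` special tokens). A brief sketch of how a question/context pair is encoded for this QA model, again assuming the hypothetical repo id:

```python
from transformers import AutoTokenizer

# Hypothetical repo id; a local directory with the tokenizer files above also works.
tokenizer = AutoTokenizer.from_pretrained("Souvik123/bankstatementmodelver8")

enc = tokenizer(
    "What is the closing balance?",                      # question
    "Opening balance 1,200.00. Closing balance 1,540.25.",  # context
    truncation="only_second",                # truncate the context, never the question
    max_length=tokenizer.model_max_length,   # 512, from tokenizer_config.json
)
# RoBERTa encodes pairs as: <s> question </s></s> context </s>
print(tokenizer.decode(enc["input_ids"]))
```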
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d1ce61e8822306e187b3b5d9f0c6b09218dd1956121a5f9073c269464eb6e0ac
+ size 4027
vocab.json ADDED
The diff for this file is too large to render. See raw diff