badokorach commited on
Commit
cbcb30c
1 Parent(s): 8bee362

Model save

Browse files
Files changed (2) hide show
  1. README.md +19 -9
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [distilbert-base-cased-distilled-squad](https://huggingface.co/distilbert-base-cased-distilled-squad) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.5134
19
 
20
  ## Model description
21
 
@@ -35,22 +35,32 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
- - train_batch_size: 4
39
- - eval_batch_size: 4
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 5
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | No log | 1.0 | 232 | 1.9711 |
50
- | No log | 2.0 | 464 | 1.9990 |
51
- | 1.843 | 3.0 | 696 | 2.1236 |
52
- | 1.843 | 4.0 | 928 | 2.3468 |
53
- | 0.9414 | 5.0 | 1160 | 2.5134 |
 
 
 
 
 
 
 
 
 
 
54
 
55
 
56
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [distilbert-base-cased-distilled-squad](https://huggingface.co/distilbert-base-cased-distilled-squad) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.5287
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
+ - train_batch_size: 8
39
+ - eval_batch_size: 8
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 15
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 116 | 1.9383 |
50
+ | No log | 2.0 | 232 | 1.9901 |
51
+ | No log | 3.0 | 348 | 2.0780 |
52
+ | No log | 4.0 | 464 | 2.2501 |
53
+ | 1.4804 | 5.0 | 580 | 2.4190 |
54
+ | 1.4804 | 6.0 | 696 | 2.5925 |
55
+ | 1.4804 | 7.0 | 812 | 2.7649 |
56
+ | 1.4804 | 8.0 | 928 | 2.9029 |
57
+ | 0.5119 | 9.0 | 1044 | 3.0296 |
58
+ | 0.5119 | 10.0 | 1160 | 3.1669 |
59
+ | 0.5119 | 11.0 | 1276 | 3.3412 |
60
+ | 0.5119 | 12.0 | 1392 | 3.3165 |
61
+ | 0.2287 | 13.0 | 1508 | 3.4167 |
62
+ | 0.2287 | 14.0 | 1624 | 3.5039 |
63
+ | 0.2287 | 15.0 | 1740 | 3.5287 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:337d96e54f3579a7def50baf6b6fa70e1a20fb11a628968783943292194bde8d
3
  size 260782152
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:655b2c4c1e1fb0eb57e7f106f98def874bab70c67b9353e65f729f788842aa97
3
  size 260782152