bradmin commited on
Commit
abb6ce9
·
1 Parent(s): c7b98c8

Model save

Browse files
README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [EleutherAI/polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.6373
21
  - Accuracy: 0.0
22
 
23
  ## Model description
@@ -38,11 +38,11 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 9e-06
41
- - train_batch_size: 8
42
- - eval_batch_size: 8
43
  - seed: 2023
44
  - gradient_accumulation_steps: 8
45
- - total_train_batch_size: 64
46
  - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
47
  - lr_scheduler_type: cosine
48
  - num_epochs: 1
@@ -51,9 +51,13 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
54
- | 0.6248 | 0.26 | 500 | 0.6433 | 0.0 |
55
- | 0.6114 | 0.52 | 1000 | 0.6330 | 0.0 |
56
- | 0.5908 | 0.79 | 1500 | 0.6373 | 0.0 |
 
 
 
 
57
 
58
 
59
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [EleutherAI/polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.0084
21
  - Accuracy: 0.0
22
 
23
  ## Model description
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 9e-06
41
+ - train_batch_size: 4
42
+ - eval_batch_size: 4
43
  - seed: 2023
44
  - gradient_accumulation_steps: 8
45
+ - total_train_batch_size: 32
46
  - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
47
  - lr_scheduler_type: cosine
48
  - num_epochs: 1
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
54
+ | 0.1952 | 0.13 | 100 | 0.0789 | 0.0 |
55
+ | 0.0753 | 0.25 | 200 | 0.0183 | 0.0 |
56
+ | 0.0789 | 0.38 | 300 | 0.0273 | 0.0 |
57
+ | 0.1131 | 0.5 | 400 | 0.0146 | 0.0 |
58
+ | 0.0998 | 0.63 | 500 | 0.0134 | 0.0 |
59
+ | 0.0686 | 0.75 | 600 | 0.0079 | 0.0 |
60
+ | 0.069 | 0.88 | 700 | 0.0084 | 0.0 |
61
 
62
 
63
  ### Framework versions
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fa921308589e7411f3692babd0727171ac426365299c390d992563c8fc4e9b88
3
  size 4946584368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a2e26edc880e9132f173d5853794f0bed677e08fea90c641c4da8395f3f7dd57
3
  size 4946584368
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5bc1d25dfc6b97ebba8dade0bc0ce31f8e3b56cd0eb8a485f13d32b94faa5400
3
  size 134284024
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41fda3d8403e5006bde60680c3b44f3c1fecb62561722249f0dff0e3744cbaf0
3
  size 134284024
runs/Nov03_15-00-04_job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0/events.out.tfevents.1699023610.job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0.1.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e352c03facfe9491365d261997e0541256b41d0c824afd834e57a21b01a47213
3
- size 17643
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13f2981fc4a4974c05956611b614f7672ce7951466e4126871cd5e46cc93ee7e
3
+ size 19410