Model save

Browse files

Files changed (4) hide show

README.md +11 -7
model-00001-of-00002.safetensors +1 -1
model-00002-of-00002.safetensors +1 -1
runs/Nov03_15-00-04_job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0/events.out.tfevents.1699023610.job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0.1.0 +2 -2

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [EleutherAI/polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6373
 - Accuracy: 0.0
 ## Model description
@@ -38,11 +38,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 9e-06
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 2023
 - gradient_accumulation_steps: 8
-- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - num_epochs: 1
@@ -51,9 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.6248        | 0.26  | 500  | 0.6433          | 0.0      |
-| 0.6114        | 0.52  | 1000 | 0.6330          | 0.0      |
-| 0.5908        | 0.79  | 1500 | 0.6373          | 0.0      |
 ### Framework versions

 This model is a fine-tuned version of [EleutherAI/polyglot-ko-1.3b](https://huggingface.co/EleutherAI/polyglot-ko-1.3b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0084
 - Accuracy: 0.0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 9e-06
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 2023
 - gradient_accumulation_steps: 8
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.95) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - num_epochs: 1
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.1952        | 0.13  | 100  | 0.0789          | 0.0      |
+| 0.0753        | 0.25  | 200  | 0.0183          | 0.0      |
+| 0.0789        | 0.38  | 300  | 0.0273          | 0.0      |
+| 0.1131        | 0.5   | 400  | 0.0146          | 0.0      |
+| 0.0998        | 0.63  | 500  | 0.0134          | 0.0      |
+| 0.0686        | 0.75  | 600  | 0.0079          | 0.0      |
+| 0.069         | 0.88  | 700  | 0.0084          | 0.0      |
 ### Framework versions

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fa921308589e7411f3692babd0727171ac426365299c390d992563c8fc4e9b88
 size 4946584368

 version https://git-lfs.github.com/spec/v1
+oid sha256:a2e26edc880e9132f173d5853794f0bed677e08fea90c641c4da8395f3f7dd57
 size 4946584368

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5bc1d25dfc6b97ebba8dade0bc0ce31f8e3b56cd0eb8a485f13d32b94faa5400
 size 134284024

 version https://git-lfs.github.com/spec/v1
+oid sha256:41fda3d8403e5006bde60680c3b44f3c1fecb62561722249f0dff0e3744cbaf0
 size 134284024

runs/Nov03_15-00-04_job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0/events.out.tfevents.1699023610.job-run-7635edf5-214b-40d7-8da7-225d9122921d-master-0.1.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e352c03facfe9491365d261997e0541256b41d0c824afd834e57a21b01a47213
-size 17643

 version https://git-lfs.github.com/spec/v1
+oid sha256:13f2981fc4a4974c05956611b614f7672ce7951466e4126871cd5e46cc93ee7e
+size 19410