Update README.md
README.md
base_model: EleutherAI/polyglot-ko-12.8b
---

This model is an instruct-tuned polyglot-ko-12.8b, trained on 10% of the [KULLM, OIG, KoAlpaca] instruction datasets (29 training steps).
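A minimal usage sketch with `transformers` (the repo id below is a placeholder for this model's actual Hub path, and the KULLM/KoAlpaca-style `### 질문:` / `### 답변:` (Question/Answer) prompt format is an assumption, not a confirmed detail of this model):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id -- replace with this model's actual Hub path.
model_id = "polyglot-ko-12.8b-instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # ~26 GB in fp16 for 12.8B parameters
    device_map="auto",          # requires accelerate
)

# Assumed KoAlpaca/KULLM-style prompt: "### Question: What is the capital of Korea?\n### Answer:"
prompt = "### 질문: 한국의 수도는 어디인가요?\n\n### 답변:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```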
## Training hyperparameters
- learning_rate: 5e-5
- seed: 42
- distributed_type: multi-GPU (A100 40G) + CPU offloading (512 GB)
- num_devices: 1
- train_batch_size: 4
- gradient_accumulation_steps: 16
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 2.0
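With train_batch_size 4 and 16 gradient-accumulation steps on one device, the effective batch size is 4 × 16 = 64 sequences per optimizer step. As a rough sketch, these settings map onto `transformers` `TrainingArguments` as below; the DeepSpeed ZeRO-3 offload config is an assumption about how the multi-GPU-plus-CPU-offloading setup was realized on a single A100 40G, since the actual config is not published:

```python
from transformers import TrainingArguments

# Sketch of a DeepSpeed ZeRO-3 config offloading optimizer state and parameters
# to CPU -- one plausible way to fit a 12.8B model on a single A100 40G with
# 512 GB of host RAM. The actual config used for this model is not published.
ds_config = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {"device": "cpu"},
        "offload_param": {"device": "cpu"},
    },
    "fp16": {"enabled": "auto"},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

args = TrainingArguments(
    output_dir="polyglot-ko-12.8b-instruct",  # hypothetical output path
    learning_rate=5e-5,
    seed=42,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=16,
    num_train_epochs=2.0,
    lr_scheduler_type="linear",
    fp16=True,
    deepspeed=ds_config,  # accepts a dict or a path to a JSON config file
)
# The Adam betas/epsilon listed above are the TrainingArguments defaults
# (adam_beta1=0.9, adam_beta2=0.999, adam_epsilon=1e-8), so no override is needed.
```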
## Framework versions
- Transformers 4.35.0
- PyTorch 2.0.1+cu117
- Datasets 2.14.6
- deepspeed 0.11.1
- accelerate 0.24.1