REILX
/

Llama-3-8B-Instruct-750Mb-lora

text-generation-inference

Model card Files Files and versions Community

REILX commited on May 11

Commit

639e5a2

•

1 Parent(s): 048f179

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -43,4 +43,20 @@ https://github.com/hiyouga/LLaMA-Factory
 - https://huggingface.co/datasets/LooksJuicy/ruozhiba
 - https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k
 - https://huggingface.co/datasets/alexl83/AlpacaDataCleaned
-- https://huggingface.co/datasets/Sao10K/Claude-3-Opus-Instruct-5K

 - https://huggingface.co/datasets/LooksJuicy/ruozhiba
 - https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k
 - https://huggingface.co/datasets/alexl83/AlpacaDataCleaned
+- https://huggingface.co/datasets/Sao10K/Claude-3-Opus-Instruct-5K
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 4
+- eval_batch_size: 8
+- seed: 42
+- distributed_type: multi-GPU
+- num_devices: 8
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 128
+- total_eval_batch_size: 64
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 300
+- num_epochs: 1.0