macadeliccc
/

Mistral-7B-v0.2-OpenHermes

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

macadeliccc commited on Mar 25, 2024

Commit

9253029

·

verified ·

1 Parent(s): b45bad5

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ SFT Training Params:
 + Batch Size: 8
 + Gradient Accumulation steps: 4
 + Dataset: teknium/OpenHermes-2.5 (200k split contains a slight bias towards rp and theory of life)
-+ R: 16
 + Lora Alpha: 16
 Training Time: 13 hours on A100

 + Batch Size: 8
 + Gradient Accumulation steps: 4
 + Dataset: teknium/OpenHermes-2.5 (200k split contains a slight bias towards rp and theory of life)
++ r: 16
 + Lora Alpha: 16
 Training Time: 13 hours on A100