macadeliccc
committed on
Update README.md
README.md
CHANGED
@@ -21,7 +21,7 @@ SFT Training Params:
 + Batch Size: 8
 + Gradient Accumulation steps: 4
 + Dataset: teknium/OpenHermes-2.5 (200k split contains a slight bias towards rp and theory of life)
-+
++ r: 16
 + Lora Alpha: 16
 
 Training Time: 13 hours on A100
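
For readers of this hunk, here is a minimal sketch of what the listed values imply. It uses two standard relations: the effective batch size is the batch size times the gradient accumulation steps times the device count, and LoRA scales its update by `lora_alpha / r`. The `SftParams` class and its field names are hypothetical and not taken from the training script. A single device is assumed from "13 hours on A100", and the batch size is assumed to be per device.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SftParams:
    """Hypothetical container for the values listed in the README hunk."""

    batch_size: int = 8                   # "Batch Size: 8" (assumed per device)
    gradient_accumulation_steps: int = 4  # "Gradient Accumulation steps: 4"
    lora_r: int = 16                      # "r: 16", the value added by this commit
    lora_alpha: int = 16                  # "Lora Alpha: 16"
    num_devices: int = 1                  # assumption: one A100

    @property
    def effective_batch_size(self) -> int:
        # Samples contributing to each optimizer step.
        return self.batch_size * self.gradient_accumulation_steps * self.num_devices

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scaling factor applied to the low-rank update.
        return self.lora_alpha / self.lora_r


params = SftParams()
print(params.effective_batch_size)  # 32
print(params.lora_scaling)          # 1.0
```

With `r` equal to `Lora Alpha`, the scaling factor is 1.0, so the low-rank update is applied without extra amplification or damping.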