Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,6 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
1 |
+
Trained on 100k dumped messages from the 'chan' todd proxy. I could not dedupe the dataset but it has had serious
|
2 |
+
effect on the llama7b I used. Calls me master a whole bunch more now.
|
3 |
+
|
4 |
+
Content isn't SFW so be aware. Trained in 4-bit for 3 epochs, I think it overfit and really needed just 2.
|
5 |
+
|
6 |
+
Tested in 4-bit and FP16 on plain HF llama-7b, maybe it works on derivative models of the same beaks.
|