Doctor-Shotgun committed
Commit 329077d • 1 Parent(s): c294b06
Update README.md
README.md CHANGED
@@ -15,7 +15,9 @@ Experimental model, using a limarp qlora trained at 10k ctx length (greater than
 
 Note that all modules were trained, including 'gate'. There are some reports that training the 'gate' module may not be fully functional at the moment. In cursory testing, the model appears to obey the LimaRP Alpaca prompt format correctly.
 
-Not extensively tested for quality, YMMV.
+Not extensively tested for quality, YMMV. Try a temperature of ~1.5-2 and a min-p of ~0.03-0.05, since Mixtral appears to be highly confident in its responses.
+
+[EXL2 Quants](https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-limarp-v0.1-exl2)
 
 ## Usage:
 The intended prompt format is the Alpaca instruction format of LimaRP v3:
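The hunk's trailing context ends before the template itself, so as illustration only, here is a minimal sketch of a generic Alpaca-style prompt of the kind LimaRP v3 builds on. The `### Instruction:`/`### Input:`/`### Response:` headers are the standard Alpaca markers; the persona line and messages are placeholders, and the exact LimaRP v3 template is given in the full README.

```python
# Minimal sketch of an Alpaca-style prompt; field contents are placeholders,
# not the exact LimaRP v3 template (see the full README for that).
char, user = "Character", "User"
prompt = (
    "### Instruction:\n"
    f"Play the role of {char} in a roleplaying chat with {user}.\n\n"
    "### Input:\n"
    f"{user}: Hello there!\n\n"
    "### Response:\n"
    f"{char}:"
)
```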
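The updated line in the diff recommends a high temperature (~1.5-2) reined in by a small min-p floor (~0.03-0.05). Below is a minimal sketch of applying those settings with Hugging Face transformers; the repo id is an assumption inferred from the EXL2 link (base weights without the `-exl2` suffix), and `min_p` support requires a recent transformers release.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed base repo id, inferred from the EXL2 quant link above.
model_id = "Doctor-Shotgun/Mixtral-8x7B-Instruct-limarp-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Placeholder prompt; in practice, build it as in the Alpaca sketch above.
prompt = "### Instruction:\nContinue the chat.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# High temperature flattens the distribution; min-p then prunes tokens whose
# probability falls below a fraction of the top token's, per the README tip.
outputs = model.generate(
    **inputs,
    do_sample=True,
    temperature=1.8,  # within the suggested ~1.5-2 range
    min_p=0.04,       # within the suggested ~0.03-0.05 range
    max_new_tokens=256,
)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```

Pairing a high temperature with min-p is a common trick for overconfident models: the temperature opens up variety while the min-p floor discards the long tail of implausible tokens.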
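On the note that all modules, including 'gate', were trained: for readers trying to reproduce this, here is a hypothetical sketch of a peft `LoraConfig` targeting every Mixtral linear block, router included. The rank, alpha, and dropout values are placeholders; the commit does not show the actual training configuration.

```python
from peft import LoraConfig

# Hypothetical LoRA setup covering all Mixtral modules; hyperparameters are
# placeholders, not the values used to train this model.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",  # attention projections
        "w1", "w2", "w3",                        # expert MLP projections
        "gate",                                  # MoE router, per the note above
    ],
)
```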