mistralai/Mistral-7B-Instruct-v0.3, UQFF quantization
Run with mistral.rs. Documentation: UQFF docs.
| Quantization type(s) | Example |
|---|---|
| FP8 | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-f8e4m3.uqff` |
| HQQ4 | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-hqq4.uqff` |
| HQQ8 | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-hqq8.uqff` |
| Q3K | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q3k.uqff` |
| Q4K | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q4k.uqff` |
| Q5K | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q5k.uqff` |
| Q8_0 | `./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q8_0.uqff` |
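Every command in the table differs only in the `.uqff` filename, so a small wrapper can pick the file from the quantization label. The sketch below is one way to do that in POSIX shell. The function names `uqff_file` and `uqff_command` are hypothetical helpers, not part of mistral.rs; the model ID, flags, and filenames are copied from the table. The helpers only print the command, so nothing is launched until you choose to run it.

```shell
#!/bin/sh
# Hypothetical helpers (not part of mistral.rs): build the command
# from the table above for a given quantization label.
MODEL_ID="EricB/Mistral-7B-Instruct-v0.3-UQFF"

# Map a table label (FP8, HQQ4, ...) to the UQFF filename listed in the table.
uqff_file() {
    case "$1" in
        FP8)  suffix="f8e4m3" ;;
        HQQ4) suffix="hqq4" ;;
        HQQ8) suffix="hqq8" ;;
        Q3K)  suffix="q3k" ;;
        Q4K)  suffix="q4k" ;;
        Q5K)  suffix="q5k" ;;
        Q8_0) suffix="q8_0" ;;
        *)    echo "unknown quantization type: $1" >&2; return 1 ;;
    esac
    echo "mistral0.3-7b-instruct-${suffix}.uqff"
}

# Print the full mistralrs-server invocation for a label; fails on unknown labels.
uqff_command() {
    file=$(uqff_file "$1") || return 1
    echo "./mistralrs-server -i plain -m ${MODEL_ID} --from-uqff ${file}"
}
```

For example, `uqff_command Q4K` prints the Q4K row's command, and `eval "$(uqff_command Q4K)"` would run it, assuming the `mistralrs-server` binary is in the current directory.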
Base model: mistralai/Mistral-7B-v0.3