Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,25 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- uqff
|
4 |
+
- mistral.rs
|
5 |
+
base_model: microsoft/Phi-3.5-mini-instruct
|
6 |
+
base_model_relation: quantized
|
7 |
+
---
|
8 |
+
|
9 |
+
<!-- Autogenerated from user input. -->
|
10 |
+
|
11 |
+
# `microsoft/Phi-3.5-mini-instruct`, UQFF quantization
|
12 |
+
|
13 |
+
|
14 |
+
Run with [mistral.rs](https://github.com/EricLBuehler/mistral.rs). Documentation: [UQFF docs](https://github.com/EricLBuehler/mistral.rs/blob/master/docs/UQFF.md).
|
15 |
+
|
16 |
+
1) **Flexible** 🌀: Multiple quantization formats in *one* file format with *one* framework to run them all.
|
17 |
+
2) **Reliable** 🔒: Compatibility ensured with *embedded* and *checked* semantic versioning information from day 1.
|
18 |
+
3) **Easy** 🤗: Download UQFF models *easily* and *quickly* from Hugging Face, or use a local file.
|
19 |
+
3) **Customizable** 🛠️: Make and publish your own UQFF files in minutes.
|
20 |
+
## Files
|
21 |
+
|
22 |
+
|Name|Quantization type(s)|Example|
|
23 |
+
|--|--|--|
|
24 |
+
|phi3.5-mini-instruct-q4k.uqff|Q4K|`./mistralrs-server plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-q4k.uqff`|
|
25 |
+
|phi3.5-mini-instruct-hqq4.uqff|HQQ4|`./mistralrs-server plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-hqq4.uqff`|
|