Update README.md
Browse files
README.md
CHANGED
@@ -20,10 +20,10 @@ library_name: transformers
|
|
20 |
[](https://huggingface.co/fhamborg/phi-4-4bit-gptq)
|
21 |
|
22 |
## Model Description
|
23 |
-
This is a **4-bit
|
24 |
|
25 |
- **Base Model**: [Phi-4](https://huggingface.co/...)
|
26 |
-
- **Quantization**:
|
27 |
- **Format**: `safetensors`
|
28 |
- **Tokenizer**: Uses standard `vocab.json` and `merges.txt`
|
29 |
|
|
|
20 |
[](https://huggingface.co/fhamborg/phi-4-4bit-gptq)
|
21 |
|
22 |
## Model Description
|
23 |
+
This is a **4-bit quantized** version of the Phi-4 transformer model, optimized for **efficient inference** while maintaining performance.
|
24 |
|
25 |
- **Base Model**: [Phi-4](https://huggingface.co/...)
|
26 |
+
- **Quantization**: autoround and bnb (4-bit)
|
27 |
- **Format**: `safetensors`
|
28 |
- **Tokenizer**: Uses standard `vocab.json` and `merges.txt`
|
29 |
|