fhamborg commited on
Commit
1641df8
·
verified ·
1 Parent(s): 1d49485

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -20,10 +20,10 @@ library_name: transformers
20
  [![Model](https://img.shields.io/badge/HuggingFace-Phi--4--GPTQ-orange)](https://huggingface.co/fhamborg/phi-4-4bit-gptq)
21
 
22
  ## Model Description
23
- This is a **4-bit GPTQ-quantized** version of the Phi-4 transformer model, optimized for **efficient inference** while maintaining performance.
24
 
25
  - **Base Model**: [Phi-4](https://huggingface.co/...)
26
- - **Quantization**: GPTQ (4-bit)
27
  - **Format**: `safetensors`
28
  - **Tokenizer**: Uses standard `vocab.json` and `merges.txt`
29
 
 
20
  [![Model](https://img.shields.io/badge/HuggingFace-Phi--4--GPTQ-orange)](https://huggingface.co/fhamborg/phi-4-4bit-gptq)
21
 
22
  ## Model Description
23
+ This is a **4-bit quantized** version of the Phi-4 transformer model, optimized for **efficient inference** while maintaining performance.
24
 
25
  - **Base Model**: [Phi-4](https://huggingface.co/...)
26
+ - **Quantization**: autoround and bnb (4-bit)
27
  - **Format**: `safetensors`
28
  - **Tokenizer**: Uses standard `vocab.json` and `merges.txt`
29