Update README.md
Browse files
README.md
CHANGED
@@ -24,4 +24,10 @@ base_model: mistralai/Mistral-7B-Instruct-v0.3
|
|
24 |
|
25 |
This model is ready for optimized inference using the Marlin mixed-precision kernels in vLLM: https://github.com/vllm-project/vllm
|
26 |
|
27 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
This model is ready for optimized inference using the Marlin mixed-precision kernels in vLLM: https://github.com/vllm-project/vllm
|
26 |
|
27 |
+
Simply start this model as an inference server with:
|
28 |
+
```bash
|
29 |
+
python -m vllm.entrypoints.openai.api_server --model neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit
|
30 |
+
```
|
31 |
+
|
32 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/60466e4b4f40b01b66151416/SC_tYXjoS3yIoOYtfqZ2E.png)
|
33 |
+
|