Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ tags:
|
|
18 |
This model page includes GGUF versions of [relay-v0.1-Mistral-Nemo-2407](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407).
|
19 |
For more details about this model, please see that model page.
|
20 |
|
21 |
-
**Note**: If you have access to a CUDA GPU, it's highly
|
22 |
|
23 |
## Custom Preset for LM Studio
|
24 |
|
|
|
18 |
This model page includes GGUF versions of [relay-v0.1-Mistral-Nemo-2407](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407).
|
19 |
For more details about this model, please see that model page.
|
20 |
|
21 |
+
**Note**: If you have access to a CUDA GPU, it's highly recommended you use the [main version (HF)](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407) of the model with the [relaylm.py](https://github.com/danlou/relay/blob/main/relaylm.py) script, which supports better use of commands (e.g., system messages). The `relaylm.py` script also supports [4bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-4bit) and [8bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-8bit) bitsandbytes quants.
|
22 |
|
23 |
## Custom Preset for LM Studio
|
24 |
|