danlou/relay-v0.1-Mistral-Nemo-2407-GGUF


danlou committed on Dec 14, 2024

Commit b73e6b5 · verified · 1 Parent(s): 2e3d0f6

Update README.md

Files changed (1)

README.md +1 -1


@@ -18,7 +18,7 @@ tags:
 This model page includes GGUF versions of [relay-v0.1-Mistral-Nemo-2407](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407).
 For more details about this model, please see that model page.
-**Note**: If you have access to a CUDA GPU, it's highly recommend you use the [main version (HF)](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407) of the model with the [relaylm.py](https://github.com/danlou/relay/blob/main/relaylm.py) script, which supports better use of commands (e.g., system messages). The `relaylm.py` script also supports [4bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-4bit) and [8bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-8bit) bitsandbytes quants.
+**Note**: If you have access to a CUDA GPU, it's highly recommended you use the [main version (HF)](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407) of the model with the [relaylm.py](https://github.com/danlou/relay/blob/main/relaylm.py) script, which supports better use of commands (e.g., system messages). The `relaylm.py` script also supports [4bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-4bit) and [8bit](https://huggingface.co/danlou/relay-v0.1-Mistral-Nemo-2407-8bit) bitsandbytes quants.
 ## Custom Preset for LM Studio