Upload README.md with huggingface_hub
README.md
ADDED
@@ -0,0 +1,23 @@
---
library_name: transformers
tags:
- llama-cpp
- gguf-my-lora
base_model: salni84/fine-tuned-llama3.2
---

# salni84/fine-tuned-llama3.2-Q8_0-GGUF
This LoRA adapter was converted to GGUF format from [`salni84/fine-tuned-llama3.2`](https://huggingface.co/salni84/fine-tuned-llama3.2) using ggml.ai's [GGUF-my-lora](https://huggingface.co/spaces/ggml-org/gguf-my-lora) space.
Refer to the [original adapter repository](https://huggingface.co/salni84/fine-tuned-llama3.2) for more details.

## Use with llama.cpp

```bash
# with cli
llama-cli -m base_model.gguf --lora fine-tuned-llama3.2-q8_0.gguf (...other args)

# with server
llama-server -m base_model.gguf --lora fine-tuned-llama3.2-q8_0.gguf (...other args)
```
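
Before running either command, it can help to confirm that both the base model and the adapter are really GGUF files, since a GGUF file begins with the four ASCII bytes `GGUF`. The sketch below is only an illustration: `is_gguf` and `lora_cmd` are hypothetical helper names and are not part of llama.cpp. The wrapper prints the `llama-cli` command rather than running it, so you can inspect it and append your other arguments.

```bash
#!/usr/bin/env bash

# Hypothetical helper (not part of llama.cpp): succeeds only if the file
# exists and starts with the 4-byte GGUF magic "GGUF".
is_gguf() {
  [ -f "$1" ] && [ "$(head -c 4 "$1")" = "GGUF" ]
}

# Hypothetical wrapper: checks both files, then prints the llama-cli
# command line instead of executing it.
lora_cmd() {
  local base="$1" adapter="$2"
  is_gguf "$base" || { echo "not a GGUF file: $base" >&2; return 1; }
  is_gguf "$adapter" || { echo "not a GGUF file: $adapter" >&2; return 1; }
  printf 'llama-cli -m %s --lora %s\n' "$base" "$adapter"
}
```

For example, `lora_cmd base_model.gguf fine-tuned-llama3.2-q8_0.gguf` prints the command shown above when both files pass the check, and returns a non-zero status with a message on stderr otherwise.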

To learn more about using LoRA adapters with the llama.cpp server, refer to the [llama.cpp server documentation](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md).