Update README.md
README.md
CHANGED
@@ -1,12 +1,15 @@
----
-base_model: nvidia/Llama-3.1-Minitron-4B-Width-Base
-license: other
-license_name: nvidia-open-model-license
-license_link:
-
-
--
-
+---
+base_model: nvidia/Llama-3.1-Minitron-4B-Width-Base
+license: other
+license_name: nvidia-open-model-license
+license_link: >-
+  https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf
+tags:
+- llama-cpp
+- gguf-my-repo
+datasets:
+- Gryphe/Opus-WritingPrompts
+---
 
 # ijohn07/Llama-3.1-Minitron-4B-Width-Base-Q5_K_M-GGUF
 This model was converted to GGUF format from [`nvidia/Llama-3.1-Minitron-4B-Width-Base`](https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
@@ -50,4 +53,4 @@ Step 3: Run inference through the main binary.
 or
 ```
 ./llama-server --hf-repo ijohn07/Llama-3.1-Minitron-4B-Width-Base-Q5_K_M-GGUF --hf-file llama-3.1-minitron-4b-width-base-q5_k_m-imat.gguf -c 2048
-```
+```