second-state
/

Meta-Llama-3.1-8B-Instruct-GGUF

Text Generation

Model card Files Files and versions Community

apepkuss79 commited on Jul 30, 2024

Commit

f24f6a2

·

verified ·

1 Parent(s): 2989685

Update README.md

Files changed (1) hide show

README.md +20 -8

README.md CHANGED Viewed

@@ -65,13 +65,25 @@ tags:
 - Run as LlamaEdge service
-  ```bash
-  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3.1-8B-Instruct-Q5_K_M.gguf \
-    llama-api-server.wasm \
-    --prompt-template llama-3-chat \
-    --ctx-size 128000 \
-    --model-name Llama-3.1-8b
-  ```
 - Run as LlamaEdge command app
@@ -79,7 +91,7 @@ tags:
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3.1-8B-Instruct-Q5_K_M.gguf \
     llama-chat.wasm \
     --prompt-template llama-3-chat \
-    --ctx-size 128000 \
   ```
 ## Quantized GGUF Models

 - Run as LlamaEdge service
+  - Chat
+    ```bash
+    wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3.1-8B-Instruct-Q5_K_M.gguf \
+      llama-api-server.wasm \
+      --prompt-template llama-3-chat \
+      --ctx-size 128000 \
+      --model-name Llama-3.1-8b
+    ```
+  - Tool use
+    ```bash
+    wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3.1-8B-Instruct-Q5_K_M.gguf \
+      llama-api-server.wasm \
+      --prompt-template llama-3-tool \
+      --ctx-size 128000 \
+      --model-name Llama-3.1-8b
+    ```
 - Run as LlamaEdge command app
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3.1-8B-Instruct-Q5_K_M.gguf \
     llama-chat.wasm \
     --prompt-template llama-3-chat \
+    --ctx-size 128000
   ```
 ## Quantized GGUF Models