Xin Liu committed on
Commit
145dcde
1 Parent(s): bcecae4

Add Q2 model


Signed-off-by: Xin Liu <[email protected]>

Files changed (3)
  1. .gitattributes +1 -0
  2. README.md +54 -1
  3. Yi-34Bx2-MoE-60B-Q2_K.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,56 @@
 ---
-license: apache-2.0
+base_model: cloudyu/Yi-34Bx2-MoE-60B
+license: cc-by-nc-4.0
+model_creator: cloudyu
+model_name: Yi 34Bx2 MoE 60B
+model_type: mistral
+quantized_by: Second State Inc.
+tags:
+- moe
 ---
+
+<!-- header start -->
+<!-- 200823 -->
+<div style="width: auto; margin-left: auto; margin-right: auto">
+<img src="https://github.com/second-state/LlamaEdge/raw/dev/assets/logo.svg" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<hr style="margin-top: 1.0em; margin-bottom: 1.0em;">
+<!-- header end -->
+
+# Yi-34Bx2-MoE-60B-GGUF
+
+## Original Model
+
+[cloudyu/Yi-34Bx2-MoE-60B](https://huggingface.co/cloudyu/Yi-34Bx2-MoE-60B)
+
+## Run with LlamaEdge
+
+- LlamaEdge version: [v0.2.4](https://github.com/second-state/LlamaEdge/releases/tag/0.2.4)
+
+- Prompt template
+
+  - Prompt type: `chatml`
+
+  - Prompt string
+
+    ```text
+    <|im_start|>system
+    {system_message}<|im_end|>
+    <|im_start|>user
+    {prompt}<|im_end|>
+    <|im_start|>assistant
+    ```
+
+- Reverse prompt: `<|im_end|>`
+
+- Run as LlamaEdge service
+
+  ```bash
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Yi-34Bx2-MoE-60B-Q2_K.gguf llama-api-server.wasm -p chatml -r '<|im_end|>'
+  ```
+
+- Run as LlamaEdge command app
+
+  ```bash
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Yi-34Bx2-MoE-60B-Q2_K.gguf llama-chat.wasm -p chatml -r '<|im_end|>'
+  ```
Yi-34Bx2-MoE-60B-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a51505254c1b7072b61e8f2b3e7ce380237d7da6e10fddbada956d6ca7503ced
+size 22391949280