Xin Liu
Add Q2 model
145dcde
|
raw
history blame
1.35 kB
metadata
base_model: cloudyu/Yi-34Bx2-MoE-60B
license: cc-by-nc-4.0
model_creator: cloudyu
model_name: Yi 34Bx2 MoE 60B
model_type: mistral
quantized_by: Second State Inc.
tags:
  - moe

Yi-34Bx2-MoE-60B-GGUF

Original Model

cloudyu/Yi-34Bx2-MoE-60B

Run with LlamaEdge

  • LlamaEdge version: v0.2.4

  • Prompt template

    • Prompt type: chatml

    • Prompt string

      <|im_start|>system
      {system_message}<|im_end|>
      <|im_start|>user
      {prompt}<|im_end|>
      <|im_start|>assistant
      
    • Reverse prompt: <|im_end|>

  • Run as LlamaEdge service

    wasmedge --dir .:. --nn-preload default:GGML:AUTO:Yi-34B-Chat-ggml-model-q4_0.gguf llama-api-server.wasm -p chatml -r '<|im_end|>'
    
  • Run as LlamaEdge command app

    wasmedge --dir .:. --nn-preload default:GGML:AUTO:Yi-34B-Chat-ggml-model-q4_0.gguf llama-chat.wasm -p chatml -r '<|im_end|>'