原始模型:https://huggingface.co/SakuraLLM/Sakura-13B-Qwen2beta-v0.9

LLAMA.CPP直接转换,未经测试

Downloads last month
16
GGUF
Model size
14.2B params
Architecture
qwen2

3-bit

4-bit

6-bit

Inference API
Unable to determine this model's library. Check the docs .