原始模型:https://huggingface.co/SakuraLLM/Sakura-13B-Qwen2beta-v0.9
https://huggingface.co/SakuraLLM/Sakura-13B-Qwen2beta-v0.9
LLAMA.CPP直接转换,未经测试
3-bit
4-bit
6-bit