Kunger
/

Sakura-13B-Qwen2beta-v0.9-GGUF

Inference Endpoints

Model card Files Files and versions Community

原始模型：https://huggingface.co/SakuraLLM/Sakura-13B-Qwen2beta-v0.9

LLAMA.CPP直接转换，未经测试

Downloads last month: 16

GGUF

Model size

14.2B params

Architecture

qwen2

3-bit

4-bit

6-bit

Inference API

Unable to determine this model's library. Check the docs .