Update README.md
README.md CHANGED
@@ -30,6 +30,7 @@ pipeline_tag: text-generation
- **Attention Heads (GQA)**: 24 for Q, 4 for KV
- **Context Length**: Supports a full context of 131,072 tokens and generation of up to 8,192 tokens
- **Quantization**: AWQ 4-bit
+ - **Base model**: Qwen/Qwen2.5-3B-Instruct-AWQ

### Requirements
The code of Qwen2.5 is included in the latest Hugging Face `transformers`, and we advise you to use the latest version of `transformers`.
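The requirement above can be sanity-checked before loading the model. A minimal sketch, assuming the commonly cited minimum of `transformers` 4.37.0 (the release that added the Qwen2 architecture; older versions fail with `KeyError: 'qwen2'` when loading Qwen2.5 configs) — the helper name and the version strings below are illustrative, not from this README:

```python
def is_supported(version_string, minimum=(4, 37, 0)):
    """Return True if a transformers version string meets the assumed
    4.37.0 minimum needed for the Qwen2 architecture."""
    # Compare only the numeric major.minor.patch components.
    parts = tuple(int(p) for p in version_string.split(".")[:3])
    return parts >= minimum

print(is_supported("4.36.2"))  # → False: too old, would raise KeyError: 'qwen2'
print(is_supported("4.44.0"))  # → True: recent enough for Qwen2.5
```

In practice you would pass `transformers.__version__` to such a check, or simply upgrade with `pip install -U transformers` as the README advises.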