Update README.md
README.md CHANGED
@@ -34,8 +34,6 @@ As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7,
 - Number of Paramaters (Non-Embedding): 31.0B
 - Number of Layers: 64
 - Number of Attention Heads (GQA): 40 for Q and 8 for KV
-- Context Length: 131,072 tokens
-
 - Context Length: Full 131,072 tokens
 - Please refer to [this section](#processing-long-texts) for detailed instructions on how to deploy Qwen2.5 for handling long texts.
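The retained bullets describe the 32B-class configuration (31.0B non-embedding parameters, 64 layers, GQA with 40 query and 8 KV heads, 131,072-token context). Below is a minimal usage sketch, assuming the Hugging Face `transformers` API and the hypothetical checkpoint name `Qwen/Qwen2.5-Coder-32B-Instruct`; neither the checkpoint name nor the YaRN `rope_scaling` values in the comment are stated in this commit, and the scaling factor is only inferred from 131,072 / 32,768 = 4.0 rather than quoted from the linked "Processing Long Texts" section.

```python
# Minimal sketch, assuming the Hugging Face `transformers` API and the
# hypothetical checkpoint name "Qwen/Qwen2.5-Coder-32B-Instruct" (not named
# in this commit). It only illustrates loading and generating with the model
# whose spec bullets the diff touches.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-Coder-32B-Instruct"  # assumed checkpoint, for illustration

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype="auto", device_map="auto"
)

# For prompts beyond the default window, the linked "Processing Long Texts"
# section is where the deployment details live; a common approach (assumed
# here, not quoted from this commit) is YaRN rope scaling in config.json, e.g.
#   "rope_scaling": {"type": "yarn", "factor": 4.0,
#                    "original_max_position_embeddings": 32768}
# where 4.0 is inferred from 131,072 / 32,768.
prompt = "Write a quick sort implementation in Python."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```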