Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -5,7 +5,7 @@ language:
5
  - en
6
  base_model:
7
  - Qwen/Qwen2.5-Coder-32B
8
- pipeline_tag: text-generation
9
  library_name: transformers
10
  tags:
11
  - code
@@ -32,7 +32,7 @@ Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (
32
  - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
33
  - Number of Parameters: 32.5B
34
  - Number of Paramaters (Non-Embedding): 31.0B
35
- - Number of Layers: 64
36
  - Number of Attention Heads (GQA): 40 for Q and 8 for KV
37
  - Context Length: Full 131,072 tokens
38
  - Please refer to [this section](#processing-long-texts) for detailed instructions on how to deploy Qwen2.5 for handling long texts.
@@ -78,7 +78,7 @@ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
78
 
79
  generated_ids = model.generate(
80
  **model_inputs,
81
- max_new_tokens=512
82
  )
83
  generated_ids = [
84
  output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
@@ -132,4 +132,4 @@ If you find our work helpful, feel free to give us a cite.
132
  journal={arXiv preprint arXiv:2407.10671},
133
  year={2024}
134
  }
135
- ```
 
5
  - en
6
  base_model:
7
  - Qwen/Qwen2.5-Coder-32B
8
+ pipeline_tag: image-to-text
9
  library_name: transformers
10
  tags:
11
  - code
 
32
  - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
33
  - Number of Parameters: 32.5B
34
  - Number of Paramaters (Non-Embedding): 31.0B
35
+ - Number of Layers: 512
36
  - Number of Attention Heads (GQA): 40 for Q and 8 for KV
37
  - Context Length: Full 131,072 tokens
38
  - Please refer to [this section](#processing-long-texts) for detailed instructions on how to deploy Qwen2.5 for handling long texts.
 
78
 
79
  generated_ids = model.generate(
80
  **model_inputs,
81
+ max_new_tokens=4096
82
  )
83
  generated_ids = [
84
  output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
 
132
  journal={arXiv preprint arXiv:2407.10671},
133
  year={2024}
134
  }
135
+ ```