jklj077 commited on
Commit
d9db0f6
1 Parent(s): 675011f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we rele
18
  - **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
19
  - **Multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
20
 
21
- **This repo contains the base 72B Qwen2.5 model**, which has the following features:
22
  - Type: Causal Language Models
23
  - Training Stage: Pretraining
24
  - Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings
 
18
  - **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
19
  - **Multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
20
 
21
+ **This repo contains the base 3B Qwen2.5 model**, which has the following features:
22
  - Type: Causal Language Models
23
  - Training Stage: Pretraining
24
  - Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings