Qwen/Qwen2.5-3B

jklj077 committed on Sep 17

Commit d9db0f6 (parent: 675011f)

Update README.md

Files changed (1)

README.md CHANGED

@@ -18,7 +18,7 @@ Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we rele
 - **Long-context Support** up to 128K tokens and can generate up to 8K tokens.
 - **Multilingual support** for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

-**This repo contains the base 72B Qwen2.5 model**, which has the following features:
+**This repo contains the base 3B Qwen2.5 model**, which has the following features:
 - Type: Causal Language Models
 - Training Stage: Pretraining
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings