Update README.md
Browse files
README.md
CHANGED
@@ -11,15 +11,30 @@ pinned: false
|
|
11 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/j4NWHo9opdcv3jV69T91k.png" width="1000" height="500">
|
12 |
</div>
|
13 |
|
|
|
14 |
|
15 |
-
#
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
|
17 |
Add Yi-1.5 benchmark results, like
|
18 |
|
19 |
-
|
20 |
|
|
|
21 |
|
22 |

|
23 |
|
24 |
-
# News
|
|
|
|
|
|
|
|
|
|
|
|
|
25 |
|
|
|
|
11 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/j4NWHo9opdcv3jV69T91k.png" width="1000" height="500">
|
12 |
</div>
|
13 |
|
14 |
+
Yi model family is a series of language and multimodal models. It is based on 6B and 34B pretrained language models, then extended to **chat** models, 200K **long context** models, depth-upscaled models (9B), and **vision**-language models.
|
15 |
|
16 |
+
# ⭐️ Highlights
|
17 |
+
|
18 |
+
- **Strong performance**: Yi-1.5-34B is on par with or surpasses GPT-3.5 in commonsense reasoning, college exams, math, coding, reading comprehension, and human preference win-rate on multiple evaluation benchmarks.
|
19 |
+
|
20 |
+
- **Cost-effective**: For 6B, 9B, and 34B, you can perform inference on consumer-grade hardware (like the RTX 4090). Additionally, 34B is large enough with complex reasoning and emergent abilities, giving a nice performance-cost balance.
|
21 |
+
|
22 |
+
# 📊 Benchmarks
|
23 |
|
24 |
Add Yi-1.5 benchmark results, like
|
25 |
|
26 |
+
xx base models achieve strong performance on a wide range of benchmarks like MMLU, and xx finetuned chat models deliver strong human preference rate on major evaluation platforms like AlpacaEval and Chatbot Arena.
|
27 |
|
28 |
+

|
29 |
|
30 |

|
31 |
|
32 |
+
# 📰 News
|
33 |
+
|
34 |
+
- 2024-03-16: The Yi-9B-200K is open-sourced and available to the public.
|
35 |
+
|
36 |
+
- 2024-03-08: Yi Tech Report is published!
|
37 |
+
|
38 |
+
- 2024-03-07: The long text capability of the Yi-34B-200K has been enhanced.
|
39 |
|
40 |
+
For complete news history, see [News](xx.md).
|