Update README.md
README.md CHANGED
@@ -79,19 +79,6 @@ Breeze-7B-Instruct-64k-v0.1 can solve tasks such as question answering and summa
 
 
 \* Few-shot learning cannot effectively guide the model to generate the proper answer.
-
-**Category ACC of TMMLU+ (5 shot)**
-
-| Models | STEM | Social Science | Humanities | Other | ↑ AVG |
-|----------------------------------|--------------|----------------|------------|------------|-------|
-| Yi-34B | 56.03 | 73.06 | 61.12 | 62.19 | 63.10 |
-| Qwen-14B | 46.51 | 58.20 | 51.12 | 49.38 | 51.30 |
-| Yi-6B | 41.14 | 57.77 | 50.22 | 49.39 | 49.63 |
-| Qwen-7B | 28.25 | 47.80 | 43.14 | 42.17 | 42.84 |
-| **Breeze-7B-Base-v0.1** | 35.74 | 46.08 | 40.29 | 39.27 | 40.35 |
-| Mistral-7B-v0.1 | 33.01 | 42.23 | 35.86 | 37.63 | 36.93 |
-
-
 
 
 ## Chat Model Performance