upload readme
README.md
CHANGED
@@ -50,7 +50,6 @@ We conducted a comprehensive evaluation of InternLM using the open-source evalua
 | CMMLU (5-shot) | 62.2 | 50.6 | 65.2 |
 | BBH (3-shot CoT) | **41.9** | 41.5 | 36.5 |
 | MATH (0-shot CoT) | **40.2** | 15.5 | 21.4 |
-| HumanEval | 43.3 | 50.0 | 47.6 |
 | GPQA (0-shot) | **27.8** | 23.7 | 27.3 |
 
 - The evaluation results were obtained from [OpenCompass](https://github.com/internLM/OpenCompass/), and evaluation configuration can be found in the configuration files provided by [OpenCompass](https://github.com/internLM/OpenCompass/).
@@ -199,7 +198,6 @@ InternLM2.5, the 2.5th generation of the InternLM (书生·浦语) large model, open-sources practical-scenario-oriented …
 | CMMLU (5-shot) | 62.2 | 50.6 | 65.2 |
 | BBH (3-shot CoT) | **41.9** | 41.5 | 36.5 |
 | MATH (0-shot CoT) | **40.2** | 15.5 | 21.4 |
-| HumanEval | 43.3 | 50.0 | 47.6 |
 | GPQA (0-shot) | **27.8** | 23.7 | 27.3 |
 
 - The evaluation results above were obtained with [OpenCompass](https://github.com/internLM/OpenCompass/); for specific test details, see the configuration files provided in [OpenCompass](https://github.com/internLM/OpenCompass/).