Commit
021226a
1 Parent(s): 191cf06

Adding Evaluation Results (#4)

Browse files

- Adding Evaluation Results (88be99d7c5271061b828346eba3dcf6187a73387)


Co-authored-by: Open LLM Leaderboard PR Bot <[email protected]>

Files changed (1) hide show
  1. README.md +17 -4
README.md CHANGED
@@ -9,12 +9,12 @@ tags:
9
  - qwen
10
  - moe
11
  base_model: Qwen/Qwen1.5-MoE-A2.7B
12
- model-index:
13
- - name: models/Qwen1.5-MoE-A2.7B-Wikihow
14
- results: []
15
  datasets:
16
  - HuggingFaceTB/cosmopedia
17
  pipeline_tag: text-generation
 
 
 
18
  ---
19
 
20
  # models/Qwen1.5-MoE-A2.7B-Wikihow
@@ -156,4 +156,17 @@ special_tokens:
156
  - Transformers 4.40.0.dev0
157
  - Pytorch 2.2.0+cu121
158
  - Datasets 2.18.0
159
- - Tokenizers 0.15.2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  - qwen
10
  - moe
11
  base_model: Qwen/Qwen1.5-MoE-A2.7B
 
 
 
12
  datasets:
13
  - HuggingFaceTB/cosmopedia
14
  pipeline_tag: text-generation
15
+ model-index:
16
+ - name: models/Qwen1.5-MoE-A2.7B-Wikihow
17
+ results: []
18
  ---
19
 
20
  # models/Qwen1.5-MoE-A2.7B-Wikihow
 
156
  - Transformers 4.40.0.dev0
157
  - Pytorch 2.2.0+cu121
158
  - Datasets 2.18.0
159
+ - Tokenizers 0.15.2
160
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
161
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__Qwen1.5-MoE-A2.7B-Wikihow)
162
+
163
+ | Metric |Value|
164
+ |-------------------|----:|
165
+ |Avg. |11.43|
166
+ |IFEval (0-Shot) |29.54|
167
+ |BBH (3-Shot) |15.47|
168
+ |MATH Lvl 5 (4-Shot)| 2.87|
169
+ |GPQA (0-shot) | 3.36|
170
+ |MuSR (0-shot) | 2.01|
171
+ |MMLU-PRO (5-shot) |15.34|
172
+