Adding Evaluation Results

#2
by sthenno - opened
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -231,3 +231,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
231
  |MuSR (0-shot) |14.56|
232
  |MMLU-PRO (5-shot) |47.69|
233
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
231
  |MuSR (0-shot) |14.56|
232
  |MMLU-PRO (5-shot) |47.69|
233
 
234
+
235
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
236
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/sthenno__tempesthenno-ppo-ckpt40-details)
237
+
238
+ | Metric |Value|
239
+ |-------------------|----:|
240
+ |Avg. |42.74|
241
+ |IFEval (0-Shot) |79.23|
242
+ |BBH (3-Shot) |50.57|
243
+ |MATH Lvl 5 (4-Shot)|47.36|
244
+ |GPQA (0-shot) |17.00|
245
+ |MuSR (0-shot) |14.56|
246
+ |MMLU-PRO (5-shot) |47.69|
247
+