Weyaxi committed
Commit 625aa68 · verified · 1 Parent(s): 403c2e2

Reformat leaderboard

Files changed (1)
  1. README.md +10 -13
README.md CHANGED
@@ -363,7 +363,17 @@ model.generate(**gen_input)
  - https://huggingface.co/bartowski/Einstein-v6.1-Llama3-8B-exl2
 
  # 🎯 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Weyaxi__Einstein-v6.1-Llama3-8B)
 
+ | Metric |Value|
+ |---------------------------------|----:|
+ |Avg. |68.60|
+ |AI2 Reasoning Challenge (25-Shot)|62.46|
+ |HellaSwag (10-Shot) |82.41|
+ |MMLU (5-Shot) |66.19|
+ |TruthfulQA (0-shot) |55.10|
+ |Winogrande (5-shot) |79.32|
+ |GSM8k (5-shot) |66.11|
 
  # 🤖 Additional information about training
 
@@ -392,16 +402,3 @@ Thanks to all open source AI community.
  If you would like to support me:
 
  [☕ Buy Me a Coffee](https://www.buymeacoffee.com/weyaxi)
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Weyaxi__Einstein-v6.1-Llama3-8B)
-
- | Metric |Value|
- |---------------------------------|----:|
- |Avg. |68.60|
- |AI2 Reasoning Challenge (25-Shot)|62.46|
- |HellaSwag (10-Shot) |82.41|
- |MMLU (5-Shot) |66.19|
- |TruthfulQA (0-shot) |55.10|
- |Winogrande (5-shot) |79.32|
- |GSM8k (5-shot) |66.11|
-
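
The commit moves the results table without changing its values. As a quick sanity check on the moved table, the `Avg.` row appears to be the unweighted mean of the six benchmark scores. The sketch below assumes that convention (it is not stated in the diff), and the helper name `leaderboard_average` is hypothetical:

```python
# Scores copied from the table in this diff.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 62.46,
    "HellaSwag (10-Shot)": 82.41,
    "MMLU (5-Shot)": 66.19,
    "TruthfulQA (0-shot)": 55.10,
    "Winogrande (5-shot)": 79.32,
    "GSM8k (5-shot)": 66.11,
}


def leaderboard_average(values):
    """Unweighted mean rounded to two decimals (assumed convention)."""
    values = list(values)
    return round(sum(values) / len(values), 2)


avg = leaderboard_average(scores.values())
print(f"{avg:.2f}")  # prints 68.60, matching the Avg. row
```

The six scores sum to 411.59, and 411.59 / 6 ≈ 68.598, which rounds to the 68.60 reported in the table.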