Update README.md
README.md
CHANGED
@@ -53,8 +53,10 @@ The following models were included in the merge:
 [Detailed Results + Failed GSM8K](https://huggingface.co/datasets/open-llm-leaderboard/details_ABX-AI__Silver-Sun-11B)
 
 
->[NOTE]
+>[!NOTE]
+>I had to remove GSM8K from the results and manually re-average the rest. GSM8K failed inexplicably, and it should not have.
 >By removing the GSM8K score, the average is VERY close to upstage/SOLAR-10.7B-v1.0 (74.20), which would make sense.
+>Feel free to ignore the actual average and use the other scores individually for reference.
 
 | Metric |Value|
 |---------------------------------|----:|