Updated notes
Browse files
README.md
CHANGED
@@ -109,7 +109,10 @@ model-index:
|
|
109 |
---
|
110 |
# ZEUS
|
111 |
|
112 |
-
|
|
|
|
|
|
|
113 |
|
114 |
## Merge Details
|
115 |
|
@@ -169,3 +172,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|
|
169 |
|MuSR (0-shot) | 8.57|
|
170 |
|MMLU-PRO (5-shot) |32.14|
|
171 |
|
|
|
|
109 |
---
|
110 |
# ZEUS
|
111 |
|
112 |
+
Inspired by [Dampfinchen/Llama-3.1-8B-Ultra-Instruct](https://huggingface.co/Dampfinchen/Llama-3.1-8B-Ultra-Instruct),
|
113 |
+
the goal of this merge was to create an abliterated, conversational AI restricted to 8B parameters that's coherent over long conversations.
|
114 |
+
After testing "Ultra-Instruct" with various parameters, its grammar in responses would degrade over time.
|
115 |
+
While more extensive testing still needs to be done, prelimary results seem to show these problems are fixed.
|
116 |
|
117 |
## Merge Details
|
118 |
|
|
|
172 |
|MuSR (0-shot) | 8.57|
|
173 |
|MMLU-PRO (5-shot) |32.14|
|
174 |
|
175 |
+
* Falls about 1 point behind "Ultra-Instruct" on IFEval and BBH, but everything else is a significant improvement.
|