T145 commited on
Commit
ca27044
·
verified ·
1 Parent(s): 0081c80

Updated notes

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -109,7 +109,10 @@ model-index:
109
  ---
110
  # ZEUS
111
 
112
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 
 
113
 
114
  ## Merge Details
115
 
@@ -169,3 +172,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
169
  |MuSR (0-shot) | 8.57|
170
  |MMLU-PRO (5-shot) |32.14|
171
 
 
 
109
  ---
110
  # ZEUS
111
 
112
+ Inspired by [Dampfinchen/Llama-3.1-8B-Ultra-Instruct](https://huggingface.co/Dampfinchen/Llama-3.1-8B-Ultra-Instruct),
113
+ the goal of this merge was to create an abliterated, conversational AI restricted to 8B parameters that's coherent over long conversations.
114
+ After testing "Ultra-Instruct" with various parameters, its grammar in responses would degrade over time.
115
+ While more extensive testing still needs to be done, prelimary results seem to show these problems are fixed.
116
 
117
  ## Merge Details
118
 
 
172
  |MuSR (0-shot) | 8.57|
173
  |MMLU-PRO (5-shot) |32.14|
174
 
175
+ * Falls about 1 point behind "Ultra-Instruct" on IFEval and BBH, but everything else is a significant improvement.