DavidAU committed on
Commit
d27bf86
1 Parent(s): bd4afd2

Update README.md

Files changed (1)
  1. README.md +25 -1
README.md CHANGED
@@ -19,7 +19,7 @@ Additional quants are uploading...
 The NEO Class tech was created after countless investigations and over 120 lab experiments backed by
 real world testing and qualitative results.
 
-NEO Class results:
+<b>NEO Class results: </b>
 
 Better overall function, instruction following, output quality and stronger connections to ideas, concepts and the world in general.
 
@@ -33,6 +33,30 @@ Perplexity drop of 1191 points for Neo Class Imatrix quant of IQ4XS VS regular q
 
 (lower is better)
 
+<B> A Funny thing happened on the way to the "lab" ... </B>
+
+Although this model uses a "Llama3" template, we found that Command-R's template worked better, specifically for creative purposes.
+
+This applies to both normal quants and Neo quants.
+
+Here is Command-R's template:
+
+{
+  "name": "Cohere Command R",
+  "inference_params": {
+    "input_prefix": "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>",
+    "input_suffix": "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>",
+    "antiprompt": [
+      "<|START_OF_TURN_TOKEN|>",
+      "<|END_OF_TURN_TOKEN|>"
+    ],
+    "pre_prompt_prefix": "<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>",
+    "pre_prompt_suffix": ""
+  }
+}
+
+This "interesting" issue was confirmed by multiple users.
+
 <B> Model Notes: </B>
 
 Maximum context is 8k. Please see original model maker's page for details, and usage information for this model.
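
For anyone wiring the Command-R template up by hand rather than through a preset, here is a minimal sketch of how the template's prefix/suffix fields combine into a single prompt string. The function name and example strings are illustrative, not from any particular library; the token strings are taken verbatim from the template above.

```python
# Hypothetical helper: assemble a single-turn Command-R style prompt
# from the template fields shown above. System text is wrapped by the
# pre_prompt_prefix/suffix; the user turn is wrapped by input_prefix/
# input_suffix, which ends with <|CHATBOT_TOKEN|> so the model's reply
# begins immediately after the prompt.

PRE_PROMPT_PREFIX = "<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>"
PRE_PROMPT_SUFFIX = ""
INPUT_PREFIX = "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>"
INPUT_SUFFIX = "<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>"


def build_prompt(system: str, user: str) -> str:
    """Wrap a system message and one user turn in Command-R tokens."""
    return (
        PRE_PROMPT_PREFIX + system + PRE_PROMPT_SUFFIX
        + INPUT_PREFIX + user + INPUT_SUFFIX
    )


prompt = build_prompt("You are a helpful assistant.", "Write a short poem.")
```

Generation should then stop on either antiprompt string (`<|START_OF_TURN_TOKEN|>` or `<|END_OF_TURN_TOKEN|>`), matching the template's `antiprompt` list.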