DavidAU committed
Commit f278150 · verified · 1 Parent(s): 27f6dbf

Update README.md

Files changed (1)
  1. README.md +27 -0
README.md CHANGED
@@ -66,6 +66,33 @@ Special thanks to the model creators at SAO10K for making such a fantastic model
 
[ https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 ]
 
+ <B>Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:</B>
+
+ In "KoboldCpp", "oobabooga/text-generation-webui", or "Silly Tavern":
+
+ Set the "Smoothing_factor" to 1.5 to 2.5
+
+ : in KoboldCpp -> Settings -> Samplers -> Advanced -> "Smooth_F"
+
+ : in text-generation-webui -> Parameters -> lower right.
+
+ : in Silly Tavern this is called "Smoothing"
+
+ NOTE: For "text-generation-webui":
+
+ -> If using GGUFs, you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model).
+
+ Source versions (and config files) of my models are here:
+
+ https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be
+
+ OTHER OPTIONS:
+
+ - Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor").
+
+ - If the interface/program you are using to run AI models supports "Quadratic Sampling" ("smoothing"), just make the adjustment as noted.
+
<h3> Sample Prompt and Models Compared:</h3>
 
Prompt tested with "temp=0" to ensure compliance, 2048 context (model supports 8192 context / 8k), and "chat" template for LLAMA3.
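
For readers unfamiliar with the "Smoothing_factor" / "Quadratic Sampling" setting added above: it reshapes the raw logits around the top-scoring token before sampling. The snippet below is a minimal NumPy sketch of the commonly described quadratic transform, not the actual code used by KoboldCpp, text-generation-webui, or Silly Tavern; the function names and the toy vocabulary are invented for illustration.

```python
import numpy as np

def quadratic_smoothing(logits: np.ndarray, smoothing_factor: float) -> np.ndarray:
    """Illustrative quadratic 'smoothing' transform.

    The top-scoring token is left unchanged; every other logit is pulled
    down in proportion to its squared distance from the maximum.
    """
    max_logit = logits.max()
    return -smoothing_factor * (logits - max_logit) ** 2 + max_logit

def sample_token(logits: np.ndarray, smoothing_factor: float, rng: np.random.Generator) -> int:
    """Softmax-sample one token id from the smoothed logits."""
    smoothed = quadratic_smoothing(logits, smoothing_factor)
    probs = np.exp(smoothed - smoothed.max())
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

# Toy 5-token vocabulary; smoothing_factor in the 1.5 to 2.5 range suggested above.
rng = np.random.default_rng(0)
logits = np.array([4.0, 3.5, 2.0, 0.5, -1.0])
print(sample_token(logits, smoothing_factor=2.0, rng=rng))
```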
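
Regarding the "llama_HF" note for text-generation-webui: that loader expects the Hugging Face tokenizer/config files to sit in the same folder as the GGUF. One possible way to fetch them with the huggingface_hub library is sketched below; the repo id is a placeholder (substitute the actual source repo from the collection linked above), and the exact file list may vary per model.

```python
from huggingface_hub import hf_hub_download

# Placeholder repo id -- substitute the actual source (non-GGUF) repo
# for this model, found in the collection linked in the README above.
SOURCE_REPO = "DavidAU/your-source-model-repo"

# Config/tokenizer files typically placed next to the .gguf when using the
# llama_HF / llamacpp_HF loader (the list may differ by model).
FILES = [
    "config.json",
    "tokenizer_config.json",
    "tokenizer.json",
    "special_tokens_map.json",
]

for name in FILES:
    hf_hub_download(
        repo_id=SOURCE_REPO,
        filename=name,
        local_dir="text-generation-webui/models/my-model-gguf",
    )
```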
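
The "rep pen" option is the standard repetition penalty: logits of tokens already present in the context are scaled so they are less likely to be picked again. A minimal sketch of the usual formulation (divide positive logits by the penalty, multiply negative ones) follows; it is an illustration, not the exact code of any particular backend.

```python
import numpy as np

def apply_repetition_penalty(logits: np.ndarray, prev_tokens: list[int], penalty: float) -> np.ndarray:
    """Penalize tokens that already appeared in the context (illustrative)."""
    out = logits.copy()
    for t in set(prev_tokens):
        # Positive logits are divided, negative logits are multiplied,
        # so the token is pushed toward lower probability either way.
        out[t] = out[t] / penalty if out[t] > 0 else out[t] * penalty
    return out

# Example: penalize token ids 1 and 3 with rep pen 1.1 (range suggested above).
logits = np.array([2.0, 1.5, 0.0, -0.5, 1.0])
print(apply_repetition_penalty(logits, prev_tokens=[1, 3], penalty=1.1))
```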