DavidAU/L2-Psyonic-Cetacean-Ultra-Colossal-32B-GGUF


DavidAU committed on Nov 12

Commit 0c65e54 • 1 Parent(s): cbb7e45

Update README.md

Files changed (1)

README.md +27 -0


@@ -133,6 +133,33 @@ Quants:
 Please note for Q2k quant you may need to raise rep pen and lower temp to account for quality loss at this quant level.
 <B>Model Template:</B>
 This is a custom model, and requires ChatML OR Alpaca OR Vicuna template, but may work with other template(s) and has maximum context of 4k / 4096.

 Please note for Q2k quant you may need to raise rep pen and lower temp to account for quality loss at this quant level.
+<B>Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:</B>
+In "KoboldCpp", "oobabooga/text-generation-webui", or "Silly Tavern":
+Set the "Smoothing_factor" to a value between 1.5 and 2.5
+: in KoboldCpp -> Settings->Samplers->Advanced-> "Smooth_F"
+: in text-generation-webui -> parameters -> lower right.
+: In Silly Tavern this is called: "Smoothing"
+NOTE: For "text-generation-webui"
+-> if using GGUFs you need to use the "llamacpp_HF" loader (which involves downloading some config files from the SOURCE version of this model)
+Source versions (and config files) of my models are here:
+https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be
+OTHER OPTIONS:
+- Increase rep pen to between 1.1 and 1.15 (you don't need to do this if you use "smoothing_factor")
+- If the interface/program you are using to run AI MODELS supports "Quadratic Sampling" ("smoothing"), just make the adjustment as noted.
 <B>Model Template:</B>
 This is a custom model, and requires ChatML OR Alpaca OR Vicuna template, but may work with other template(s) and has maximum context of 4k / 4096.
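
For readers wondering what the "Smoothing_factor" / "Quadratic Sampling" setting in this change does to token probabilities, here is a minimal Python sketch. It assumes the commonly described form of the transform, in which each logit is replaced by the top logit minus the smoothing factor times the squared distance from the top logit. The function names `quadratic_smooth` and `softmax` are hypothetical and are not taken from KoboldCpp, text-generation-webui, or Silly Tavern; the actual implementations may differ in detail.

```python
import math


def quadratic_smooth(logits, smoothing_factor):
    """Assumed form of quadratic sampling: keep the top logit fixed and
    penalize every other logit by smoothing_factor * (distance to top)^2."""
    max_logit = max(logits)
    return [max_logit - smoothing_factor * (x - max_logit) ** 2 for x in logits]


def softmax(logits):
    """Convert logits to probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative logits only: a top token, a close runner-up, and an unlikely token.
raw = [2.0, 1.5, -3.0]
smoothed = quadratic_smooth(raw, smoothing_factor=2.0)

probs_before = softmax(raw)
probs_after = softmax(smoothed)
```

Under this assumed form, the unlikely token is pushed far down while tokens close to the top one are affected much less, which is consistent with the README's remark that a repetition-penalty increase is not needed when smoothing is used.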