DavidAU committed 96afb32 (parent: ec5e60b): Update README.md

Files changed (1): README.md (+27 -0)

A regen will usually correct any issues.
Some of the "censorship" of the original Llama 3.2 3B Instruct model is still present in this model.

For some generations (i.e., to get it to "swear") you may need to regen it 1-2+ times to get the model to "obey".
<B>Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:</B>

In "KoboldCpp", "oobabooga/text-generation-webui", or "Silly Tavern":

Set the "Smoothing_factor" to 1.5 to 2.5.

: in KoboldCpp -> Settings -> Samplers -> Advanced -> "Smooth_F"

: in text-generation-webui -> Parameters -> lower right.

: in Silly Tavern this is called "Smoothing"
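As an illustration of what "Smoothing_factor" does, here is a minimal Python sketch of the quadratic ("smoothing") transform these frontends apply to the raw logits before sampling. This is a sketch, not the exact code of any of those programs, and the function names are mine: logits near the top barely move (so the top few choices become more even), while logits far from the top are pushed down hard.

```python
import math

def quadratic_smoothing(logits, smoothing_factor=1.8):
    """Pull every logit toward the current maximum by a squared-distance
    penalty: the top logit is unchanged, and each other logit drops by
    smoothing_factor * (distance from the top)**2."""
    max_logit = max(logits)
    return [max_logit - smoothing_factor * (x - max_logit) ** 2 for x in logits]

def softmax(logits):
    """Convert logits to sampling probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Example: with smoothing_factor=1.5, [2.0, 1.0, 0.0, -1.0]
# becomes [2.0, 0.5, -4.0, -11.5] -- the runner-up stays in play,
# the tail tokens become very unlikely.
raw = [2.0, 1.0, 0.0, -1.0]
smoothed = quadratic_smoothing(raw, smoothing_factor=1.5)
probs = softmax(smoothed)
```

Raising the factor within the suggested 1.5-2.5 range cuts the tail more aggressively, which is why it can substitute for a rep-pen increase.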
NOTE: For "text-generation-webui", if you are using GGUFs you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model).

Source versions (and config files) of my models are here:

https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be
OTHER OPTIONS:

- Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor").

- If the interface/program you are using to run AI models supports "Quadratic Sampling" ("smoothing"), just make the adjustment as noted.
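For the rep-pen option above, the standard repetition penalty (the CTRL-style penalty that llama.cpp-based frontends implement) makes every token already seen in the context less likely: a seen token's logit is divided by the penalty when positive and multiplied by it when negative. A minimal sketch, with a function name of my own choosing:

```python
def apply_rep_pen(logits, prev_tokens, penalty=1.12):
    """Penalize tokens that already appeared in the context.

    Dividing a positive logit (or multiplying a negative one) by the
    penalty lowers that token's probability either way, discouraging
    verbatim repetition."""
    out = list(logits)
    for t in set(prev_tokens):
        if out[t] > 0:
            out[t] /= penalty
        else:
            out[t] *= penalty
    return out

# Example: tokens 0 and 1 were already generated, so their logits
# are pushed down; token 2 is untouched.
adjusted = apply_rep_pen([2.0, -1.0, 0.5], prev_tokens=[0, 1], penalty=2.0)
```

Values of 1.1-1.15, as suggested, nudge the model away from loops without distorting the distribution much; smoothing_factor attacks the same symptom from the other direction, which is why you rarely need both.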
<B>Model Template:</B>

This is a LLAMA3 model and requires the Llama3 template, but it may work with other template(s). It has a maximum context of 128k / 131072 tokens.
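For reference, a single-turn prompt in the Llama3 instruct template looks like the string assembled below. The special tokens are Llama3's own; the helper function name is mine:

```python
def llama3_prompt(system, user):
    """Build a single-turn prompt in the Llama3 instruct format:
    a system block, a user block, then an open assistant header for
    the model to continue from."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

# Example usage:
prompt = llama3_prompt("You are a helpful assistant.", "Write a short poem.")
```

Most frontends apply this template automatically when you select "Llama 3" (or read it from the GGUF metadata), so you only need to build it by hand for raw completion-style use.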