parameters guide
samplers guide
model generation
role play settings
quant selection
arm quants
iq quants vs q quants
optimal model setting
gibberish fixes
coherence
instruction following
quality generation
chat settings
quality settings
llamacpp server
llamacpp
lmstudio
sillytavern
koboldcpp
backyard
ollama
model generation steering
steering
model generation fixes
text generation webui
ggufs
exl2
full precision
quants
imatrix
neo imatrix
Update README.md
README.md CHANGED
@@ -580,19 +580,37 @@ However, you should also check / test operation of:
 a] Affects per token generation:

 - top_a
-- epsilon_cutoff
-- eta_cutoff
-- no_repeat_ngram_size
+- epsilon_cutoff - see note 4.
+- eta_cutoff - see note 4.
+- no_repeat_ngram_size - see note 1.

 b] Affects generation including phrase, sentence, paragraph and entire generation:

-- no_repeat_ngram_size
-- encoder_repetition_penalty
-- guidance_scale (with "Negative prompt") => this is like a pre-prompt/system role prompt.
+- no_repeat_ngram_size - see note 1.
+- encoder_repetition_penalty ("hallucinations filter") - see note 2.
+- guidance_scale (with "Negative prompt") => this is like a pre-prompt/system role prompt - see note 3.
 - Disabling the BOS token can make the replies more creative.
 - Custom stopping strings

-Note:
+Note 1:
+
+"no_repeat_ngram_size" appears in both lists because, depending on settings, it can act per token or per phrase. It can also drastically affect sentence, paragraph and overall flow of the output.
+
+Note 2:
+
+Set below 1, this parameter causes the model to "jump" around a lot more; set above 1, it causes the model to focus more on the immediate surroundings.
+
+If the model is crafting a "scene", a setting below 1 makes it jump around the room, outside, and so on; a setting above 1 focuses it on the moment, the immediate surroundings, the POV character and the details of the setting.
+
+Note 3:
+
+This is a powerful way to send instructions/directives to the model on how to process your prompt(s) each time. See [ https://arxiv.org/pdf/2306.17806 ]
+
+Note 4:
+
+These control the selection of tokens, in some cases providing more relevant choices and/or more options. See [ https://arxiv.org/pdf/2210.15191 ]


 <B>MAIN ADVANCED SAMPLERS (affects per token AND overall generation): </B>
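The mechanics behind note 4 can be sketched in plain Python. This is an illustrative sketch of epsilon and eta truncation as described in the linked paper (arXiv:2210.15191), not any backend's actual code; the function names and the greedy fallback are assumptions.

```python
import math

def epsilon_cutoff(probs, epsilon):
    """Epsilon sampling sketch: zero out tokens whose probability is
    below the cutoff, then renormalize the survivors."""
    kept = [p if p >= epsilon else 0.0 for p in probs]
    total = sum(kept)
    if total == 0.0:
        # Illustrative fallback: if everything was cut, keep only the
        # single most likely token (effectively greedy).
        best = max(range(len(probs)), key=probs.__getitem__)
        kept = [0.0] * len(probs)
        kept[best] = 1.0
        return kept
    return [p / total for p in kept]

def eta_cutoff(probs, eta):
    """Eta sampling sketch: the cutoff adapts to the entropy of the
    distribution -- min(eta, sqrt(eta) * exp(-entropy)) -- so flatter
    (higher-entropy) distributions are truncated less aggressively."""
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    cutoff = min(eta, math.sqrt(eta) * math.exp(-entropy))
    return epsilon_cutoff(probs, cutoff)
```

With `probs = [0.5, 0.3, 0.15, 0.04, 0.01]` and `epsilon = 0.05`, the two tail tokens are removed and the remaining three are renormalized; this is why these settings can trade breadth of token choice for relevance.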
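For note 1, the usual mechanism behind "no_repeat_ngram_size" is to ban, at each step, any token that would complete an n-gram already present in the context; because a ban fires on a single token but is triggered by a multi-token window, it acts both per token and per phrase. A minimal sketch (function name is illustrative):

```python
def banned_tokens(seq, n):
    """Return the set of next tokens that would complete an n-gram
    already present in `seq`: find every earlier occurrence of the
    last n-1 tokens and ban each token that followed it."""
    if n <= 0 or len(seq) < n:
        return set()
    prefix = tuple(seq[-(n - 1):]) if n > 1 else ()
    banned = set()
    for i in range(len(seq) - n + 1):
        if tuple(seq[i:i + n - 1]) == prefix:
            banned.add(seq[i + n - 1])
    return banned
```

For example, with `seq = [1, 2, 3, 1, 2]` and `n = 3`, token `3` is banned, because emitting it would repeat the trigram `(1, 2, 3)`. Banning a mid-sentence word this way is what can derail sentence and paragraph flow when the value is set too low.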
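For note 3, "guidance_scale" applies classifier-free guidance (arXiv:2306.17806): the model is evaluated twice, once with the prompt and once with the negative prompt, and the two sets of logits are blended before sampling. A sketch of the blending step only (names are illustrative; real backends operate on tensors):

```python
def apply_cfg(cond_logits, uncond_logits, guidance_scale):
    """Classifier-free guidance sketch: push the conditional logits
    away from the unconditional (negative-prompt) logits by
    `guidance_scale`. A scale of 1.0 leaves them unchanged; larger
    values amplify whatever distinguishes the prompt from the
    negative prompt."""
    return [u + guidance_scale * (c - u)
            for c, u in zip(cond_logits, uncond_logits)]
```

This is why the negative prompt behaves like a standing pre-prompt/system directive: its influence is re-applied at every generation step rather than only at the start.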
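The "hallucinations filter" behaviour in note 2 comes from rescaling the logits of tokens that appear in the input text: values above 1 pull the output toward the source material, values below 1 push it away. A sketch under the assumption that it mirrors the standard repetition-penalty rule applied with `1/penalty` over the input token ids (as in common implementations); the function name is illustrative:

```python
def encoder_repetition_penalty(logits, input_token_ids, penalty):
    """Rescale logits of tokens present in the prompt/input.
    penalty > 1 boosts them (stay close to the source text);
    penalty < 1 suppresses them (wander further from it)."""
    inv = 1.0 / penalty
    out = list(logits)
    for t in set(input_token_ids):
        # Standard repetition-penalty rule, applied with 1/penalty:
        # positive logits are divided, negative logits multiplied.
        out[t] = out[t] / inv if out[t] > 0 else out[t] * inv
    return out
```

With `penalty = 2.0`, a prompt token's logit of `1.0` becomes `2.0` (boosted), which is the "focus on the immediate surroundings" effect; with `penalty = 0.5` the same logit drops to `0.5`, producing the "jump around" effect described above.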