DavidAU committed · Commit 807d83b (verified) · Parent(s): 6baeadc

Update README.md

Files changed (1): README.md (+55 −20)

README.md CHANGED
@@ -166,7 +166,9 @@ Then test "at temp" to see the MODELS in action. (5-10 generations recommended)
 PENALTY SAMPLERS:
 ------------------------------------------------------------------------------
 
---repeat-last-n N last n tokens to consider for penalize (default: 64, 0 = disabled, -1 = ctx_size)
+--repeat-last-n N
+
+last n tokens to consider for penalize (default: 64, 0 = disabled, -1 = ctx_size)
 ("repetition_penalty_range" in oobabooga/text-generation-webui , "rp_range" in kobold)
 
 THIS IS CRITICAL. Set too high, you can get all kinds of issues (repeated words, sentences, paragraphs or "gibberish"), especially with class 3 or 4 models.
@@ -174,7 +176,9 @@ THIS IS CRITICAL. Too high you can get all kinds of issues (repeat words, senten
 This setting also works in conjunction with all other "rep pens" below.
 
 
---repeat-penalty N penalize repeat sequence of tokens (default: 1.0, 1.0 = disabled)
+--repeat-penalty N
+
+penalize repeat sequence of tokens (default: 1.0, 1.0 = disabled)
 (commonly called "rep pen")
 
 Generally this is set from 1.0 to 1.15 ; smallest increments are best, IE: 1.01... 1.02 or even 1.001... 1.002.
@@ -182,7 +186,9 @@ Generally this is set from 1.0 to 1.15 ; smallest increments are best IE: 1.01..
 This affects creativity of the model overall, not just how words are penalized.
 
 
---presence-penalty N repeat alpha presence penalty (default: 0.0, 0.0 = disabled)
+--presence-penalty N
+
+repeat alpha presence penalty (default: 0.0, 0.0 = disabled)
 
 Generally leave this at zero IF repeat-last-n is 256 or less. You may want to use this for higher repeat-last-n settings.
 
@@ -191,7 +197,9 @@ CLASS 3: 0.05 may assist generation BUT SET "--repeat-last-n" to 512 or less. Be
 CLASS 4: 0.1 to 0.25 may assist generation BUT SET "--repeat-last-n" to 64
 
 
---frequency-penalty N repeat alpha frequency penalty (default: 0.0, 0.0 = disabled)
+--frequency-penalty N
+
+repeat alpha frequency penalty (default: 0.0, 0.0 = disabled)
 
 Generally leave this at zero IF repeat-last-n is 512 or less. You may want to use this for higher repeat-last-n settings.
 
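As a quick illustration of the four penalty settings above, a conservative starting point might look like this (a minimal sketch, not part of this commit; the `llama-cli` binary name, model path and prompt are placeholders):

```bash
# A sketch: conservative penalty setup per the notes above.
# - repeat-last-n 256: moderate penalty window
# - repeat-penalty 1.05: gentle "rep pen", adjusted in small increments
# - presence/frequency penalties left at 0.0 since repeat-last-n is <= 256
./llama-cli -m model.gguf -p "Write me a short story." \
  --repeat-last-n 256 \
  --repeat-penalty 1.05 \
  --presence-penalty 0.0 \
  --frequency-penalty 0.0
```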
@@ -208,24 +216,33 @@ SECONDARY SAMPLERS / FILTERS:
 ------------------------------------------------------------------------------
 
 
---tfs N tail free sampling, parameter z (default: 1.0, 1.0 = disabled)
+--tfs N
+
+tail free sampling, parameter z (default: 1.0, 1.0 = disabled)
 
 Tries to detect a tail of low-probability tokens in the distribution and removes those tokens. The closer to 0, the more discarded tokens.
 ( https://www.trentonbricken.com/Tail-Free-Sampling/ )
 
 
---typical N locally typical sampling, parameter p (default: 1.0, 1.0 = disabled)
+--typical N
+
+locally typical sampling, parameter p (default: 1.0, 1.0 = disabled)
 
 If not set to 1, select only tokens that are at least this much more likely to appear than random tokens, given the prior text.
 
 
---mirostat N use Mirostat sampling.
-"Top K", "Nucleus", "Tail Free" (TFS) and "Locally Typical" (TYPICAL) samplers are ignored if used.
+--mirostat N
+
+use Mirostat sampling. "Top K", "Nucleus", "Tail Free" (TFS) and "Locally Typical" (TYPICAL) samplers are ignored if used.
 (default: 0, 0 = disabled, 1 = Mirostat, 2 = Mirostat 2.0)
 
---mirostat-lr N Mirostat learning rate, parameter eta (default: 0.1) " mirostat_eta "
+--mirostat-lr N
+
+Mirostat learning rate, parameter eta (default: 0.1) " mirostat_eta "
 
---mirostat-ent N Mirostat target entropy, parameter tau (default: 5.0) " mirostat_tau "
+--mirostat-ent N
+
+Mirostat target entropy, parameter tau (default: 5.0) " mirostat_tau "
 
 Activates the Mirostat sampling technique. It aims to control perplexity during sampling. See the paper. (https://arxiv.org/abs/2007.14966)
 
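A minimal Mirostat sketch using the documented defaults (binary, model and prompt are placeholders); remember that Top K, Nucleus, TFS and Typical are ignored while Mirostat is active:

```bash
# A sketch: Mirostat 2 with the documented defaults
# (learning rate eta = 0.1, target entropy tau = 5.0).
./llama-cli -m model.gguf -p "Write me a short story." \
  --mirostat 2 \
  --mirostat-lr 0.1 \
  --mirostat-ent 5.0
```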
@@ -244,8 +261,13 @@ For Class 3 models it is suggested to use this to assist with generation (min se
 For Class 4 models it is highly recommended with Mirostat 1 or 2 + mirostat-ent @ 6 to 8 and mirostat-lr at .1 to .5
 
 
---dynatemp-range N dynamic temperature range (default: 0.0, 0.0 = disabled)
---dynatemp-exp N dynamic temperature exponent (default: 1.0)
+--dynatemp-range N
+
+dynamic temperature range (default: 0.0, 0.0 = disabled)
+
+--dynatemp-exp N
+
+dynamic temperature exponent (default: 1.0)
 
 In: oobabooga/text-generation-webui (has on/off, and high / low) :
 
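A minimal dynamic-temperature sketch (placeholders as before; the base `--temp` value is an arbitrary example):

```bash
# A sketch: dynamic temperature varying around a base --temp of 0.8.
# With --dynatemp-range 0.25 the effective temperature can move
# within roughly 0.55 to 1.05; --dynatemp-exp 1.0 is the default curve.
./llama-cli -m model.gguf -p "Write me a short story." \
  --temp 0.8 \
  --dynatemp-range 0.25 \
  --dynatemp-exp 1.0
```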
@@ -268,11 +290,15 @@ To set manually (IE: Api, lmstudio, etc) using "range" and "exp" ; this is a bit
 This is both an enhancement and in some ways fixes issues in a model when too little temp (or too much/too much of the same) affects generation.
 
 
---xtc-probability N xtc probability (default: 0.0, 0.0 = disabled)
+--xtc-probability N
+
+xtc probability (default: 0.0, 0.0 = disabled)
 
 Probability that the removal will actually happen. 0 disables the sampler. 1 makes it always happen.
 
---xtc-threshold N xtc threshold (default: 0.1, 1.0 = disabled)
+--xtc-threshold N
+
+xtc threshold (default: 0.1, 1.0 = disabled)
 
 If 2 or more tokens have probability above this threshold, consider removing all but the last one.
 
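A minimal XTC sketch (placeholders as before):

```bash
# A sketch: apply XTC on about half of all token picks; when two or more
# tokens sit above probability 0.1, all but the last are candidates for removal.
./llama-cli -m model.gguf -p "Write me a short story." \
  --xtc-probability 0.5 \
  --xtc-threshold 0.1
```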
@@ -281,7 +307,9 @@ Suggest you experiment with this one, with other advanced samplers disabled to s
 
 
 
--l, --logit-bias TOKEN_ID(+/-)BIAS modifies the likelihood of token appearing in the completion,
+-l, --logit-bias TOKEN_ID(+/-)BIAS
+
+modifies the likelihood of token appearing in the completion,
 i.e. `--logit-bias 15043+1` to increase likelihood of token ' Hello',
 or `--logit-bias 15043-1` to decrease likelihood of token ' Hello'
 
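The same logit-bias example as a full command (placeholders as before; token id 15043 / ' Hello' comes from the text above and is tokenizer-specific):

```bash
# A sketch: make the token ' Hello' (id 15043 in the example above)
# less likely to appear in the completion.
./llama-cli -m model.gguf -p "Greet me." --logit-bias 15043-1
```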
@@ -301,14 +329,21 @@ OTHER:
 ------------------------------------------------------------------------------
 
 
--s, --seed SEED RNG seed (default: -1, use random seed for -1)
+-s, --seed SEED
+
+RNG seed (default: -1, use random seed for -1)
+
+--samplers SAMPLERS
+
+samplers that will be used for generation in the order, separated by ';' (default: top_k;tfs_z;typ_p;top_p;min_p;xtc;temperature)
+
+--sampling-seq SEQUENCE
 
---samplers SAMPLERS samplers that will be used for generation in the order, separated by ';'
-(default: top_k;tfs_z;typ_p;top_p;min_p;xtc;temperature)
+simplified sequence for samplers that will be used (default: kfypmxt)
 
---sampling-seq SEQUENCE simplified sequence for samplers that will be used (default: kfypmxt)
+--ignore-eos
 
---ignore-eos ignore end of stream token and continue generating (implies --logit-bias EOS-inf)
+ignore end of stream token and continue generating (implies --logit-bias EOS-inf)
 
 
 ------------------------------------------------------------------------------
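A minimal sketch combining the "OTHER" flags (placeholders as before; the sampler chain shown just restates the default order):

```bash
# A sketch: a fixed seed for reproducible generations, with the sampler
# chain spelled out explicitly (identical to the stated default order).
./llama-cli -m model.gguf -p "Write me a short story." \
  --seed 1234 \
  --samplers "top_k;tfs_z;typ_p;top_p;min_p;xtc;temperature"
```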
 