Commit 69f1f49
1 Parent(s): aa0ac4c
Update README.md
README.md CHANGED
@@ -7,6 +7,7 @@ Experimental quantization.
 Working inference code (regular inference with autogptq does not work without return_token_type_ids=False, didn't get it to work with textgen-webui):
 
 from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig
+
 from transformers import AutoTokenizer, TextGenerationPipeline
 
 
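
The hunk only shows the imports, so the following is a minimal sketch of how the `return_token_type_ids=False` workaround mentioned in the README might be applied, not the author's actual inference code. The function names `generate_text` and `strip_token_type_ids`, the `model_dir` argument, and the default device and token budget are all assumptions made for illustration; the diff does not name the model path or any generation settings.

```python
def strip_token_type_ids(encoded):
    """Return a copy of a tokenizer output mapping without 'token_type_ids'.

    Hypothetical helper: useful when the tokenizer call cannot be changed,
    so the key has to be dropped after the fact instead.
    """
    return {key: value for key, value in encoded.items() if key != "token_type_ids"}


def generate_text(model_dir, prompt, device="cuda:0", max_new_tokens=64):
    """Load a GPTQ-quantized model from model_dir and generate a continuation.

    model_dir is a placeholder for wherever the quantized weights live;
    depending on the checkpoint, from_quantized may need extra arguments
    (for example use_safetensors) that are not shown in the README hunk.
    """
    # Imports are deferred so that defining this function does not require
    # auto_gptq or transformers to be installed.
    from auto_gptq import AutoGPTQForCausalLM
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoGPTQForCausalLM.from_quantized(model_dir, device=device)

    # Ask the tokenizer not to emit token_type_ids, as the README note requires.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        return_token_type_ids=False,
    ).to(device)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("path/to/quantized-model", "Hello")` would need a local checkpoint and a CUDA device. `strip_token_type_ids` is the same idea applied to an already-tokenized batch.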