sakuraumi/Sakura-13B-Galgame

sakuraumi committed on Aug 26, 2023

Commit 5f49373 · 1 Parent(s): 08b9e21

Update README.md

Files changed (1)

README.md +4 -0

@@ -61,6 +61,10 @@ prompt = "Human: \n" + query + "\n\nAssistant: \n"
 | max new token | 512 |
 | min new token | 1 |
+- Quantization:
+Add the parameter `load_in_8bit=True` or `load_in_4bit=True` in `model.generate()`
 The rest of the inference procedure is the same as for LLaMA2
 # Fine-tuning
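
A note on the quantization line added in this commit: in Hugging Face `transformers`, the `load_in_8bit` and `load_in_4bit` flags are, as far as I know, accepted when the model is loaded with `from_pretrained()`, not by `model.generate()`. The following is a minimal sketch of that reading, not the author's confirmed method. It assumes `transformers`, `accelerate`, and `bitsandbytes` are installed and that the checkpoint loads through `AutoModelForCausalLM` (the README only states that inference otherwise matches LLaMA2). The helper names `build_prompt`, `quantization_kwargs`, and `load_and_generate` are hypothetical; the prompt template and the `max new token` / `min new token` values are taken from the diff above.

```python
def build_prompt(query: str) -> str:
    # Prompt template quoted in the hunk header of this diff.
    return "Human: \n" + query + "\n\nAssistant: \n"


def quantization_kwargs(bits=None) -> dict:
    # Hypothetical helper: map a bit width to the loading flag named in the README.
    if bits is None:
        return {}
    if bits == 8:
        return {"load_in_8bit": True}
    if bits == 4:
        return {"load_in_4bit": True}
    raise ValueError("bits must be None, 4, or 8")


def load_and_generate(model_id: str, query: str, bits=8) -> str:
    # Imported lazily so the helpers above can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: the quantization flag is passed at load time, not to generate().
    model = AutoModelForCausalLM.from_pretrained(
        model_id, device_map="auto", **quantization_kwargs(bits)
    )
    inputs = tokenizer(build_prompt(query), return_tensors="pt").to(model.device)
    # Generation limits from the README table: max new token 512, min new token 1.
    output = model.generate(**inputs, max_new_tokens=512, min_new_tokens=1)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

A call would look like `load_and_generate("sakuraumi/Sakura-13B-Galgame", text, bits=4)`. Newer `transformers` releases prefer passing a `BitsAndBytesConfig` through the `quantization_config` argument of `from_pretrained()` instead of the bare flags, so the helper may need adjusting depending on the installed version.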