Langboat/bloom-800m-zh


wangyulong committed on Aug 31, 2022

Commit 1f63158 · 1 Parent(s): e34c6a3

Update README.md

Files changed (1)

README.md +12 -3

@@ -4,11 +4,20 @@ language:
 - zh
 pipeline_tag: text-generation
 widget:
-- text: "生命、宇宙以及一切的终极答案是"
+- text: "中国的首都是"
 ---
-The model is based on [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1).
-To reduce GPU memory usage, we pruned its vocabulary from 250880 to 46145 with Chinese corpus. So the total parameter is 800m now.
+This model is based on [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1).
+We pruned its vocabulary from 250880 to 46145 with Chinese corpus to reduce GPU memory usage. So the total parameter is 389m now.
+# How to use
+```python
+from transformers import BloomTokenizerFast, BloomForCausalLM
+tokenizer = BloomTokenizerFast.from_pretrained('Langboat/bloom-800m-zh')
+model = BloomForCausalLM.from_pretrained('Langboat/bloom-800m-zh')
+print(tokenizer.batch_decode(model.generate(tokenizer.encode('中国的首都是', return_tensors='pt'))))
+```
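
As a rough check on the vocabulary pruning described in the README change, the sketch below estimates how many embedding parameters are dropped when the vocabulary shrinks from 250880 to 46145 entries. It assumes a hidden size of 1536 for bloom-1b1 and that the input and output embeddings share one weight matrix, so each removed token is counted once. Neither assumption is stated in the commit, and the function name is hypothetical.

```python
def embedding_params_saved(old_vocab: int, new_vocab: int, hidden_size: int) -> int:
    """Parameters removed from a (vocab x hidden) embedding matrix by pruning rows."""
    return (old_vocab - new_vocab) * hidden_size


# Vocabulary sizes are taken from the README text in this commit.
OLD_VOCAB = 250880
NEW_VOCAB = 46145
# Assumption: hidden size of bigscience/bloom-1b1 (not stated in the commit).
HIDDEN_SIZE = 1536

saved = embedding_params_saved(OLD_VOCAB, NEW_VOCAB, HIDDEN_SIZE)
print(f"Embedding parameters removed: {saved:,}")
# Assumption: 2 bytes per parameter (fp16 weights).
print(f"Approximate fp16 memory saved: {saved * 2 / 1e6:.1f} MB")
```

This covers only the embedding matrix. The transformer layers are untouched by vocabulary pruning, so the estimate says nothing about the total parameter count quoted in the README.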