jed351
/

gpt2_base_zh-hk-lihkg

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jed351 commited on Feb 14, 2023

Commit

391f513

•

1 Parent(s): 903e918

Update README.md

Files changed (1) hide show

README.md +14 -3

README.md CHANGED Viewed

@@ -35,7 +35,19 @@ The tool can be found [here](https://github.com/ayaka14732/lihkg-scraper).
 Please also check out the [Bart model](https://huggingface.co/Ayaka/bart-base-cantonese) created by her.
-## Training procedure
 Please refer to the [script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling)
 provided by Huggingface.
@@ -44,8 +56,6 @@ provided by Huggingface.
 The model was trained for 400,000 steps with batch size 5 (~2epoches) on 2 NVIDIA Quadro RTX6000 for around 40 hours at the Research Computing Services of Imperial College London.
 ### How to use it?
 ```
 from transformers import AutoTokenizer
@@ -62,6 +72,7 @@ string = output[0]['generated_text'].replace(' ', '')
 print(string)
 ```
 ### Framework versions
 - Transformers 4.26.0.dev0

 Please also check out the [Bart model](https://huggingface.co/Ayaka/bart-base-cantonese) created by her.
+### Limitations
+The model was trained on ~10GB of data scrapped from LIHKG.
+It might contain violent and rude languages so as the text generated by the model.
+Please do not use it for anything other than research or entertainment.
+The comments on LIHKG also tend to be very short.
+Thus the model cannot generate anything more than a line. In a lot of occasions might not even generate new tokens.
+### Training procedure
 Please refer to the [script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling)
 provided by Huggingface.
 The model was trained for 400,000 steps with batch size 5 (~2epoches) on 2 NVIDIA Quadro RTX6000 for around 40 hours at the Research Computing Services of Imperial College London.
 ### How to use it?
 ```
 from transformers import AutoTokenizer
 print(string)
 ```
 ### Framework versions
 - Transformers 4.26.0.dev0