FredZhang7 commited on
Commit
4e4ce90
1 Parent(s): 10bf5ca

Complete data preprocessing description

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -21,9 +21,9 @@ datasets:
21
 
22
  ## Fast Anime PromptGen
23
 
24
- The main model `pytorch_model.bin` is trained on **80K** anime tags, all with **up_score ≥ 8** and without the [blacklisted tags](./blacklist.txt), fetched from the [Safebooru API endpoint](https://safebooru.donmai.us/posts/random.json).
25
- I didn't release the V1 model because it only generated gibberish prompts. After trying all means to correct that behavior, I eventually figured that the cause of the gibberish prompts is not from the model or training duration, but rather from the random usernames present in the training data.
26
- Here's the complete [prompt preprocessing](./preprocess.py).
27
 
28
 
29
  Todo:
 
21
 
22
  ## Fast Anime PromptGen
23
 
24
+ The main model (`pytorch_model.bin`) is trained on **80K** anime tags, all with **up_score ≥ 8** and without the [blacklisted tags](./blacklist.txt), fetched from the [Safebooru API endpoint](https://safebooru.donmai.us/posts/random.json).
25
+ I didn't release the V1 model because it only generated gibberish prompts. After trying all means to correct that behavior, I eventually figured that the cause of the gibberish prompts is not from the model or training duration, but rather from the random usernames in the training data.
26
+ Here's the complete [prompt preprocessing algorithm](./preprocess.py).
27
 
28
 
29
  Todo: