Update README.md
Browse files
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
license: creativeml-openrail-m
|
3 |
---
|
4 |
|
5 |
-
Trigger words
|
6 |
|
7 |
```
|
8 |
Anisphia, Euphyllia, Tilty, OyamaMahiro, OyamaMihari
|
@@ -14,7 +14,7 @@ For `0324_all_aniscreen_tags`, I accidentally tag all the character images with
|
|
14 |
For `0325_aniscreen_fanart_styles`, things are done correctly (anime screenshots tagged as `aniscreen`, fanart tagged as `fanart`).
|
15 |
|
16 |
|
17 |
-
|
18 |
|
19 |
Default settings are
|
20 |
- loha net dim 8, conv dim 4, alpha 1
|
@@ -28,17 +28,28 @@ The configuration json files can otherwsie be found in the `config` subdirectori
|
|
28 |
However, some experiments concern the effect of tags for which I regenerate the txt file and the difference can not be seen from the configuration file in this case.
|
29 |
For now this concerns `05tag` for which tags are only used with probability 0.5.
|
30 |
|
31 |
-
|
32 |
|
33 |
For a thorough comparaison please refer to the `generated_samples` folder.
|
34 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
35 |
1. I barely see any difference for training at clip skip 1 and 2.
|
36 |
2. Setting text encoder learning rate to be half of that of unet makes training two times slower while I cannot see how it helps.
|
37 |
3. The difference between lora, locon, and loha are very subtle.
|
38 |
4. Training at higher resolution helps generating more complex backgrounds etc, but it is very time-consuming and most of the time it isn't worth it (simpler to just switch base model) unless this is exactly the goal of the lora you're training.
|
39 |
-
5. What I see that makes the biggest difference is in captioning. The common wisdom that we should prune anything that we want to be attach to the trigger word is exactly the way to go for. No tags at all is terrible, especially for style training. Having all the tags remove the traits from subjects if these tags are not used during sampling.
|
40 |
|
41 |
-
|
42 |
|
43 |
Here is the composition of the datasets
|
44 |
```
|
|
|
2 |
license: creativeml-openrail-m
|
3 |
---
|
4 |
|
5 |
+
### Trigger words
|
6 |
|
7 |
```
|
8 |
Anisphia, Euphyllia, Tilty, OyamaMahiro, OyamaMihari
|
|
|
14 |
For `0325_aniscreen_fanart_styles`, things are done correctly (anime screenshots tagged as `aniscreen`, fanart tagged as `fanart`).
|
15 |
|
16 |
|
17 |
+
### Settings
|
18 |
|
19 |
Default settings are
|
20 |
- loha net dim 8, conv dim 4, alpha 1
|
|
|
28 |
However, some experiments concern the effect of tags for which I regenerate the txt file and the difference can not be seen from the configuration file in this case.
|
29 |
For now this concerns `05tag` for which tags are only used with probability 0.5.
|
30 |
|
31 |
+
### Some observations
|
32 |
|
33 |
For a thorough comparaison please refer to the `generated_samples` folder.
|
34 |
|
35 |
+
#### Captioning
|
36 |
+
|
37 |
+
Dataset, in general, is the most important out of all.
|
38 |
+
The common wisdom that we should prune anything that we want to be attach to the trigger word is exactly the way to go for.
|
39 |
+
No tags at all is terrible, especially for style training.
|
40 |
+
Having all the tags remove the traits from subjects if these tags are not used during sampling (not completely true but more or less the case).
|
41 |
+
|
42 |
+
![00066-20230326090858](https://huggingface.co/alea31415/LyCORIS-experiments/resolve/main/generated_samples/00066-20230326090858.png)
|
43 |
+
|
44 |
+
|
45 |
+
#### Others
|
46 |
+
|
47 |
1. I barely see any difference for training at clip skip 1 and 2.
|
48 |
2. Setting text encoder learning rate to be half of that of unet makes training two times slower while I cannot see how it helps.
|
49 |
3. The difference between lora, locon, and loha are very subtle.
|
50 |
4. Training at higher resolution helps generating more complex backgrounds etc, but it is very time-consuming and most of the time it isn't worth it (simpler to just switch base model) unless this is exactly the goal of the lora you're training.
|
|
|
51 |
|
52 |
+
### Datasets
|
53 |
|
54 |
Here is the composition of the datasets
|
55 |
```
|