jimmycarter committed on
Commit
b5f5bfc
1 Parent(s): f40a179

Model card auto-generated by SimpleTuner

Files changed (1)
  1. README.md +381 -0
README.md ADDED
@@ -0,0 +1,381 @@
+ ---
+ license: other
+ base_model: "black-forest-labs/FLUX.1-dev"
+ tags:
+   - flux
+   - flux-diffusers
+   - text-to-image
+   - diffusers
+   - simpletuner
+   - not-for-all-audiences
+   - lora
+   - template:sd-lora
+   - lycoris
+ inference: true
+ widget:
+ - text: 'unconditional (blank prompt)'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_0_0.png
+ - text: 'In this scene from the animated series "Helluva Boss," Loona, the wolf-like receptionist of the Immediate Murder Professionals (I.M.P), is depicted leaning against a wall outside the office. She is casually engrossed in her phone, displaying her typical aloof and detached demeanor. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_1_0.png
+ - text: 'Loona shrugs with an exasperated expression, her red eyes wide and frustrated, as she seemingly questions or challenges something said in the I.M.P office. Still from Helluva boss. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_2_0.png
+ - text: 'A scene from the animated series "Helluva Boss," set in the office. Loona, the wolf-like receptionist with white fur, black-tipped ears, and red eyes, is seated on a couch, facing towards the viewer. Loona''s appearance is complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts. She holds a piece of paper that says,"Welcome to Losercity, jerks". In the background, the office has a striped wall pattern and visible damage on the ceiling, indicating a chaotic or rough environment. On the right side of the image, two imp characters appear to be engaged in conversation.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_3_0.png
+ - text: 'Loona from Helluva Boss is dressed in an oversized taco costume, looking visibly irritated and embarrassed. Her red eyes convey her annoyance as she crosses her arms and glares to the side. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_4_0.png
+ - text: 'Loona is standing next to Blitzo (Helluva boss)'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_5_0.png
+ - text: 'In this "Helluva Boss" scene, Loona, the wolf-like receptionist, stands in an elevator with a tense and irritated expression, her teeth bared in a snarl. Blitzø, the red demon with distinctive black and white horns, leans close and makes an adorable look, as if asking for a favor. The ornate elevator setting hints at a tense moment, possibly involving a challenging mission or conflict within the I.M.P team.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_6_0.png
+ - text: 'a 2D simple drawing of a madeleine cake, with a green cloud drawn next to it'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_7_0.png
+ - text: 'a 3D captivating YouTube thumbnail depicting of a full detailed,it''s on a party real people like, on front there is a giant pulling a nose of a black African real like lady down to size of elephant nose,be creative and unique'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_8_0.png
+ - text: 'Whiskers the cat. Whiskers becomes a mentor to other animals.Impressed by Whiskers'' intelligence, other animals in the neighborhood seek his guidance. Whiskers sets up a virtual learning platform using AI technology, where animals can ask questions, receive personalized lessons, and acquire knowledge in various subjects. Whiskers becomes a mentor, helping others unlock their potential.'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_9_0.png
+ - text: 'As the stock market fluctuates, the investor remains calm and collected at their desk, surrounded by charts and graphs. Their tailored suit and polished briefcase are a symbol of their expertise and experience in the world of finance. '
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_10_0.png
+ - text: 'loona from helluva boss is eating a donut'
+   parameters:
+     negative_prompt: 'blurry, cropped, ugly'
+   output:
+     url: ./assets/image_11_0.png
+ ---
+
+ # flux-training-losercity-next-lycoris14
+
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
+
+
+ The main validation prompt used during training was:
+
+
+
+ ```
+ loona from helluva boss is eating a donut
+ ```
+
+ ## Validation settings
+ - CFG: `3.5`
+ - CFG Rescale: `0.0`
+ - Steps: `15`
+ - Sampler: `None`
+ - Seed: `42`
+ - Resolution: `1024`
+
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
+
+ You can find some example images in the following gallery:
+
+
+ <Gallery />
+
+ The text encoder **was not** trained.
+ You may reuse the base model text encoder for inference.
+
+
+ ## Training settings
+
+ - Training epochs: 0
+ - Training steps: 100
+ - Learning rate: 3e-06
+ - Effective batch size: 16
+ - Micro-batch size: 1
+ - Gradient accumulation steps: 16
+ - Number of GPUs: 1
+ - Prediction type: flow-matching
+ - Rescaled betas zero SNR: False
+ - Optimizer: adamw_bf16
+ - Precision: Pure BF16
+ - Quantised: Yes: fp8-quanto
+ - Xformers: Not used
+ - LyCORIS Config:
+ ```json
+ {
+     "algo": "lokr",
+     "multiplier": 1.0,
+     "linear_dim": 1000000,
+     "linear_alpha": 1,
+     "factor": 10,
+     "full_matrix": true,
+     "apply_preset": {
+         "target_module": [
+             "FluxTransformerBlock",
+             "FluxSingleTransformerBlock"
+         ],
+         "name_algo_map": {
+             "transformer_blocks.[0-7].ff*": {
+                 "algo": "lokr",
+                 "factor": 4,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "transformer_blocks.[0-7]*": {
+                 "algo": "lokr",
+                 "factor": 8,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "transformer_blocks.[8-15].ff*": {
+                 "algo": "lokr",
+                 "factor": 6,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "transformer_blocks.[8-15]*": {
+                 "algo": "lokr",
+                 "factor": 12,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "transformer_blocks.[16-18].ff*": {
+                 "algo": "lokr",
+                 "factor": 12,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "transformer_blocks.[16-18]*": {
+                 "algo": "lokr",
+                 "factor": 24,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[0-15].ff*": {
+                 "algo": "lokr",
+                 "factor": 8,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[0-15]*": {
+                 "algo": "lokr",
+                 "factor": 16,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[16-23].ff*": {
+                 "algo": "lokr",
+                 "factor": 6,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[16-23]*": {
+                 "algo": "lokr",
+                 "factor": 12,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[24-37].ff*": {
+                 "algo": "lokr",
+                 "factor": 4,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             },
+             "single_transformer_blocks.[24-37]*": {
+                 "algo": "lokr",
+                 "factor": 8,
+                 "linear_dim": 1000000,
+                 "linear_alpha": 1,
+                 "full_matrix": true
+             }
+         },
+         "use_fnmatch": true
+     }
+ }
+ ```
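The `name_algo_map` keys in the LyCORIS config are glob patterns (`use_fnmatch` is true). As a rough illustration of how such patterns behave, the sketch below runs two of them through Python's standard `fnmatch` module. The module names are hypothetical examples, and choosing the first matching pattern is an assumption made for illustration; it is not taken from the LyCORIS source.

```python
from fnmatch import fnmatch

# Two of the patterns from the config, in the order they appear there.
PATTERN_FACTORS = {
    "transformer_blocks.[0-7].ff*": 4,
    "transformer_blocks.[0-7]*": 8,
}
DEFAULT_FACTOR = 10  # top-level "factor" in the config


def factor_for(module_name: str) -> int:
    """Return the factor of the first matching pattern (assumed rule)."""
    for pattern, factor in PATTERN_FACTORS.items():
        if fnmatch(module_name, pattern):
            return factor
    return DEFAULT_FACTOR


# Hypothetical module names, for illustration only.
for name in (
    "transformer_blocks.3.ff.net.0.proj",
    "transformer_blocks.3.attn.to_q",
    "proj_out",
):
    print(name, "->", factor_for(name))
```

In plain `fnmatch`, a bracket expression matches exactly one character, so it is worth checking how the multi-digit ranges in the config (for example `[8-15]`) are interpreted in practice.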
+
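The batch figures listed under Training settings are related by simple arithmetic. A minimal sketch using only the numbers from that list (the variable names are illustrative, and it assumes each listed training step is one optimizer update on a full batch):

```python
# Values copied from the Training settings list.
micro_batch_size = 1
gradient_accumulation_steps = 16
num_gpus = 1
training_steps = 100

# Samples contributing to each optimizer update.
effective_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus

# Total samples consumed over the run, under the assumption stated above.
samples_seen = training_steps * effective_batch_size

print(effective_batch_size)  # 16
print(samples_seen)          # 1600
```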
+ ## Datasets
+
+ ### default_dataset_arb
+ - Repeats: 9999
+ - Total number of images: 41
+ - Total number of aspect buckets: 11
+ - Resolution: 1.33 megapixels
+ - Cropped: False
+ - Crop style: None
+ - Crop aspect: None
+ ### default_dataset_arb2
+ - Repeats: 9999
+ - Total number of images: 2565
+ - Total number of aspect buckets: 1
+ - Resolution: 1.33 megapixels
+ - Cropped: False
+ - Crop style: None
+ - Crop aspect: None
+ ### default_dataset_arb3
+ - Repeats: 9999
+ - Total number of images: 3220
+ - Total number of aspect buckets: 24
+ - Resolution: 1.33 megapixels
+ - Cropped: False
+ - Crop style: None
+ - Crop aspect: None
+ ### default_dataset
+ - Repeats: 9999
+ - Total number of images: 42
+ - Total number of aspect buckets: 1
+ - Resolution: 1.048576 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_512
+ - Repeats: 9999
+ - Total number of images: 42
+ - Total number of aspect buckets: 1
+ - Resolution: 0.262144 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_640
+ - Repeats: 9999
+ - Total number of images: 42
+ - Total number of aspect buckets: 1
+ - Resolution: 0.4096 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_768
+ - Repeats: 9999
+ - Total number of images: 42
+ - Total number of aspect buckets: 1
+ - Resolution: 0.589824 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_896
+ - Repeats: 9999
+ - Total number of images: 42
+ - Total number of aspect buckets: 1
+ - Resolution: 0.802816 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_uncaptioned
+ - Repeats: 9999
+ - Total number of images: 2565
+ - Total number of aspect buckets: 1
+ - Resolution: 1.048576 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_uncaptioned_512
+ - Repeats: 9999
+ - Total number of images: 2565
+ - Total number of aspect buckets: 1
+ - Resolution: 0.262144 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_art
+ - Repeats: 9999
+ - Total number of images: 2482
+ - Total number of aspect buckets: 1
+ - Resolution: 1.048576 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_art_512
+ - Repeats: 9999
+ - Total number of images: 3193
+ - Total number of aspect buckets: 1
+ - Resolution: 0.262144 megapixels
+ - Cropped: True
+ - Crop style: center
+ - Crop aspect: square
+ ### default_dataset_art_640
+ - Repeats: 9999
+ - Total number of images: 3115
+ - Total number of aspect buckets: 1
+ - Resolution: 0.4096 megapixels
+ - Cropped: True
+ - Crop style: random
+ - Crop aspect: square
+ ### default_dataset_art_768
+ - Repeats: 9999
+ - Total number of images: 2989
+ - Total number of aspect buckets: 1
+ - Resolution: 0.589824 megapixels
+ - Cropped: True
+ - Crop style: random
+ - Crop aspect: square
+ ### default_dataset_art_896
+ - Repeats: 9999
+ - Total number of images: 2787
+ - Total number of aspect buckets: 1
+ - Resolution: 0.802816 megapixels
+ - Cropped: True
+ - Crop style: random
+ - Crop aspect: square
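The fractional megapixel values listed for the square-cropped datasets correspond to square pixel areas. As a small check, assuming a megapixel here means 1,000,000 pixels (which is how the listed figures work out), the edge length can be recovered as follows; `square_edge` is a helper name introduced only for this sketch.

```python
import math


def square_edge(megapixels: float) -> int:
    """Return the edge length of a square image with the given area in megapixels."""
    pixels = round(megapixels * 1_000_000)
    edge = math.isqrt(pixels)
    if edge * edge != pixels:
        raise ValueError(f"{megapixels} MP is not a perfect-square pixel count")
    return edge


# Megapixel values listed for the square-cropped datasets.
for mp in (1.048576, 0.802816, 0.589824, 0.4096, 0.262144):
    edge = square_edge(mp)
    print(f"{mp} MP -> {edge}x{edge}")
```

The resulting edges (1024, 896, 768, 640, 512) line up with the numeric suffixes in the dataset names.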
+
+
+ ## Inference
+
+
+ ```python
+ import torch
+ from diffusers import DiffusionPipeline
+ from lycoris import create_lycoris_from_weights
+
+ model_id = 'black-forest-labs/FLUX.1-dev'
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
+ lora_scale = 1.0
+
+ # Load the base pipeline first; the adapter is merged into its transformer.
+ # bf16 matches the training precision listed above.
+ pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16)
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
+ wrapper.merge_to()
+
+ prompt = "loona from helluva boss is eating a donut"
+
+ # Prefer CUDA, then Apple MPS, then CPU.
+ device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
+ pipeline.to(device)
+ image = pipeline(
+     prompt=prompt,
+     num_inference_steps=15,
+     generator=torch.Generator(device=device).manual_seed(1641421826),
+     width=1024,
+     height=1024,
+     guidance_scale=3.5,
+ ).images[0]
+ image.save("output.png", format="PNG")
+ ```
+
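The inference snippet expects the adapter file to be available locally. One possible way to fetch it, sketched here with `hf_hub_download` from `huggingface_hub`, is shown below. The helper name `download_adapter` is introduced only for this sketch, and the repository ID in the usage comment is a placeholder, not the confirmed location of this adapter.

```python
def download_adapter(repo_id: str, filename: str = "pytorch_lora_weights.safetensors") -> str:
    """Download the adapter file from the Hub and return its local path."""
    # Imported lazily so the helper can be defined without the package installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename)


# Usage (placeholder repository ID; substitute the actual adapter repo):
# adapter_id = download_adapter("your-username/your-adapter-repo")
```

The returned path can then be passed as `adapter_id` to `create_lycoris_from_weights`.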