codehappy
/

puzzlebox-xl

Model card Files Files and versions Community

codehappy commited on Dec 12, 2024

Commit

4a2cfca

·

verified ·

1 Parent(s): fbdb5f1

Update README.md

Files changed (1) hide show

README.md +13 -0

README.md CHANGED Viewed

@@ -1,3 +1,16 @@
 ---
 license: creativeml-openrail-m
 ---

 ---
 license: creativeml-openrail-m
+base_model:
+- stabilityai/stable-diffusion-xl-base-1.0
 ---
+A latent diffusion model (LDM) geared toward illustration, style composability, and sample variety. Addresses a few deficiencies with the SDXL base model.
+* Architecture: SD XL (base model is v1.0)
+* Training procedure: U-Net fully unfrozen, all-parameter continued pretraining at LR between 3e-8 and 3e-7 for 14,290,000 steps (at epoch 14).
+Trained on the Puzzle Box dataset, a large collection of permissively licensed images from the public Internet (or generated by previous Puzzle Box models). Each image has from 3 to 15 different captions which are used interchangably during training. There are 8.2 million images and 54 million captions in the dataset.
+The model is substantially better than the base SDXL model at producing images that look like film photographs, any kind of cartoon art, or old artist styles. It's also heavily tuned toward personal aesthetic preference.
+Prompt adherence is unusually good; aesthetics are generally better by human evaluation for generations between 1/4 and 1/2 megapixel in size.