	---
	license: creativeml-openrail-m
	---
	# Arcane Diffusion
	This is a fine-tuned Stable Diffusion model trained on images from the TV show Arcane.
	Include the tokens _arcane style_ in your prompts to get the effect.
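
As a small illustration of the prompt convention above, the following sketch prepends the style tokens when a prompt does not already contain them. The helper name `with_arcane_style` and the case-insensitive check are assumptions made for this example; they are not part of the model or of diffusers.

```python
STYLE_TOKENS = "arcane style"  # the tokens named in this model card


def with_arcane_style(prompt: str) -> str:
    """Return the prompt with the style tokens prepended if they are missing.

    Hypothetical helper, for illustration only.
    """
    text = prompt.strip()
    if STYLE_TOKENS in text.lower():
        # The tokens are already present (in any casing), so leave the prompt alone.
        return text
    if not text:
        return STYLE_TOKENS
    return f"{STYLE_TOKENS}, {text}"


print(with_arcane_style("a magical princess with golden hair"))
```

The returned string can be passed to the pipeline in place of a hand-written prompt.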

	If you enjoy this model, please check out my other models on [Hugging Face](https://huggingface.co/nitrosocke).

	### 🧨 Diffusers

	This model can be used just like any other Stable Diffusion model. For more information,
	please have a look at the [Stable Diffusion pipeline documentation](https://huggingface.co/docs/diffusers/api/pipelines/stable_diffusion).

	You can also export the model to [ONNX](https://huggingface.co/docs/diffusers/optimization/onnx), [MPS](https://huggingface.co/docs/diffusers/optimization/mps) and/or [FLAX/JAX]().

	```python
	from diffusers import StableDiffusionPipeline
	import torch

	# Load the fine-tuned weights in half precision to reduce GPU memory use
	model_id = "nitrosocke/Arcane-Diffusion"
	pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
	pipe = pipe.to("cuda")  # requires a CUDA-capable GPU

	# Include the "arcane style" tokens so the trained style is applied
	prompt = "arcane style, a magical princess with golden hair"
	image = pipe(prompt).images[0]

	image.save("./magical_princess.png")
	```

	![img](https://huggingface.co/nitrosocke/Arcane-Diffusion/resolve/main/magical_princess.png)
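
The snippet above assumes a CUDA GPU and produces a different image on every run. The sketch below is one possible variation that falls back to MPS or CPU and fixes the random seed so a result can be reproduced. The helper names `pick_device` and `generate`, the default values for steps and guidance scale, and the choice of float32 outside CUDA are assumptions for this example, not recommendations from the model author. It also assumes a PyTorch version that provides `torch.backends.mps`.

```python
from typing import Tuple

MODEL_ID = "nitrosocke/Arcane-Diffusion"


def pick_device(cuda_available: bool, mps_available: bool) -> Tuple[str, str]:
    """Choose a device name and dtype name (hypothetical helper).

    Assumption: float16 is used only on CUDA; other devices use float32.
    """
    if cuda_available:
        return "cuda", "float16"
    if mps_available:
        return "mps", "float32"
    return "cpu", "float32"


def generate(prompt: str, seed: int = 0, steps: int = 30, guidance: float = 7.5):
    """Generate one image with a fixed seed so the run can be repeated."""
    # Imported lazily so pick_device can be used without torch installed.
    import torch
    from diffusers import StableDiffusionPipeline

    device, dtype_name = pick_device(
        torch.cuda.is_available(), torch.backends.mps.is_available()
    )
    pipe = StableDiffusionPipeline.from_pretrained(
        MODEL_ID, torch_dtype=getattr(torch, dtype_name)
    ).to(device)

    # Seed a generator on the GPU for CUDA, and on the CPU otherwise.
    generator_device = "cuda" if device == "cuda" else "cpu"
    generator = torch.Generator(generator_device).manual_seed(seed)

    result = pipe(
        prompt,
        num_inference_steps=steps,
        guidance_scale=guidance,
        generator=generator,
    )
    return result.images[0]


# Example call (downloads the weights on first use):
# image = generate("arcane style, a magical princess with golden hair", seed=42)
# image.save("./magical_princess_seed42.png")
```

Reusing the same seed, prompt, and settings on the same hardware and library versions should give the same image; results can still differ across devices.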

	### Sample images from v3:
	![output Samples v3](https://huggingface.co/nitrosocke/Arcane-Diffusion/resolve/main/arcane-v3-samples-01.jpg)
	![output Samples v3](https://huggingface.co/nitrosocke/Arcane-Diffusion/resolve/main/arcane-v3-samples-02.jpg)
	### Sample images from the model:
	![output Samples](https://huggingface.co/nitrosocke/Arcane-Diffusion/resolve/main/arcane-diffusion-output-images.jpg)
	### Sample images used for training:
	![Training Samples](https://huggingface.co/nitrosocke/Arcane-Diffusion/resolve/main/arcane-diffusion-training-images.jpg)

	Version 3 (arcane-diffusion-v3): This version uses the new _train-text-encoder_ setting, which improves the quality and editability of the model immensely. It was trained on 95 images from the show in 8000 steps.

	Version 2 (arcane-diffusion-v2): This version uses the diffusers-based DreamBooth training, where prior-preservation loss is far more effective. The diffusers weights were then converted with a script to a ckpt file in order to work with AUTOMATIC1111's repo.
	Training was done with 5k steps for a direct comparison to v1, and the results show that it needs more steps for a more prominent effect. Version 3 will be tested with 11k steps.

	Version 1 (arcane-diffusion-5k): This model was trained using _Unfrozen Model Textual Inversion_ together with the _Training with prior-preservation loss_ method. There is still a slight shift towards the style even when the arcane token is not used.
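
The prior-preservation loss mentioned for versions 1 and 2 comes from DreamBooth-style training. It combines the loss on the new style images with a weighted loss on images generated by the original model, which discourages the fine-tuned model from drifting away from what it already knew. The sketch below shows that combination on plain Python lists. The function names, the use of mean squared error, and the default weight of 1.0 are assumptions for illustration; they are not taken from the training setup used for this model.

```python
from typing import Sequence


def mse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between two equal-length, non-empty sequences."""
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and equally long")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def prior_preservation_loss(
    instance_pred: Sequence[float],
    instance_target: Sequence[float],
    prior_pred: Sequence[float],
    prior_target: Sequence[float],
    prior_weight: float = 1.0,  # assumed default, for illustration
) -> float:
    """Instance loss plus a weighted loss on the prior (class) samples."""
    instance_loss = mse(instance_pred, instance_target)  # new style images
    prior_loss = mse(prior_pred, prior_target)  # images from the original model
    return instance_loss + prior_weight * prior_loss
```

A larger `prior_weight` pulls training more strongly towards the original model's behaviour, while a weight of zero reduces the expression to the plain fine-tuning loss.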