File size: 1,878 Bytes
239ae60
 
 
 
 
 
 
7433705
e0d2f98
 
7433705
239ae60
 
30da159
 
3493476
30da159
 
 
3493476
 
 
 
30da159
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3493476
 
 
 
 
30da159
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
---
license: apache-2.0
datasets:
- laion/laion-art
language:
- en
library_name: diffusers
pipeline_tag: image-to-image
tags:
- jax-diffusers-event
base_model: runwayml/stable-diffusion-v1-5
---

# Color-Canny CantrolNet

These are ControlNet checkpoints trained on runwayml/stable-diffusion-v1-5, using fused color and canny edge as conditioning. 

You can find some example images in the following. 

## Examples

#### Color examples

**prompt**: a concept art of by Makoto Shinkai, a girl is standing in the middle of the sea

**negative prompt**: text, bad anatomy, blurry, (low quality, blurry)
![images_1)](./1.png)

**prompt**: a concept art of by Makoto Shinkai, a girl is standing in the middle of the sea

**negative prompt**: text, bad anatomy, blurry, (low quality, blurry)
![images_2)](./2.png)

**prompt**: a concept art of by Makoto Shinkai, a girl is standing in the middle of the grass

**negative prompt**: text, bad anatomy, blurry, (low quality, blurry)
![images_3)](./3.png)


#### Brightness examples
This model also can be used to control image brightness. The following images are generated with different brightness conditioning image and controlnet strength(0.5 ~ 0.7).
![images_4)](./4.jpg)


## Limitations and Bias

- No strict control by input color
- Sometimes generate image with confusion When color description in prompt

## Training

**Dataset**
We train this model on [laion-art](https://huggingface.co/datasets/laion/laion-art) dataset with 2.6m images, the processed dataset can be found in [ghoskno/laion-art-en-colorcanny](https://huggingface.co/datasets/ghoskno/laion-art-en-colorcanny).


**Training Details**

- **Hardware**: Google Cloud TPUv4-8 VM

- **Optimizer**: AdamW

- **Train Batch Size**: 4 x 4 = 16

- **Learning rate**: 0.00001 constant

- **Gradient Accumulation Steps**: 4

- **Resolution**: 512

- **Train Steps**: 36000