PseudoTerminal X
commited on
Commit
•
a05352f
1
Parent(s):
1d02a1e
Trained for 0 epochs and 2000 steps.
Browse filesTrained with datasets ['text-embeds-pixart-filter', 'photo-concept-bucket', 'ideogram', 'midjourney-v6-520k-raw', 'sfwbooru', 'nijijourney-v6-520k-raw', 'dalle3']
Learning rate 1e-06, batch size 24, and 1 gradient accumulation steps.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'trailing' timestep spacing.
Base model: terminusresearch/pixart-900m-1024-ft-v0.6
VAE: madebyollin/sdxl-vae-fp16-fix
- README.md +9 -1
- optimizer.bin +1 -1
- random_states_0.pkl +1 -1
- scheduler.bin +1 -1
- training_state-dalle3.json +0 -0
- training_state-ideogram.json +0 -0
- training_state-midjourney-v6-520k-raw.json +0 -0
- training_state-nijijourney-v6-520k-raw.json +0 -0
- training_state-photo-concept-bucket.json +0 -0
- training_state-sfwbooru.json +0 -0
- training_state.json +1 -1
- transformer/config.json +1 -1
- transformer/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -47,7 +47,7 @@ You may reuse the base model text encoder for inference.
|
|
47 |
## Training settings
|
48 |
|
49 |
- Training epochs: 0
|
50 |
-
- Training steps:
|
51 |
- Learning rate: 1e-06
|
52 |
- Effective batch size: 192
|
53 |
- Micro-batch size: 24
|
@@ -70,6 +70,14 @@ You may reuse the base model text encoder for inference.
|
|
70 |
- Cropped: True
|
71 |
- Crop style: random
|
72 |
- Crop aspect: square
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
73 |
### midjourney-v6-520k-raw
|
74 |
- Repeats: 0
|
75 |
- Total number of images: ~390912
|
|
|
47 |
## Training settings
|
48 |
|
49 |
- Training epochs: 0
|
50 |
+
- Training steps: 2000
|
51 |
- Learning rate: 1e-06
|
52 |
- Effective batch size: 192
|
53 |
- Micro-batch size: 24
|
|
|
70 |
- Cropped: True
|
71 |
- Crop style: random
|
72 |
- Crop aspect: square
|
73 |
+
### ideogram
|
74 |
+
- Repeats: 15
|
75 |
+
- Total number of images: ~36096
|
76 |
+
- Total number of aspect buckets: 1
|
77 |
+
- Resolution: 1.0 megapixels
|
78 |
+
- Cropped: True
|
79 |
+
- Crop style: random
|
80 |
+
- Crop aspect: square
|
81 |
### midjourney-v6-520k-raw
|
82 |
- Repeats: 0
|
83 |
- Total number of images: ~390912
|
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5451415117
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dd8f60e8aae208dd92fae9115d03c841fc61241bc8cce99f08e5c10b0cbe1737
|
3 |
size 5451415117
|
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 16100
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4cc35cbbfba1d770fe33fbe068175bdf96d2b0925f70d35c452b11e1fe34dee7
|
3 |
size 16100
|
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1000
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bf997b28d4bfd8d4b58ad8d6128f43c39216511c3e56cc458eab92b813f97246
|
3 |
size 1000
|
training_state-dalle3.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-ideogram.json
ADDED
The diff for this file is too large to render.
See raw diff
|
|
training_state-midjourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-nijijourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-photo-concept-bucket.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-sfwbooru.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"global_step":
|
|
|
1 |
+
{"global_step": 2000, "epoch_step": 2000, "epoch": 1, "exhausted_backends": [], "repeats": {}}
|
transformer/config.json
CHANGED
@@ -1,7 +1,7 @@
|
|
1 |
{
|
2 |
"_class_name": "PixArtTransformer2DModel",
|
3 |
"_diffusers_version": "0.30.0.dev0",
|
4 |
-
"_name_or_path": "
|
5 |
"activation_fn": "gelu-approximate",
|
6 |
"attention_bias": true,
|
7 |
"attention_head_dim": 72,
|
|
|
1 |
{
|
2 |
"_class_name": "PixArtTransformer2DModel",
|
3 |
"_diffusers_version": "0.30.0.dev0",
|
4 |
+
"_name_or_path": "/home/ubuntu/training/models/checkpoint-1000",
|
5 |
"activation_fn": "gelu-approximate",
|
6 |
"attention_bias": true,
|
7 |
"attention_head_dim": 72,
|
transformer/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1816969728
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c3919281ef146513cdb0f68a3673f8c0cc69d7e346caae41749fddfab312472b
|
3 |
size 1816969728
|