---
license: openrail++
language:
- en
base_model:
- PixArt-alpha/PixArt-XL-2-1024-MS
pipeline_tag: text-to-image
tags:
- pixart
- gguf-node
widget:
- text: a close-up shot of a beautiful girl in a serene world. She has white hair
    and is blindfolded, with a calm expression. Her hands are pressed together in
    a prayer pose, with fingers interlaced and palms touching. The background is softly
    blurred, enhancing her ethereal presence.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00007_.png
- text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
    atmosphere, sharp focus, highly detailed, cinematic, original composition, fine
    detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry,
    illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed,
    professional, extremely inspirational, beautiful, determined, cute, adorable,
    best
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00008_.png
- text: a girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00009_.png
- text: close-up portrait of girl
  output:
    url: samples\ComfyUI_00001_.png
- text: close-up portrait of cat
  output:
    url: samples\ComfyUI_00002_.png
- text: close-up portrait of young lady
  output:
    url: samples\ComfyUI_00003_.png
---

# **gguf quantized version of pixart**

<Gallery />

## **setup (once)**
- drag pixart-xl-2-1024-ms-q4_k_m.gguf [[1GB](https://huggingface.co/calcuis/pixart/blob/main/pixart-xl-2-1024-ms-q4_k_m.gguf)] to > ./ComfyUI/models/diffusion_models
- drag t5xxl_fp16-q4_0.gguf [[2.9GB](https://huggingface.co/calcuis/pixart/blob/main/t5xxl_fp16-q4_0.gguf)] to > ./ComfyUI/models/text_encoders
- drag pixart_vae_fp8_e4m3fn.safetensors [[83.7MB](https://huggingface.co/calcuis/pixart/blob/main/pixart_vae_fp8_e4m3fn.safetensors)] to > ./ComfyUI/models/vae
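
if you prefer a script over dragging files, the three steps above can be sketched as below. this is a minimal sketch, not part of the gguf-node pack: it assumes the files are reachable at the usual `resolve/main` download urls (the links above point at the `blob` pages), and `plan_downloads` / `fetch_all` are hypothetical helper names.

```python
from pathlib import Path
from urllib.request import urlretrieve

# assumption: direct-download form of the blob links in the setup list
REPO_URL = "https://huggingface.co/calcuis/pixart/resolve/main"

# file name -> ComfyUI subfolder, copied from the setup list
FILES = {
    "pixart-xl-2-1024-ms-q4_k_m.gguf": "models/diffusion_models",
    "t5xxl_fp16-q4_0.gguf": "models/text_encoders",
    "pixart_vae_fp8_e4m3fn.safetensors": "models/vae",
}


def plan_downloads(comfy_root="./ComfyUI"):
    """Return (url, target_path) pairs without touching the network."""
    root = Path(comfy_root)
    return [(f"{REPO_URL}/{name}", root / sub / name) for name, sub in FILES.items()]


def fetch_all(comfy_root="./ComfyUI"):
    """Download every file that is not already in place."""
    for url, target in plan_downloads(comfy_root):
        if target.exists():
            continue  # already set up, skip
        target.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(url, target)


# fetch_all()  # uncomment to download (about 4GB in total)
```

`plan_downloads` is kept separate so you can print and check the target paths before anything is downloaded.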

## **run it straight (no installation needed)**
- run the .bat file in the main directory (assuming you are using the gguf-node [pack](https://github.com/calcuis/gguf/releases) below)
- drag the workflow json file (below) or the demo picture above to > your browser

### **workflow**
- example workflow for [gguf](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-gguf.json)
- example workflow for [safetensors](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-safetensors.json)

### **review**
- set the output image size to match the resolution stated in the model name, i.e., 1024x1024 or 512x512
- pixart-xl-2-1024-ms and pixart-sigma-xl-2-1024-ms are recommended (with 1024x1024 size)
- small model but good-quality pictures; the t5 encoder lets you input a short description or sentence instead of tag(s)
- more quantized versions of t5xxl encoder can be found [here](https://huggingface.co/chatpig/t5xxl/tree/main)
- upgrade your gguf-node (see the last item in reference list below) to the latest version for pixart model support
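
the image-size note above can be sketched as a small check. this is only an illustration: `native_size` is a hypothetical helper (not a gguf-node or comfyui function), and it assumes the resolution appears as a `512` or `1024` token in the model file name, as it does in `pixart-xl-2-1024-ms`.

```python
import re

# assumption: the model name carries its resolution as a standalone 512 or 1024
_SIZE = re.compile(r"(?<!\d)(512|1024)(?!\d)")


def native_size(model_name):
    """Return (width, height) guessed from the model name, or None if not stated."""
    match = _SIZE.search(model_name)
    if match is None:
        return None
    side = int(match.group(1))
    return (side, side)


print(native_size("pixart-xl-2-1024-ms-q4_k_m.gguf"))
```

use the returned pair as the width and height of the output image in your workflow; if it returns `None`, check the base model page for the intended size.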

### **paper**
- [pixart-α](https://arxiv.org/pdf/2310.00426)
- [pixart-Σ](https://arxiv.org/pdf/2403.04692)
- [high-resolution image synthesis with latent diffusion models](https://arxiv.org/pdf/2112.10752)

### **reference**
- base model from [pixart-alpha](https://huggingface.co/PixArt-alpha)
- comfyui [comfyanonymous](https://github.com/comfyanonymous/ComfyUI)
- gguf-node ([pypi](https://pypi.org/project/gguf-node)|[repo](https://github.com/calcuis/gguf)|[pack](https://github.com/calcuis/gguf/releases))