Text-to-Image
GGUF
English
pixart
gguf-node
pixart / README.md
calcuis's picture
Update README.md
623e413 verified
metadata
license: openrail++
language:
  - en
base_model:
  - PixArt-alpha/PixArt-XL-2-1024-MS
pipeline_tag: text-to-image
tags:
  - pixart
  - gguf-node
widget:
  - text: >-
      a close-up shot of a beautiful girl in a serene world. She has white hair
      and is blindfolded, with a calm expression. Her hands are pressed together
      in a prayer pose, with fingers interlaced and palms touching. The
      background is softly blurred, enhancing her ethereal presence.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: samples\ComfyUI_00007_.png
  - text: >-
      a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
      atmosphere, sharp focus, highly detailed, cinematic, original composition,
      fine detail, intricate, elegant, creative, color spread, shiny, amazing,
      symmetry, illuminated, inspired, pretty, attractive, artistic, dynamic
      background, relaxed, professional, extremely inspirational, beautiful,
      determined, cute, adorable, best
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: samples\ComfyUI_00008_.png
  - text: >-
      a girl stands amidst scattered glass shards, surrounded by a beautifully
      crafted and expansive world. The scene is depicted from a dynamic angle,
      emphasizing her determined expression. The background features vast
      landscapes with floating crystals and soft, glowing lights that create a
      mystical and grand atmosphere.
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: samples\ComfyUI_00009_.png
  - text: close-up portrait of girl
    output:
      url: samples\ComfyUI_00001_.png
  - text: close-up portrait of cat
    output:
      url: samples\ComfyUI_00002_.png
  - text: close-up portrait of young lady
    output:
      url: samples\ComfyUI_00003_.png

gguf quantized version of pixart

Prompt
a close-up shot of a beautiful girl in a serene world. She has white hair and is blindfolded, with a calm expression. Her hands are pressed together in a prayer pose, with fingers interlaced and palms touching. The background is softly blurred, enhancing her ethereal presence.
Negative Prompt
blurry, cropped, ugly
Prompt
a wizard with a glowing staff and a glowing hat, colorful magic, dramatic atmosphere, sharp focus, highly detailed, cinematic, original composition, fine detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry, illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed, professional, extremely inspirational, beautiful, determined, cute, adorable, best
Negative Prompt
blurry, cropped, ugly
Prompt
a girl stands amidst scattered glass shards, surrounded by a beautifully crafted and expansive world. The scene is depicted from a dynamic angle, emphasizing her determined expression. The background features vast landscapes with floating crystals and soft, glowing lights that create a mystical and grand atmosphere.
Negative Prompt
blurry, cropped, ugly
Prompt
close-up portrait of girl
Prompt
close-up portrait of cat
Prompt
close-up portrait of young lady

setup (once)

  • drag pixart-xl-2-1024-ms-q4_k_m.gguf [1GB] to > ./ComfyUI/models/diffusion_models
  • drag t5xxl_fp16-q4_0.gguf [2.9GB] to > ./ComfyUI/models/text_encoders
  • drag pixart_vae_fp8_e4m3fn.safetensors [83.7MB] to > ./ComfyUI/models/vae

run it straight (no installation needed way)

  • run the .bat file in the main directory (assuming you are using the gguf-node pack below)
  • drag the workflow json file (below) or the demo picture above to > your browser

workflow

review

  • should set the output image size according to the model stated, i.e., 1024x1024 or 512x512
  • pixart-xl-2-1024-ms and pixart-sigma-xl-2-1024-ms are recommended (with 1024x1024 size)
  • small size model but good quality pictures; and t5 encoder allows you inputting short description or sentence instead of tag(s)
  • more quantized versions of t5xxl encoder can be found here
  • upgrade your gguf-node (see the last item in reference list below) to the latest version for pixart model support

paper

reference