peacej's picture
Update README.md
368f004 verified
|
raw
history blame
4.26 kB
metadata
language:
  - ja
tags:
  - text-to-image
  - stable-diffusion
  - japanese-stable-diffusion
pipeline_tag: text-to-image
license: other
extra_gated_prompt: >-
  By downloading, using, or distributing any portion or element of this model,
  you agree to be bound by the agreement described in the LICENSE file.
extra_gated_fields:
  Name: text
  Email: text
  Country: text
  Organization or Affiliation: text
  I allow Stability AI to contact me about information related to its models and research: checkbox

Japanese Stable Diffusion XL

image

Please note: for commercial usage of this model, please see https://stability.ai/membership

ๅ•†็”จๅˆฉ็”จใซ้–ขใ™ใ‚‹ๆ—ฅๆœฌ่ชžใงใฎๅ•ใ„ๅˆใ‚ใ›ใฏใ€€[email protected] ใพใงใŠ้ก˜ใ„่‡ดใ—ใพใ™ใ€‚

Model Details

Japanese Stable Diffusion XL (JSDXL) is a Japanese-specific SDXL model that is capable of inputting prompts in Japanese and generating Japanese-style images.

Usage


from diffusers import DiffusionPipeline
import torch

pipeline = DiffusionPipeline.from_pretrained(
    "stabilityai/japanese-stable-diffusion-xl", trust_remote_code=True
)
pipeline.to("cuda")

# if using torch < 2.0
# pipeline.enable_xformers_memory_efficient_attention()

prompt = "ๆŸด็Šฌใ€ใ‚ซใƒฉใƒ•ใƒซใ‚ขใƒผใƒˆ"

image = pipeline(prompt=prompt).images[0]

Model Details

  • Developed by: Stability AI
  • Model type: Diffusion-based text-to-image generative model
  • Model Description: This model is a fine-tuned model based on SDXL 1.0. In order to maximize the understanding of the Japanese language and Japanese culture/expressions while preserving the versatility of the pre-trained model, we performed a PEFT training using one Japanese-specific compatible text encoder. As a PEFT method, we applied Orthogonal Fine-tuning (OFT) for better results and training stability.
  • License: STABILITY AI JAPANESE STABLE DIFFUSION XL COMMUNITY LICENSE

Uses

Direct Use

Commercial use: for commercial usage of this model, please see https://stability.ai/membership

ๅ•†็”จๅˆฉ็”จใซ้–ขใ™ใ‚‹ๆ—ฅๆœฌ่ชžใงใฎๅ•ใ„ๅˆใ‚ใ›ใฏใ€€[email protected] ใพใงใŠ้ก˜ใ„่‡ดใ—ใพใ™ใ€‚

Research: possible research areas/tasks include:

  • Generation of artworks and use in design and other artistic processes.
  • Applications in educational or creative tools.
  • Research on generative models.
  • Safe deployment of models which have the potential to generate harmful content.
  • Probing and understanding the limitations and biases of generative models.

Excluded uses are described below.

Out-of-Scope Use

The model was not trained to be factual or true representations of people or events, and therefore using the model to generate such content is out-of-scope for the abilities of this model.

Limitations and Bias

Limitations

  • The model does not achieve perfect photorealism
  • The model cannot render legible text
  • The model struggles with more difficult tasks which involve compositionality, such as rendering an image corresponding to โ€œA red cube on top of a blue sphereโ€
  • Faces and people in general may not be generated properly.
  • The autoencoding part of the model is lossy.

Bias

While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.

How to cite

@misc{JSDXL, 
    url    = {[https://huggingface.co/stabilityai/japanese-stable-diffusion-xl](https://huggingface.co/stabilityai/japanese-stable-diffusion-xl)}, 
    title  = {Japanese Stable Diffusion XL}, 
    author = {Shing, Makoto and Akiba, Takuya}
}

Contact

  • For questions and comments about the model, please join Stable Community Japan.
  • For future announcements / information about Stability AI models, research, and events, please follow https://twitter.com/StabilityAI_JP.
  • For business and partnership inquiries, please contact [email protected]. ใƒ“ใ‚ธใƒใ‚นใ‚„ๅ”ๆฅญใซ้–ขใ™ใ‚‹ใŠๅ•ใ„ๅˆใ‚ใ›ใฏ[email protected]ใซใ”้€ฃ็ตกใใ ใ•ใ„ใ€‚