diffusers / docs /source /en /optimization /onnx.md

End of training

c0af20c almost 2 years ago

4.19 kB

	<!--Copyright 2023 The HuggingFace Team. All rights reserved.

	Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
	the License. You may obtain a copy of the License at

	http://www.apache.org/licenses/LICENSE-2.0

	Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
	an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
	specific language governing permissions and limitations under the License.
	-->


	# How to use ONNX Runtime for inference

	🤗 [Optimum](https://github.com/huggingface/optimum) provides a Stable Diffusion pipeline compatible with ONNX Runtime.

	## Installation

	Install 🤗 Optimum with the following command for ONNX Runtime support:

	```
	pip install optimum["onnxruntime"]
	```

	## Stable Diffusion

	### Inference

	To load an ONNX model and run inference with ONNX Runtime, you need to replace [`StableDiffusionPipeline`] with `ORTStableDiffusionPipeline`. In case you want to load a PyTorch model and convert it to the ONNX format on-the-fly, you can set `export=True`.

	```python
	from optimum.onnxruntime import ORTStableDiffusionPipeline

	model_id = "runwayml/stable-diffusion-v1-5"
	pipeline = ORTStableDiffusionPipeline.from_pretrained(model_id, export=True)
	prompt = "sailing ship in storm by Leonardo da Vinci"
	image = pipeline(prompt).images[0]
	pipeline.save_pretrained("./onnx-stable-diffusion-v1-5")
	```

	If you want to export the pipeline in the ONNX format offline and later use it for inference,
	you can use the [`optimum-cli export`](https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#exporting-a-model-to-onnx-using-the-cli) command:

	```bash
	optimum-cli export onnx --model runwayml/stable-diffusion-v1-5 sd_v15_onnx/
	```

	Then perform inference:

	```python
	from optimum.onnxruntime import ORTStableDiffusionPipeline

	model_id = "sd_v15_onnx"
	pipeline = ORTStableDiffusionPipeline.from_pretrained(model_id)
	prompt = "sailing ship in storm by Leonardo da Vinci"
	image = pipeline(prompt).images[0]
	```

	Notice that we didn't have to specify `export=True` above.

	<div class="flex justify-center">
	<img src="https://huggingface.co/datasets/optimum/documentation-images/resolve/main/onnxruntime/stable_diffusion_v1_5_ort_sail_boat.png">
	</div>

	You can find more examples in [optimum documentation](https://huggingface.co/docs/optimum/).


	### Supported tasks

	\| Task \| Loading Class \|
	\|--------------------------------------\|--------------------------------------\|
	\| `text-to-image` \| `ORTStableDiffusionPipeline` \|
	\| `image-to-image` \| `ORTStableDiffusionImg2ImgPipeline` \|
	\| `inpaint` \| `ORTStableDiffusionInpaintPipeline` \|

	## Stable Diffusion XL

	### Export

	To export your model to ONNX, you can use the [Optimum CLI](https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#exporting-a-model-to-onnx-using-the-cli) as follows :

	```bash
	optimum-cli export onnx --model stabilityai/stable-diffusion-xl-base-1.0 --task stable-diffusion-xl sd_xl_onnx/
	```

	### Inference

	To load an ONNX model and run inference with ONNX Runtime, you need to replace `StableDiffusionPipelineXL` with `ORTStableDiffusionPipelineXL` :

	```python
	from optimum.onnxruntime import ORTStableDiffusionXLPipeline

	pipeline = ORTStableDiffusionXLPipeline.from_pretrained("sd_xl_onnx")
	prompt = "sailing ship in storm by Leonardo da Vinci"
	image = pipeline(prompt).images[0]
	```

	### Supported tasks

	\| Task \| Loading Class \|
	\|--------------------------------------\|--------------------------------------\|
	\| `text-to-image` \| `ORTStableDiffusionXLPipeline` \|
	\| `image-to-image` \| `ORTStableDiffusionXLImg2ImgPipeline`\|

	## Known Issues

	- Generating multiple prompts in a batch seems to take too much memory. While we look into it, you may need to iterate instead of batching.