Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,42 @@
|
|
| 1 |
-
---
|
| 2 |
-
license:
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: other
|
| 3 |
+
license_name: stabilityai-ai-community
|
| 4 |
+
license_link: LICENSE.md
|
| 5 |
+
tags:
|
| 6 |
+
- text-to-image
|
| 7 |
+
- stable-diffusion
|
| 8 |
+
- diffusers
|
| 9 |
+
inference: true
|
| 10 |
+
language:
|
| 11 |
+
- en
|
| 12 |
+
pipeline_tag: text-to-image
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
+
# Stable Diffusion 3.5 Medium ONNX
|
| 16 |
+
This ONNX version of Stable Diffusion 3.5 Medium was made from the [PyTorch source model](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium), using `optimum-cli`: [Converting Stable Diffusion 3.5 Medium From PyTorch to ONNX](https://github.com/armwaheed/stability-ai-toolkit/tree/armwaheed/sd35-inpainting-onnx/pytorch-to-onnx)
|
| 17 |
+
|
| 18 |
+

|
| 19 |
+
|
| 20 |
+
## Usage
|
| 21 |
+
Python Gradio: [Stable Diffusion 3.5 Inpainting in ONNX](https://github.com/armwaheed/stability-ai-toolkit/blob/armwaheed/sd35-inpainting-onnx/README.md)
|
| 22 |
+
|
| 23 |
+
## Model
|
| 24 |
+
|
| 25 |
+

|
| 26 |
+
|
| 27 |
+
[Stable Diffusion 3.5 Medium](https://stability.ai/news/introducing-stable-diffusion-3-5) is a Multimodal Diffusion Transformer with improvements (MMDiT-X) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
|
| 28 |
+
|
| 29 |
+
Please note: This model is released under the [Stability Community License](https://stability.ai/community-license-agreement). Visit [Stability AI](https://stability.ai/license) to learn or [contact us](https://stability.ai/enterprise) for commercial licensing details.
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
### Model Description
|
| 33 |
+
|
| 34 |
+
- **Developed by:** Stability AI
|
| 35 |
+
- **Model type:** MMDiT-X text-to-image generative model
|
| 36 |
+
- **Model Description:** This model generates images based on text prompts. It is a Multimodal Diffusion Transformer
|
| 37 |
+
(https://arxiv.org/abs/2403.03206) with improvements that use three fixed, pretrained text encoders, with QK-normalization to improve training stability, and dual attention blocks in the first 12 transformer layers.
|
| 38 |
+
|
| 39 |
+
### License
|
| 40 |
+
|
| 41 |
+
- **Community License:** Free for research, non-commercial, and commercial use for organizations or individuals with less than $1M in total annual revenue. More details can be found in the [Community License Agreement](https://stability.ai/community-license-agreement). Read more at https://stability.ai/license.
|
| 42 |
+
- **For individuals and organizations with annual revenue above $1M**: please [contact us](https://stability.ai/enterprise) to get an Enterprise License.
|