Commit e5bcc3c (verified) by armwaheed · 1 Parent(s): 68f94ad

Update README.md

  1. README.md +42 -3
README.md CHANGED
@@ -1,3 +1,42 @@
1
- ---
2
- license: apache-2.0
3
- ---
1
+ ---
2
+ license: other
3
+ license_name: stabilityai-ai-community
4
+ license_link: LICENSE.md
5
+ tags:
6
+ - text-to-image
7
+ - stable-diffusion
8
+ - diffusers
9
+ inference: true
10
+ language:
11
+ - en
12
+ pipeline_tag: text-to-image
13
+ ---
14
+
15
+ # Stable Diffusion 3.5 Medium ONNX
16
+ This ONNX version of Stable Diffusion 3.5 Medium was exported from the [PyTorch source model](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium) using `optimum-cli`. The conversion steps are described in [Converting Stable Diffusion 3.5 Medium From PyTorch to ONNX](https://github.com/armwaheed/stability-ai-toolkit/tree/armwaheed/sd35-inpainting-onnx/pytorch-to-onnx).
17
+
18
+ ![3.5 Medium Demo Image](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium/blob/main/sd3.5_medium_demo.jpg)
19
+
20
+ ## Usage
21
+ Python Gradio: [Stable Diffusion 3.5 Inpainting in ONNX](https://github.com/armwaheed/stability-ai-toolkit/blob/armwaheed/sd35-inpainting-onnx/README.md)
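Review note on the Usage section: the README only links to a Gradio inpainting app, so a reader has no inline example of loading the exported files. The following is a minimal text-to-image sketch, not the author's app. It assumes an Optimum release whose ONNX Runtime pipelines support Stable Diffusion 3, and that `model_dir` (a hypothetical argument) points at a local folder or Hub repo containing the exported ONNX files. The default step count and guidance scale are illustrative values, not settings stated in this commit.

```python
from pathlib import Path


def generate_image(model_dir, prompt, output_path="output.png",
                   num_inference_steps=40, guidance_scale=4.5):
    """Generate one image from a text prompt with an ONNX Runtime pipeline.

    model_dir is an assumption: a local folder or Hub repo id holding the
    exported ONNX files. The README does not name it.
    """
    # Imported lazily so this module can be imported without optimum installed.
    from optimum.onnxruntime import ORTDiffusionPipeline

    # Load the exported pipeline components (text encoders, transformer, VAE).
    pipe = ORTDiffusionPipeline.from_pretrained(model_dir)

    # Run the denoising loop; the defaults are illustrative, not from the README.
    result = pipe(
        prompt=prompt,
        num_inference_steps=num_inference_steps,
        guidance_scale=guidance_scale,
    )

    # The pipeline output exposes generated PIL images under `.images`.
    out = Path(output_path)
    result.images[0].save(out)
    return out
```

Calling `generate_image("path/to/exported-model", "a photo of a red fox")` would write `output.png` in the working directory, provided the path (a placeholder here) holds a compatible export.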
22
+
23
+ ## Model
24
+
25
+ ![MMDiT-X](https://huggingface.co/stabilityai/stable-diffusion-3.5-medium/blob/main/mmdit-x.png)
26
+
27
+ [Stable Diffusion 3.5 Medium](https://stability.ai/news/introducing-stable-diffusion-3-5) is a Multimodal Diffusion Transformer with improvements (MMDiT-X) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
28
+
29
+ Please note: This model is released under the [Stability Community License](https://stability.ai/community-license-agreement). Visit [Stability AI](https://stability.ai/license) to learn more, or [contact us](https://stability.ai/enterprise) for commercial licensing details.
30
+
31
+
32
+ ### Model Description
33
+
34
+ - **Developed by:** Stability AI
35
+ - **Model type:** MMDiT-X text-to-image generative model
36
+ - **Model Description:** This model generates images from text prompts. It is a Multimodal Diffusion Transformer
37
+ (https://arxiv.org/abs/2403.03206) with improvements: it uses three fixed, pretrained text encoders, QK-normalization to improve training stability, and dual attention blocks in the first 12 transformer layers.
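Review note on the Model Description: the README names QK-normalization without saying what it does. The sketch below shows the general idea in plain Python, assuming RMS normalization of queries and keys before the dot product (one common form; the README does not specify the variant). The function names are hypothetical and this is a single-head toy, not the model's implementation. Because each query and key is rescaled to roughly unit root-mean-square, attention logits stay bounded regardless of how large the raw activations grow.

```python
import math


def rms_norm(vec, eps=1e-6):
    """Divide a vector by its root-mean-square so its RMS is close to 1."""
    rms = math.sqrt(sum(x * x for x in vec) / len(vec) + eps)
    return [x / rms for x in vec]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def qk_norm_attention(queries, keys, values):
    """Single-head attention with RMS-normalized queries and keys."""
    dim = len(queries[0])
    # Normalize each query and key vector before computing logits.
    q_n = [rms_norm(q) for q in queries]
    k_n = [rms_norm(k) for k in keys]
    outputs = []
    for q in q_n:
        # Scaled dot-product logits between this query and every key.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in k_n]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs
```

Scaling the raw queries and keys by a large constant leaves the output essentially unchanged, which is the stabilizing property the normalization is meant to provide.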
38
+
39
+ ### License
40
+
41
+ - **Community License:** Free for research, non-commercial, and commercial use for organizations or individuals with less than $1M in total annual revenue. More details can be found in the [Community License Agreement](https://stability.ai/community-license-agreement). Read more at https://stability.ai/license.
42
+ - **For individuals and organizations with annual revenue above $1M**: please [contact us](https://stability.ai/enterprise) to get an Enterprise License.