Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper • 2403.03206 • Published • 71
import torch
from diffusers import DiffusionPipeline
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("armwaheed/stable-diffusion-3.5-medium-onnx", dtype=torch.bfloat16, device_map="cuda")
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]This ONNX version of Stable Diffusion 3.5 Medium was made from the PyTorch source model, using optimum-cli: Converting Stable Diffusion 3.5 Medium From PyTorch to ONNX.
Python Gradio: Stable Diffusion 3.5 Inpainting in ONNX
Stable Diffusion 3.5 Medium is a Multimodal Diffusion Transformer with improvements (MMDiT-X) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
Please note: This model is released under the Stability Community License. Visit Stability AI to learn or contact us for commercial licensing details.