stable-diffusion-3.5-large-turbo

ADD-distilled Multimodal Diffusion Transformer (MMDiT) text-to-image model. Uses three fixed, pretrained text encoders with QK-normalization. Designed for high-quality output at low step counts (typically 4 steps, guidance 0.0).

This repository is an unmodified redistribution of stabilityai/stable-diffusion-3.5-large-turbo. Weights, configs, and license are preserved verbatim.

Files

  • sd3.5_large_turbo.safetensors โ€” single-file checkpoint.
  • transformer/, vae/, scheduler/, text_encoder/, text_encoder_2/, text_encoder_3/, tokenizer*/ โ€” diffusers-format components.
  • text_encoders/ โ€” single-file text encoders for ComfyUI workflows.
  • model_index.json, SD3.5L_Turbo_example_workflow.json โ€” pipeline and workflow configs.
  • LICENSE.md โ€” Stability AI Community License (verbatim).

Inference

import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_pretrained(
    "cudabenchmarktest/stable-diffusion-3.5-large-turbo",
    torch_dtype=torch.bfloat16,
).to("cuda")

image = pipe(
    "A capybara holding a sign that reads Hello Fast World",
    num_inference_steps=4,
    guidance_scale=0.0,
).images[0]
image.save("out.png")

License and attribution

Governed by the Stability AI Community License Agreement (see LICENSE.md). Permits research, non-commercial use, and commercial use for organizations or individuals with less than $1M USD in total annual revenue. Above that threshold a separate Stability Enterprise license is required.

  • Original release: Stability AI (stabilityai/stable-diffusion-3.5-large-turbo).
  • This redistribution: weights and configs unmodified, LICENSE preserved, README replaced, demo image files removed. No additional modifications.
Downloads last month
15
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cudabenchmarktest/stable-diffusion-3.5-large-turbo

Finetuned
(4)
this model