File size: 7,526 Bytes
6268841
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
# Compatibility Matrix

The table below shows every supported model and the optimizations supported for them.

The symbols used have the following meanings:

- βœ… = Full compatibility
- ❌ = No compatibility
- β­• = Does not apply to this model

## Models x Optimization

The `HuggingFace Model ID` can be passed directly to `from_pretrained()` methods, and sglang-diffusion will use the
optimal
default parameters when initializing and generating videos.

### Video Generation Models

| Model Name                   | Hugging Face Model ID                             | Resolutions         | TeaCache | Sliding Tile Attn | Sage Attn | Video Sparse Attention (VSA) | Sparse Linear Attention (SLA) | Sage Sparse Linear Attention (SageSLA) | Sparse Video Gen 2 (SVG2) |
|:-----------------------------|:--------------------------------------------------|:--------------------|:--------:|:-----------------:|:---------:|:----------------------------:|:----------------------------:|:-----------------------------------------------:|:----------------------------------:|
| FastWan2.1 T2V 1.3B          | `FastVideo/FastWan2.1-T2V-1.3B-Diffusers`         | 480p                |    β­•     |         β­•         |      β­•     |              βœ…               |              ❌               |              ❌               |    ❌     |
| FastWan2.2 TI2V 5B Full Attn | `FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers` | 720p                |    β­•     |         β­•         |     β­•     |              βœ…               |              ❌               |              ❌               |    ❌     |
| Wan2.2 TI2V 5B               | `Wan-AI/Wan2.2-TI2V-5B-Diffusers`                 | 720p                |    β­•     |         β­•         |     βœ…     |              β­•               |              ❌               |              ❌               |    ❌     |
| Wan2.2 T2V A14B              | `Wan-AI/Wan2.2-T2V-A14B-Diffusers`                | 480p<br>720p        |    ❌     |         ❌         |     βœ…     |              β­•               |              ❌               |              ❌               |    ❌     |
| Wan2.2 I2V A14B              | `Wan-AI/Wan2.2-I2V-A14B-Diffusers`                | 480p<br>720p        |    ❌     |         ❌         |     βœ…     |              β­•               |              ❌               |              ❌               |    ❌     |
| HunyuanVideo                 | `hunyuanvideo-community/HunyuanVideo`             | 720Γ—1280<br>544Γ—960 |    ❌     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| FastHunyuan                  | `FastVideo/FastHunyuan-diffusers`                 | 720Γ—1280<br>544Γ—960 |    ❌     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| Wan2.1 T2V 1.3B              | `Wan-AI/Wan2.1-T2V-1.3B-Diffusers`                | 480p                |    βœ…     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| Wan2.1 T2V 14B               | `Wan-AI/Wan2.1-T2V-14B-Diffusers`                 | 480p, 720p          |    βœ…     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| Wan2.1 I2V 480P              | `Wan-AI/Wan2.1-I2V-14B-480P-Diffusers`            | 480p                |    βœ…     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| Wan2.1 I2V 720P              | `Wan-AI/Wan2.1-I2V-14B-720P-Diffusers`            | 720p                |    βœ…     |         βœ…         |     βœ…     |              β­•               |              ❌               |              ❌               |    βœ…     |
| TurboWan2.1 T2V 1.3B         | `IPostYellow/TurboWan2.1-T2V-1.3B-Diffusers`      | 480p                |    βœ…     |         ❌         |     ❌     |              ❌               |              βœ…               |              βœ…               |    β­•     |
| TurboWan2.1 T2V 14B          | `IPostYellow/TurboWan2.1-T2V-14B-Diffusers`       | 480p                |    βœ…     |         ❌         |     ❌     |              ❌               |              βœ…               |              βœ…               |    β­•     |
| TurboWan2.1 T2V 14B 720P     | `IPostYellow/TurboWan2.1-T2V-14B-720P-Diffusers`  | 720p                |    βœ…     |         ❌         |     ❌     |              ❌               |              βœ…               |              βœ…               |    β­•     |
| TurboWan2.2 I2V A14B         | `IPostYellow/TurboWan2.2-I2V-A14B-Diffusers`      | 720p                |    βœ…     |         ❌         |     ❌     |              ❌               |              βœ…               |              βœ…               |    β­•     |

**Note**:
1.Wan2.2 TI2V 5B has some quality issues when performing I2V generation. We are working on fixing this issue.
2.SageSLA Based on SpargeAttn. Install it first with `pip install git+https://github.com/thu-ml/SpargeAttn.git --no-build-isolation`

### Image Generation Models

| Model Name       | HuggingFace Model ID                    | Resolutions    |
|:-----------------|:----------------------------------------|:---------------|
| FLUX.1-dev       | `black-forest-labs/FLUX.1-dev`          | Any resolution |
| FLUX.2-dev       | `black-forest-labs/FLUX.2-dev`          | Any resolution |
| FLUX.2-Klein     | `black-forest-labs/FLUX.2-klein-4B`     | Any resolution |
| Z-Image-Turbo    | `Tongyi-MAI/Z-Image-Turbo`              | Any resolution |
| GLM-Image        | `zai-org/GLM-Image`                     | Any resolution |
| Qwen Image       | `Qwen/Qwen-Image`                       | Any resolution |
| Qwen Image 2512  | `Qwen/Qwen-Image-2512`                  | Any resolution |
| Qwen Image Edit  | `Qwen/Qwen-Image-Edit`                  | Any resolution |

## Verified LoRA Examples

This section lists example LoRAs that have been explicitly tested and verified with each base model in the **SGLang Diffusion** pipeline.

> Important:
> LoRAs that are not listed here are not necessarily incompatible.
> In practice, most standard LoRAs are expected to work, especially those following common Diffusers or SD-style conventions.
> The entries below simply reflect configurations that have been manually validated by the SGLang team.

### Verified LoRAs by Base Model

| Base Model       | Supported LoRAs |
|:-----------------|:----------------|
| Wan2.2           | `lightx2v/Wan2.2-Distill-Loras`<br>`Cseti/wan2.2-14B-Arcane_Jinx-lora-v1` |
| Wan2.1           | `lightx2v/Wan2.1-Distill-Loras` |
| Z-Image-Turbo    | `tarn59/pixel_art_style_lora_z_image_turbo`<br>`wcde/Z-Image-Turbo-DeJPEG-Lora` |
| Qwen-Image       | `lightx2v/Qwen-Image-Lightning`<br>`flymy-ai/qwen-image-realism-lora`<br>`prithivMLmods/Qwen-Image-HeadshotX`<br>`starsfriday/Qwen-Image-EVA-LoRA` |
| Qwen-Image-Edit  | `ostris/qwen_image_edit_inpainting`<br>`lightx2v/Qwen-Image-Edit-2511-Lightning` |
| Flux             | `dvyio/flux-lora-simple-illustration`<br>`XLabs-AI/flux-furry-lora`<br>`XLabs-AI/flux-RealismLora` |

## Special requirements

### Sliding Tile Attention

- Currently, only Hopper GPUs (H100s) are supported.