ThirdMiddle commited on
Commit
7a977ae
·
verified ·
1 Parent(s): 4554878

Add model card (run prod-20260407)

Browse files
Files changed (1) hide show
  1. README.md +121 -3
README.md CHANGED
@@ -1,3 +1,121 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - Qwen/Qwen-Image-2512
5
+ - Qwen/Qwen-Image-Edit-2511
6
+ - Qwen/Qwen-Image
7
+ tags:
8
+ - image-generation
9
+ - qwen
10
+ - mmdit
11
+ - abliterated
12
+ - quantized
13
+ - rocm
14
+ language:
15
+ - en
16
+ library_name: diffusers
17
+ pipeline_tag: text-to-image
18
+ ---
19
+
20
+ # Qwen-Image-1.9
21
+
22
+ A merged, abliterated, and quantized derivative of the Qwen-Image 20B MMDiT family.
23
+
24
+ > **Run ID:** `prod-20260407`
25
+ > **Created:** 2026-04-07T18:59:37+00:00
26
+
27
+ ## Architecture
28
+
29
+ | Property | Value |
30
+ | --- | --- |
31
+ | Base family | Qwen-Image (MMDiT 20B) |
32
+ | Text encoder | Qwen2.5-VL |
33
+ | VAE | RGB-VAE |
34
+ | RoPE | 2D |
35
+ | Backbone parameters | ~20B |
36
+ | License | Apache-2.0 |
37
+
38
+ ## Source Models
39
+
40
+ | Alias | Model | Role | License |
41
+ | --- | --- | --- | --- |
42
+ | `qwen-image-2512` | [Qwen/Qwen-Image-2512](https://huggingface.co/Qwen/Qwen-Image-2512) | foundation | Apache-2.0 |
43
+ | `qwen-image-base` | [Qwen/Qwen-Image](https://huggingface.co/Qwen/Qwen-Image) | ancestry-base | Apache-2.0 |
44
+ | `qwen-image-edit-2511` | [Qwen/Qwen-Image-Edit-2511](https://huggingface.co/Qwen/Qwen-Image-Edit-2511) | edit-donor | Apache-2.0 |
45
+ | `qwen-image-layered` | [Qwen/Qwen-Image-Layered](https://huggingface.co/Qwen/Qwen-Image-Layered) | layer-logic-donor | Apache-2.0 |
46
+
47
+ ## Research Method
48
+
49
+ ### 1. Delta-Edit Merge
50
+
51
+ The edit capability is transferred to the foundation model via a controlled
52
+ delta injection:
53
+
54
+ ```
55
+ edit_delta = Qwen-Image-Edit-2511 − Qwen-Image (delta base)
56
+ merged = Qwen-Image-2512 + 0.35 × edit_delta
57
+ ```
58
+
59
+ Only MMDiT backbone tensors are blended. Text encoder, VAE, and RoPE
60
+ components are passed through from the foundation checkpoint unchanged.
61
+
62
+ - **Strategy:** `slerp`
63
+ - **Blend coefficient:** `0.35`
64
+ - **Foundation:** `Qwen/Qwen-Image-2512`
65
+ - **Excluded subsystems:** text_encoder, vae, rope
66
+
67
+ ### 2. Abliteration (Refusal-Direction Removal)
68
+
69
+ Refusal-direction vectors are identified in the residual stream and
70
+ projected out of target weight matrices using a norm-preserving
71
+ orthogonal projection:
72
+
73
+ ```
74
+ W′ = W − scale × (W @ r̂) ⊗ r̂ (norm-preserving variant)
75
+ ```
76
+
77
+ - **Target layers:** 18+ (attention o_proj + MLP down_proj)
78
+ - **Scale:** 1.0
79
+ - **Mode:** norm-preserving (preserves weight magnitude distribution)
80
+ - Recipe: `stage-3-abliteration.yaml`
81
+
82
+ ### 3. Quantization
83
+
84
+ | Kind | Path |
85
+ | --- | --- |
86
+ | `quant_config` | `quant-config.json` |
87
+
88
+ - **GGUF targets:** Q4_K_M, IQ4_XS (with importance-matrix)
89
+ - **EXL2 target:** 4.0 bpw
90
+ - **Runtime:** vLLM-Omni (ROCm), ExLlamaV2
91
+
92
+ ## Hardware
93
+
94
+ - **GPU:** AMD Instinct MI300X — 192 GB HBM3 VRAM
95
+ - **ROCm:** 7.2.0
96
+ - **Precision:** bf16 (merge + abliterate), quantized (deployment)
97
+
98
+ ## Usage
99
+
100
+ ```python
101
+ from diffusers import DiffusionPipeline
102
+ import torch
103
+
104
+ pipe = DiffusionPipeline.from_pretrained(
105
+ "ThirdMiddle/Qwen-Image-1.9",
106
+ torch_dtype=torch.bfloat16,
107
+ trust_remote_code=True,
108
+ )
109
+ pipe = pipe.to("cuda")
110
+
111
+ image = pipe(
112
+ "a photorealistic portrait of an astronaut on Mars at sunrise",
113
+ num_inference_steps=30,
114
+ guidance_scale=4.0,
115
+ ).images[0]
116
+ image.save("output.png")
117
+ ```
118
+
119
+ ## License
120
+
121
+ Apache-2.0 — inherited from all source models.