Instructions to use WaveCut/ideogram-4-sdnq-uint4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use WaveCut/ideogram-4-sdnq-uint4 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("WaveCut/ideogram-4-sdnq-uint4", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- Draw Things
- DiffusionBee
Replace comparison matrices with vertical layout
Browse filesAdd narrow vertical side-by-side WebP q95 matrices for FP8 vs SDNQ and RTX 4090 SDNQ vs official NF4; remove the old wide/square matrix assets after updating the model card references.
.gitattributes
CHANGED
|
@@ -43,3 +43,5 @@ assets/benchmarks/opensource2.png filter=lfs diff=lfs merge=lfs -text
|
|
| 43 |
assets/samples/collage_landscape.jpg filter=lfs diff=lfs merge=lfs -text
|
| 44 |
assets/comparison_matrix.webp filter=lfs diff=lfs merge=lfs -text
|
| 45 |
assets/sdnq_vs_nf4_4090_side_by_side.webp filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
| 43 |
assets/samples/collage_landscape.jpg filter=lfs diff=lfs merge=lfs -text
|
| 44 |
assets/comparison_matrix.webp filter=lfs diff=lfs merge=lfs -text
|
| 45 |
assets/sdnq_vs_nf4_4090_side_by_side.webp filter=lfs diff=lfs merge=lfs -text
|
| 46 |
+
assets/original_vs_sdnq_vertical.webp filter=lfs diff=lfs merge=lfs -text
|
| 47 |
+
assets/sdnq_vs_nf4_4090_vertical.webp filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -83,9 +83,9 @@ The FP8 baseline was loaded through the upstream `ideogram4` `Ideogram4Pipeline.
|
|
| 83 |
|
| 84 |
## Example Matrix
|
| 85 |
|
| 86 |
-
The matrix below
|
| 87 |
|
| 88 |
-

|
| 89 |
|
| 90 |
## Prompt Set
|
| 91 |
|
|
|
|
| 105 |
## Files
|
| 106 |
|
| 107 |
- `prompts.json`: the 10 structured prompts used for the comparison.
|
| 108 |
+
- `assets/original_vs_sdnq_vertical.webp`: vertical side-by-side WebP comparison matrix for original FP8 vs SDNQ UInt4, quality 95.
|
| 109 |
+
- `assets/sdnq_vs_nf4_4090_vertical.webp`: vertical side-by-side WebP comparison matrix for the RTX 4090 SDNQ vs official NF4 follow-up, quality 95.
|
| 110 |
- `benchmark/`: raw benchmark JSONL/CSV files and `summary.json`.
|
| 111 |
- `quantization_manifest.json`: component-level quantization timings, storage, and VRAM peaks.
|
| 112 |
- `ideogram4_sdnq_pipeline.py`: loader helper for the SDNQ custom transformer components.
|
|
|
|
| 120 |
| sdnq | 10.00 | 211.61 | 14124.00 | 14466.00 | 59.65 | 37.05 | 37.57 | 19768.00 | 20521.00 |
|
| 121 |
| nf4 | 10.00 | 269.31 | 15370.00 | 15766.00 | 36.57 | 36.31 | 36.77 | 21012.00 | 21801.00 |
|
| 122 |
|
| 123 |
+

|
| 124 |
|
| 125 |
Raw follow-up metrics are in `benchmark/summary_4090_sdnq_vs_nf4.json`, `benchmark/sdnq_4090_metrics.*`, and `benchmark/nf4_4090_metrics.*`. The exact runner used for the follow-up is `benchmark/followup_runner.py`.
|
| 126 |
|
assets/{comparison_matrix.webp → original_vs_sdnq_vertical.webp}
RENAMED
|
File without changes
|
assets/{sdnq_vs_nf4_4090_side_by_side.webp → sdnq_vs_nf4_4090_vertical.webp}
RENAMED
|
File without changes
|