TokForge β€” RealVisXL V4.0 Lightning (Qualcomm Hexagon NPU)

RealVisXL V4.0 Lightning image generation for the Qualcomm Hexagon NPU (HTP), packaged for on-device image generation in the TokForge Android app (dev.tokforge). This is the higher-quality 1024Γ—1024 "Faithful" tier alongside the SD1.5 NPU bundle.

vc410: this repo replaces the previous SDXL-Turbo bins (few-step distillation gave fused heads / doubled bodies / extra limbs) with RealVisXL V4.0 Lightning, which is adversarially distilled for deterministic few-step Euler and renders clean, photoreal people. The W8A16 NPU quant is unchanged β€” the checkpoint was the cure.

The model is quantized to W8A16 (8-bit weights, 16-bit activations) and compiled to QNN HTP context binaries that run on the phone's Hexagon DSP. The pipeline uses fp16 text encoders and a TAESDXL tiny-VAE decoder, runs 6-step EulerDiscrete, and is guidance-free (one UNet pass per step).

Based on

SG161222/RealVisXL_V4.0_Lightning β€” CreativeML OpenRAIL++-M license (commercial use permitted, no revenue cap).

Format

QNN HTP context binaries (W8A16), native 1024px (128Γ—128 latent). These are compiled for the Hexagon DSP and are not a portable format like GGUF. The repo ships native sets for V73 / V75 / V79 / V81; the app reads the device Hexagon arch (dsp_arch) and downloads the matching set. Forward-compatibility (a lower-arch binary also runs on a higher-arch DSP) still applies as a fallback. Device-verified clean on V75 (Lenovo SM8650) and V81 (RedMagic SM8850).

Pipeline

Stage File
CLIP-L text encoder (fp16) <arch>/text_encoder_1_fp16.bin
OpenCLIP-bigG text encoder (fp16) <arch>/text_encoder_2_fp16.bin
Combined-emb MLP (host CPU) <arch>/sdxl_emb_mlp.bin
UNet (W8A16, DSP) <arch>/unet.bin
TAESDXL tiny-VAE (CPU MNN) <arch>/taesdxl_decoder.mnn
Dual CLIP BPE tokenizers <arch>/tokenizer/, <arch>/tokenizer_2/

Scheduler: EulerDiscrete (deterministic), trailing spacing, epsilon prediction, 6 steps, guidance_scale 0. Runs via the license-clean libsdxl_qnn_driver in the TokForge app.

See manifest.json for the per-arch file list, sizes, and md5 checksums.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Collection including darkmaniac7/TokForge-SDXL-QNN-NPU