--- license: apache-2.0 base_model: - Wan-AI/Wan2.2-T2V-A14B library_name: diffusers tags: - video_generation - NVFP4 - Sparse_Attention - Wan --- # š¬ Wan2.2-NVFP4-Sparse > **An extremely efficient Wan 2.2 14B variant: NVFP4 Quantization-Aware Step Distillation with Sparse Attention for Blackwell Architecture** [](https://github.com/ModelTC/LightX2V) [](https://huggingface.co/lightx2v/) ## š Table of Contents - [⨠Features](#-features) - [š Quick Start](#-quick-start) - [š¬ Generation Results](#-generation-results) - [ā” Performance Comparison](#-performance-comparison) - [ā ļø Notes](#ļø-notes) - [š¤ Community](#-community) ## ⨠Features - **ā” 4-Step Inference**: Two high-noise expert steps followed by two low-noise expert steps, enabling extremely fast Wan2.2 MoE generation on a single Blackwell GPU. - **šÆ NVFP4 Quantization**: Quantization-aware step distillation reduces memory traffic and compute cost while targeting Blackwell architecture. - **š§© Sparse Attention**: Accelerates the costly O(n²) self-attention workload with sparse attention, reducing end-to-end latency for high-resolution video generation. - **š§ LightX2V Integration**: Recommended runtime stack for stable deployment and best performance. - **š High-Quality Generation**: Preserves the visual quality of Wan2.2-T2V-14B while dramatically improving inference speed. ## š Quick Start We strongly recommend using the official LightX2V Docker image for the cleanest environment and best reproducibility. ### Option A: Docker Recommended ```bash # 1. Pull LightX2V Docker image docker pull lightx2v/lightx2v:26052801-cu130-5090 # 2. Run inference bash scripts/wan22/distill/run_wan22_moe_t2v_extreme.sh ``` ### Option B: Manual Installation If Docker is not available, install the environment manually: ```bash # 1. Install LightX2V git clone https://github.com/ModelTC/LightX2V.git cd LightX2V uv pip install -v . # 2. Install NVFP4 Kernel pip install scikit_build_core uv git clone https://github.com/NVIDIA/cutlass.git cd lightx2v_kernel MAX_JOBS=$(nproc) CMAKE_BUILD_PARALLEL_LEVEL=$(nproc) \ uv build --wheel \ -Cbuild-dir=build . \ -Ccmake.define.CUTLASS_PATH=/path/to/cutlass \ --verbose --color=always --no-build-isolation pip install dist/*whl --force-reinstall --no-deps # 3. Run inference bash scripts/wan22/distill/run_wan22_moe_t2v_extreme.sh ``` Script: [run_wan22_moe_t2v_extreme.sh](https://github.com/ModelTC/LightX2V/blob/main/scripts/wan22/distill/run_wan22_moe_t2v_extreme.sh) ## š¬ Generation Results
"Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage"