Standalone Source Map
Important files:
fastvideo/models/dits/wanvideo.py- Defines
WanTransformerBlock_VSA. - Adds
to_gate_compress. - Passes
gate_compressinto self-attention.
- Defines
fastvideo/attention/backends/sparse_fp4_compress_attn.py- Backend name:
SPARSE_FP4_COMPRESS_ATTN. - Sparse FP4 main branch plus high-precision block-mean compress branch.
- Prints
FASTVIDEO_BACKEND_CONFIRM: SPARSE_FP4_COMPRESS_ATTN is running.
- Backend name:
fastvideo/attention/backends/sparse_fp4_attn.py- Base sparse FP4 attention without compress branch.
fastvideo/layers/nvfp4_fake_quant_linear.pyNVFP4FakeQuantReplicatedLinear.- Wan replacement helpers for normal fake-quant QAT and SVD-LoRA variants.
fastvideo/train/models/wan/wan.py- Training-time switches for enabling fake-quant linear and gate quantization.
fastvideo/pipelines/basic/wan/wan_dmd_pipeline.py- Deployment/validation pipeline used by this checkpoint.
- It constructs
FlowMatchEulerDiscreteSchedulerfromflow_shift.
fastvideo/pipelines/stages/denoising.py- Contains
DmdDenoisingStage, the actual solver path for online validation. - Uses fixed DMD timesteps, predicts clean video, and re-noises to the next
timestep instead of running a normal UniPC/Euler
scheduler.steploop.
- Contains
fastvideo/models/schedulers/scheduling_flow_match_euler_discrete.py- Flow-match scheduler implementation used for DMD conversion and re-noising.
fastvideo/platforms/interface.py,fastvideo/platforms/cuda.py- Registers
SPARSE_FP4_ATTNandSPARSE_FP4_COMPRESS_ATTN.
- Registers
fastvideo-kernel/python/fastvideo_kernel/...- Block-sparse attention and quantization kernel source used by the backend.
The full source snapshot in ../repo_source/ is preferred when running the
included DCP export script. This directory is meant for quick inspection and
porting into another inference stack.
Solver reproduction summary:
- Use
WanDMDPipeline, not the vanilla diffusers scheduler inmodel_index.json. - The copied
scheduler/scheduler_config.jsonis base-model metadata (UniPCMultistepScheduler,solver_type=bh2) and is not the solver used by online validation. - Matching solver:
DmdDenoisingStagewithFlowMatchEulerDiscreteScheduler,shift=8.0, DMD timesteps[1000, 757, 522],num_inference_steps=3,guidance_scale=6.0, attention backendSPARSE_FP4_COMPRESS_ATTN, sparsity0.8.