inference-optimization 's Collections

Qwen3.6-HIGGS

Qwen3.6-35B-A3B mixed-precision HIGGS model variants, plus base FP16/FP8/NVFP4 references.