inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation
•
81B
•
Updated
•
12
FP8-dynamic, FP8-block, NVFP4, INT4, INT8 versions of Qwen3-Next-80B-A3B-Instruct and Qwen3-Next-80B-A3B-Thinking Models