TODO: MXFP8/NVFP4 Support
Status: Temporarily Disabled
MXFP8 and NVFP4 quantization formats are temporarily disabled because comfy-kitchen fails to build on Hugging Face Spaces infrastructure.
Issue
The comfy-kitchen CUDA build fails due to a CUDA 12.9/glibc header conflict:
cospi/sinpi function exception specification mismatch between CUDA's math_functions.h and system headers
Planned Resolution
Options being considered:
- Pre-built wheel: Host a pre-compiled comfy-kitchen wheel
- Custom Dockerfile: Build comfy-kitchen in a controlled environment
- PyTorch fallback: Implement pure PyTorch quantization as fallback
Currently Available Formats
- FP8 Tensorwise (per-tensor scaling)
- FP8 Block (per-block scaling, 64 or 128 block size)
- INT8 Block (Triton-based, 128 block size)
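The difference between per-tensor and per-block scaling can be illustrated with a minimal sketch in plain Python. It is not the Space's implementation: the helper names are hypothetical, and it assumes the FP8 E4M3 variant (largest finite value 448), which this document does not specify. It only computes the scale factors and does not perform the cast to FP8.

```python
# Assumption: FP8 E4M3, whose largest finite value is 448.
FP8_E4M3_MAX = 448.0

def tensorwise_scale(values):
    """Hypothetical helper: one scale for all values (absolute max / FP8 max)."""
    amax = max(abs(v) for v in values)
    # Avoid a zero scale for an all-zero input.
    return amax / FP8_E4M3_MAX if amax > 0 else 1.0

def block_scales(values, block_size=128):
    """Hypothetical helper: one scale per contiguous block of block_size values."""
    return [
        tensorwise_scale(values[i:i + block_size])
        for i in range(0, len(values), block_size)
    ]

# Small values in the first block, one large outlier range in the second.
weights = [0.5] * 128 + [448.0] * 128
per_tensor = tensorwise_scale(weights)   # driven by the largest value overall
per_block = block_scales(weights, 128)   # the first block keeps a much finer scale
```

With per-tensor scaling, a single outlier sets the scale for every value. Per-block scaling limits that effect to the block containing the outlier, at the cost of storing one scale per block.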
Reference
- comfy-kitchen branch: sc_mm_mxfp8_sync
- MXFP8 requires SM >= 10.0 (Blackwell GPU)
- NVFP4 requires SM >= 10.0/12.0 (Blackwell GPU)
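
Once these formats are re-enabled, the SM requirements above could be enforced with a small gate like the following sketch. The function and table names are hypothetical, and reading "10.0/12.0" as "any compute capability of at least 10.0" is an assumption. The capability tuple would typically come from `torch.cuda.get_device_capability()`, which returns `(major, minor)`.

```python
# Minimum compute capability per format, taken from the list above.
# Assumption: "10.0/12.0" for NVFP4 is treated as "at least 10.0".
MIN_SM = {
    "MXFP8": (10, 0),
    "NVFP4": (10, 0),
}

def meets_sm_requirement(fmt, capability):
    """Hypothetical helper: True if (major, minor) satisfies the format's minimum SM."""
    return tuple(capability) >= MIN_SM[fmt]

# Example: compute capability 9.0 fails the gate, 12.0 passes it.
hopper_ok = meets_sm_requirement("MXFP8", (9, 0))
blackwell_ok = meets_sm_requirement("NVFP4", (12, 0))
```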