fix: update torchao FP8 quantization API for HF Spaces compatibility
Browse files
src/video_generator_hf.py
CHANGED
|
@@ -60,8 +60,8 @@ def _get_pipe():
|
|
| 60 |
|
| 61 |
# Quantize transformer to FP8 to fit in 24GB ZeroGPU VRAM
|
| 62 |
# (~28GB bf16 → ~14GB fp8). VAE + image encoder stay float32.
|
| 63 |
-
from torchao.quantization import quantize_, float8_weight_only
|
| 64 |
-
quantize_(_pipe.transformer, float8_weight_only())
|
| 65 |
|
| 66 |
_pipe.to("cuda")
|
| 67 |
|
|
|
|
| 60 |
|
| 61 |
# Quantize transformer to FP8 to fit in 24GB ZeroGPU VRAM
|
| 62 |
# (~28GB bf16 → ~14GB fp8). VAE + image encoder stay float32.
|
| 63 |
+
from torchao.quantization import quantize_, Float8WeightOnlyConfig
|
| 64 |
+
quantize_(_pipe.transformer, Float8WeightOnlyConfig())
|
| 65 |
|
| 66 |
_pipe.to("cuda")
|
| 67 |
|