WaveCut/Cosmos3-Super-Text2Image-Quanto-FP8-Transformer

WaveCut committed on Jun 1

Commit 3a5ee2a · verified · 1 Parent(s): 0d516b5

Clarify Quanto FP8 quantization

Files changed (1)

README.md +6 -3

@@ -7,15 +7,18 @@ tags:
   - diffusers
   - fp8
   - quanto
+  - optimum-quanto
   - text-to-image
 license: other
 license_name: openmdw1.1-license
 license_link: https://openmdw.ai/license/1-1/
 ---
-# Cosmos3-Super-Text2Image FP8 Transformer
-This repository contains a transformer-only FP8/float8 Quanto quantization for [nvidia/Cosmos3-Super-Text2Image](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image).
+# Cosmos3-Super-Text2Image Quanto FP8 Transformer
+This repository contains a transformer-only FP8/float8 quantization made with Hugging Face Optimum Quanto for [nvidia/Cosmos3-Super-Text2Image](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image).
+**This is a Quanto quantization, not an NVIDIA ModelOpt/NVFP quantization.** The separate NVFP experiments should be compared against this repo explicitly as a different quantization backend.
 Read NVIDIA's card, license, safety notes, and prompt-format guidance here:
 [nvidia/Cosmos3-Super-Text2Image](https://huggingface.co/nvidia/Cosmos3-Super-Text2Image).
@@ -31,7 +34,7 @@ from diffusers import Cosmos3OmniPipeline, Cosmos3OmniTransformer
 from diffusers.schedulers.scheduling_unipc_multistep import UniPCMultistepScheduler
 transformer = Cosmos3OmniTransformer.from_pretrained(
-    "WaveCut/Cosmos3-Super-Text2Image-FP8-Transformer",
+    "WaveCut/Cosmos3-Super-Text2Image-Quanto-FP8-Transformer",
     subfolder="transformer",
     torch_dtype=torch.bfloat16,
 )