NVFP4 / FP8 Quantizations

#19

by vincentzed-hf - opened Jun 4

Jun 4

These are unofficial, but feel free to try them!
https://huggingface.co/AxionML/Gemma-4-12B-NVFP4
https://huggingface.co/AxionML/Gemma-4-12B-FP8

thnamratha

Google org Jun 4

edited Jun 4

Hi @vincentzed-hf ,

Thank you so much for taking the time to provide these quantized versions. We truly appreciate your work on NVFP4 and FP8 quantized versions of the Gemma-4-12B model.

arbv

Jun 4

Is there any chance we could get QAT versions of the Gemma 4 releases (as was the case for Gemma 3)? That would be awesome. I still use the Gemma 3 QAT models on my hardware to this day.
