HPC-Quantize / hexstate_requantize.py

Commit History

Q8_0 tied embeddings
f32b3c6
verified

CompressedGemma committed on

Experimental support for other LLMs
ae8c38d
verified

CompressedGemma committed on

It's only calibrated for Gemma, atm.
07b428c
verified

CompressedGemma committed on