GGUF
conversational

FP16 GGUF

#1
by redaihf - opened

Ordinary conversion of the Torch-format model fails due to FP8 weights. Please upload a full FP16 GGUF to allow for quantization to desired formats.

Sign up or log in to comment