NVFP4 for vLLM

#6
by Glomolg - opened

Hi and thanks for this model. Can we have a quantized NVFP4 version that can run on vLLM ?

Sign up or log in to comment