qwen_2.5_vl_7b_fp4.safetensors

#21

by Landsharkbaby - opened Feb 8

Discussion

Landsharkbaby

Feb 8

Could you please make an NVFP4 version of qwen_2.5_vl_7b_fp4.safetensors? Thank you.

andrew109

Feb 11

Yes, that would be wonderful. Or maybe there are instructions for creating "awq", "nf4", "4bit", or "fp4" versions of qwen_2.5_vl_7b from the files for diffusers? "Turboderp" has EXL2 models, "TheBloke" has AWQ/GPTQ models, and "Bartowski" has EXL2 models, but they are not for ComfyUI. Maybe there is some method to convert them? On my GeForce RTX 5060 Ti with 16 GB, qwen_2.5_vl_7b_fp8_scaled.safetensors runs too slowly.
