Request for AWQ Quant of Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-i1
#6 opened 1 day ago by 0xburakcelik
AWQ 4-bit version of this Opus-Distilled-v2 model?
9
#5 opened 18 days ago by 0xburakcelik
More info about data-free quantization
#4 opened about 1 month ago by naripok
--max-model-len 32768 seems a bit too small for agent use cases?
3
#3 opened about 1 month ago by edwarddukewu
This is the best quant version in the world, better than FP8
3
#2 opened about 1 month ago by kq
My personal vLLM launch cmd on my old personal 2x3090 workstation
7
#1 opened about 2 months ago by tclf90