8xH20 141GB cuda out of memory
#6 opened 7 days ago
by
ErisLU
ERROR: Should have a `model_type` key in its config.json
#5 opened 7 days ago
by
PakJoeng
Any chances for A100?
1
#4 opened 9 days ago
by
traphix
GLM-5.2-W4AFP8 on 8×H100: fp8_e4m3 KV cache produces corrupted output, while BF16 KV works correctly
1
#3 opened 9 days ago
by
loveblairsky
Is the quantize script opensource?
3
#2 opened 9 days ago
by
wxsm
KTransformers + SGLang
1
#1 opened 10 days ago
by
mtcl