DLM-2.1-14B-GPTQ / tokenizer.json

Commit History

Re-quantize with thinking-aware calibration (v3): fix <think> token handling
d9f7e34
verified

hkyoo89 commited on

Upload GPTQ 4-bit quantized DNA-2.1-14B (W4A16, group_size=128, llm-compressor)
b66177b
verified

hkyoo89 commited on