filter-2b-w8a8 / generation_config.json

Commit History

Add W8A8 INT8 quantized filter-2b (SmoothQuant + GPTQ W8A8)
844fc11
verified

alphaXiv commited on