Trinity-337B-W4A16 / quantization_config.json

Commit History

W4A16 RTN quantization (216/256 experts, 166GB)
8032f24
verified

0xSero commited on