Llama-3.1-8B-Instruct-125m-4bit / quantize_config.json

Commit History

AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False
f40990f
verified

Sumail commited on