Luna-MNN / config.json
beyoru: Upload Luna MNN 4-bit quantized model (f6772d0, verified)
210 Bytes
{
  "llm_model": "llm.mnn",
  "llm_weight": "llm.mnn.weight",
  "backend_type": "cpu",
  "thread_num": 4,
  "precision": "low",
  "memory": "low",
  "sampler_type": "penalty",
  "penalty": 1.1
}
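
A quick way to sanity-check this file before handing it to a runtime is to parse it and confirm that each key is present with a plausible type. The sketch below uses only Python's standard `json` module. The `EXPECTED_TYPES` table and the `load_config` helper are hypothetical, inferred from the values in this file rather than taken from any official MNN schema, so treat them as assumptions.

```python
import json

# Contents of config.json, copied from the file above.
CONFIG_TEXT = """
{
  "llm_model": "llm.mnn",
  "llm_weight": "llm.mnn.weight",
  "backend_type": "cpu",
  "thread_num": 4,
  "precision": "low",
  "memory": "low",
  "sampler_type": "penalty",
  "penalty": 1.1
}
"""

# Assumption: expected types are inferred from the values in this file,
# not from an official schema.
EXPECTED_TYPES = {
    "llm_model": str,
    "llm_weight": str,
    "backend_type": str,
    "thread_num": int,
    "precision": str,
    "memory": str,
    "sampler_type": str,
    "penalty": (int, float),
}


def load_config(text):
    """Parse the JSON text and check every expected key and its type."""
    cfg = json.loads(text)
    for key, expected in EXPECTED_TYPES.items():
        if key not in cfg:
            raise KeyError(f"missing key: {key}")
        if not isinstance(cfg[key], expected):
            raise TypeError(f"{key} has unexpected type {type(cfg[key]).__name__}")
    return cfg


config = load_config(CONFIG_TEXT)
print(config["backend_type"], config["thread_num"])
```

In practice the text would be read from `config.json` on disk instead of an inline string; the inline copy only keeps the sketch self-contained.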