optimum-neuron-cache / inference-cache-config
22.3 kB
dacorvo's picture
dacorvo HF Staff
Update inference-cache-config/llama-variants.json
a510ca8 verified