optimum-neuron-cache / inference-cache-config
22.1 kB
dacorvo HF Staff
Update inference-cache-config/llama4.json
711bb96 verified