optimum-neuron-cache / inference-cache-config
2.18 kB
philschmid's picture
Create inference-cache-config/llama.json
1960ccb verified