Update inference-cache-config/trn1/llama3.json 412a86d verified dacorvo HF Staff commited on 11 days ago
Add llama3 configurations with longer sequences 6d9930a verified dacorvo HF Staff commited on 14 days ago