Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

aws-neuron
/
optimum-neuron-cache

Model card Files Files and versions
xet
Community
637
optimum-neuron-cache / inference-cache-config /trn1
10.5 kB
  • 4 contributors
History: 6 commits
dacorvo's picture
dacorvo HF Staff
Update inference-cache-config/trn1/llama3.json
412a86d verified about 24 hours ago
  • granite.json
    1.59 kB
    Update inference-cache-config/trn1/granite.json 2 months ago
  • llama3.json
    3.35 kB
    Update inference-cache-config/trn1/llama3.json about 24 hours ago
  • llama4.json
    1.04 kB
    clean-up Trainium 1 cached configurations 3 months ago
  • mixtral.json
    760 Bytes
    Update inference-cache-config/trn1/mixtral.json 3 months ago
  • phi4.json
    601 Bytes
    clean-up Trainium 1 cached configurations 3 months ago
  • qwen3-moe.json
    575 Bytes
    clean-up Trainium 1 cached configurations 3 months ago
  • qwen3.json
    2.13 kB
    clean-up Trainium 1 cached configurations 3 months ago
  • smollm3.json
    430 Bytes
    clean-up Trainium 1 cached configurations 3 months ago