Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

aws-neuron
/
optimum-neuron-cache

Model card Files Files and versions
xet
Community
611
optimum-neuron-cache / inference-cache-config /trn1
9.93 kB
  • 3 contributors
History: 4 commits
dacorvo's picture
dacorvo HF Staff
Update inference-cache-config/trn1/granite.json
1fdf53e verified 17 days ago
  • granite.json
    1.59 kB
    Update inference-cache-config/trn1/granite.json 17 days ago
  • llama3.json
    2.82 kB
    clean-up Trainium 1 cached configurations about 2 months ago
  • llama4.json
    1.04 kB
    clean-up Trainium 1 cached configurations about 2 months ago
  • mixtral.json
    760 Bytes
    Update inference-cache-config/trn1/mixtral.json about 2 months ago
  • phi4.json
    601 Bytes
    clean-up Trainium 1 cached configurations about 2 months ago
  • qwen3-moe.json
    575 Bytes
    clean-up Trainium 1 cached configurations about 2 months ago
  • qwen3.json
    2.13 kB
    clean-up Trainium 1 cached configurations about 2 months ago
  • smollm3.json
    430 Bytes
    clean-up Trainium 1 cached configurations about 2 months ago