Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

aws-neuron
/
optimum-neuron-cache

Model card Files Files and versions
xet
Community
669
optimum-neuron-cache / inference-cache-config /trn1
10.3 kB
Ctrl+K
Ctrl+K
  • 5 contributors
History: 8 commits
dacorvo's picture
dacorvo HF Staff
Update inference-cache-config/trn1/llama4.json
ecc0bf7 verified 2 months ago
  • granite.json
    1.59 kB
    Update inference-cache-config/trn1/granite.json 5 months ago
  • llama3.json
    3.35 kB
    Update inference-cache-config/trn1/llama3.json 3 months ago
  • llama4.json
    876 Bytes
    Update inference-cache-config/trn1/llama4.json 2 months ago
  • mixtral.json
    760 Bytes
    Update inference-cache-config/trn1/mixtral.json 6 months ago
  • phi4.json
    601 Bytes
    clean-up Trainium 1 cached configurations 6 months ago
  • qwen3-moe.json
    575 Bytes
    clean-up Trainium 1 cached configurations 6 months ago
  • qwen3.json
    2.13 kB
    clean-up Trainium 1 cached configurations 6 months ago
  • smollm3.json
    430 Bytes
    clean-up Trainium 1 cached configurations 6 months ago