Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
31
Follow
AWS Inferentia and Trainium
172
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
669
e4df17f
optimum-neuron-cache
/
inference-cache-config
/
trn1
10.3 kB
Ctrl+K
Ctrl+K
5 contributors
History:
8 commits
dacorvo
HF Staff
Update inference-cache-config/trn1/llama4.json
ecc0bf7
verified
2 months ago
granite.json
Safe
1.59 kB
Update inference-cache-config/trn1/granite.json
5 months ago
llama3.json
Safe
3.35 kB
Update inference-cache-config/trn1/llama3.json
3 months ago
llama4.json
Safe
876 Bytes
Update inference-cache-config/trn1/llama4.json
2 months ago
mixtral.json
Safe
760 Bytes
Update inference-cache-config/trn1/mixtral.json
6 months ago
phi4.json
Safe
601 Bytes
clean-up Trainium 1 cached configurations
6 months ago
qwen3-moe.json
Safe
575 Bytes
clean-up Trainium 1 cached configurations
6 months ago
qwen3.json
Safe
2.13 kB
clean-up Trainium 1 cached configurations
6 months ago
smollm3.json
Safe
430 Bytes
clean-up Trainium 1 cached configurations
6 months ago