aws-neuron/optimum-neuron-cache
License: apache-2.0
optimum-neuron-cache/inference-cache-config/trn1 (revision 009561d)
Size: 9.93 kB
Contributors: 3
History: 4 commits
Latest commit: 1fdf53e (verified), 17 days ago, by dacorvo (HF Staff)
    Update inference-cache-config/trn1/granite.json
File              Size        Last commit                                        Updated
granite.json      1.59 kB     Update inference-cache-config/trn1/granite.json    17 days ago
llama3.json       2.82 kB     clean-up Trainium 1 cached configurations          about 2 months ago
llama4.json       1.04 kB     clean-up Trainium 1 cached configurations          about 2 months ago
mixtral.json      760 Bytes   Update inference-cache-config/trn1/mixtral.json    about 2 months ago
phi4.json         601 Bytes   clean-up Trainium 1 cached configurations          about 2 months ago
qwen3-moe.json    575 Bytes   clean-up Trainium 1 cached configurations          about 2 months ago
qwen3.json        2.13 kB     clean-up Trainium 1 cached configurations          about 2 months ago
smollm3.json      430 Bytes   clean-up Trainium 1 cached configurations          about 2 months ago