aws-neuron/optimum-neuron-cache
License: apache-2.0
4 contributors
History: 13698 commits
Latest commit: 3f39cba (verified) by dacorvo (HF Staff), about 1 month ago: "Synchronizing local compiler cache."
| Name | Size | Last commit message | Updated |
|---|---|---|---|
| inference-cache-config | - | use longer sequence length for llama3 on trn2 | about 1 month ago |
| neuronxcc-2.19.8089.0+8ab9f450 | - | Synchronizing local compiler cache. | 3 months ago |
| neuronxcc-2.20.9961.0+0acef03a | - | Synchronizing local compiler cache. | 3 months ago |
| neuronxcc-2.21.18209.0+043b1bf7 | - | Synchronizing local compiler cache. | about 1 month ago |
| neuronxcc-2.21.33363.0+82129205 | - | Synchronizing local compiler cache. | about 1 month ago |
| neuronxcc-2.22.12471.0+b4a00d10 | - | Synchronizing local compiler cache. | about 2 months ago |
| .gitattributes | 1.99 MB | Synchronizing local compiler cache. | about 1 month ago |
| README.md | 1.27 kB | Add SageMaker deployment instructions | almost 2 years ago |