jburtoft/Trinity-Nano-Neuron-TP1

neuronx-distributed-inference
neuron
aws-inferentia
inf2
Mixture of Experts
pre-compiled
Trinity-Nano-Neuron-TP1 / weights
12.2 GB
  • 1 contributor
History: 1 commit
jburtoft
Trinity-Nano compiled for Neuron TP=1 BS=1 seq_len=2048 (SDK 2.28, pre-sharded weights)
0312d1a verified 9 days ago
  • tp0_sharded_checkpoint.safetensors
    12.2 GB
    xet