Hugging Face
chatsd/Sparse_Dynamic_MOE
Text Generation
PyTorch
custom
mixture-of-experts
Mixture of Experts
transformer
language-model
conditional-computation
arxiv: 2403.07652
License: mit
1 contributor
History: 2 commits
Latest commit: chatsd, "Upload final_checkpoint.pt" (1008307, verified), 4 months ago
.gitattributes    1.52 kB    Safe    initial commit    4 months ago
final_checkpoint.pt    476 MB    xet    Upload final_checkpoint.pt    4 months ago