chatsd/Sparse_Dynamic_MOE
Tags: Text Generation, PyTorch, custom, mixture-of-experts, transformer, language-model, conditional-computation
arxiv: 2403.07652
License: mit
Sparse_Dynamic_MOE at 41dcf48 (476 MB)
1 contributor
History: 7 commits
Latest commit: "Config Files" by chatsd (41dcf48, verified), 4 months ago
File | Size | Last commit | Updated
.gitattributes (Safe) | 1.52 kB | initial commit | 4 months ago
README.md | 64 Bytes | Create README.md | 4 months ago
dynamic_MOE_final_checkpoint.pt | 476 MB (xet) | Rename final_checkpoint.pt to dynamic_MOE_final_checkpoint.pt | 4 months ago
dynamic_moe_config.json | 1.33 kB | Config Files | 4 months ago
sparse_moe_config.json | 1.23 kB | Config Files | 4 months ago