sumitdotml/moe-emergence
Text Generation
Transformers
Safetensors
codeparrot/codeparrot-clean
allenai/ai2_arc
allenai/c4
English
mixture-of-experts
gpt2
research
expert-specialization
License: mit
main: moe-emergence/README.md
Commit History
updated model card with ablation results and all 4 runs
4049aa7
sumit committed 26 days ago
add dense and moe checkpoints
3ff42e6
sumitdotml committed 27 days ago