SlimFactoryHub
/
SlimMoE-250M-base
Text Generation
Transformers
Safetensors
4 datasets
English
slim_moe
MoE
Text-Generation
Instruction Following
VGQA
Research
SLM
custom_code
License:
apache-2.0
main
SlimMoE-250M-base
/
PreTraining.pdf
Commit History
Pre-Training Graphs
fb3f37a
verified
SlimFactory
committed on
Dec 31, 2025