Hugging Face
Models: 3,195
Sort: Trending
Active filters: moe
kshitijthakkar/moe-415m-147m-16x2-12L-xlarge-450m-16exp • Updated Feb 2 • 2
kshitijthakkar/moe-161m-123m-4x2-12L-4exp-large-experts • Updated Feb 2 • 2
kshitijthakkar/moe-198m-114m-8x2-12L-8exp-balanced • Updated Feb 2 • 2
kshitijthakkar/moe-340m-107m-24x2-12L-24exp-specialized • Updated Feb 2 • 2
kshitijthakkar/moe-350m-102m-16x1-12L-top1-routing • Updated Feb 2 • 1
kshitijthakkar/moe-274m-132m-16x4-12L-top4-routing • Updated Feb 2 • 2
kshitijthakkar/moe-240m-103m-12x2-16L-deep-narrow-16l • Updated Feb 2 • 2
kshitijthakkar/moe-270m-132m-12x2-8L-shallow-wide-8l • Updated Feb 2 • 2
kshitijthakkar/moe-229m-111m-12x2-10L-full-attention-no-gqa • Updated Feb 2 • 1
kshitijthakkar/moe-284m-119m-12x2-14L-aggressive-gqa-1kv • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr5e-06 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr1e-05 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr3e-05 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr5e-05 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr1e-04 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr2e-04 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr3e-04 • Updated Feb 2 • 1
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr5e-04 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-lr1e-03 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs2-ctx512 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs2-ctx1024 • Updated Feb 2 • 2
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs2-ctx2048 • Updated Feb 2 • 3
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs4-ctx512 • Updated Feb 2 • 3
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs4-ctx1024 • Updated Feb 2 • 3
kshitijthakkar/moe-255m-114m-12x2-12L-full-attention-no-gqa-bs8-ctx512 • Updated Feb 2 • 4
GadflyII/GLM-4.7-Flash-MTP-NVFP4 • Text Generation • 19B • Updated Feb 2 • 17.2k • 4
nebius/EAGLE3-gpt-oss-20b • Text Generation • 0.4B • Updated 2 days ago • 33
nebius/EAGLE3-gpt-oss-120b • Text Generation • 0.4B • Updated 2 days ago • 38
rawcell/Moonlight-16B-A3B-Instruct-abliterated • Text Generation • 16B • Updated 29 days ago • 12
dipeshmajithia/Mirror-80M-MoE • Text Generation • Updated 29 days ago