fariasultana/MiniMind
Tags: Text Generation, Transformers, Safetensors, HuggingFaceFW/fineweb, wikipedia, bookcorpus, English, minimind, minimax_m2, conversational, custom_code, fp8, max2, Mixture of Experts, mixture-of-experts, gqa, grouped-query-attention, edge-deployment, mobile, android, efficient, llama-cpp, causal-lm, Eval Results
arXiv: 2504.07164, 2509.06501, 2509.13160
License: apache-2.0
Branch: main
Directory: MiniMind/training
Total size: 25.5 kB, 1 contributor, history: 1 commit
Latest commit: 8b187bb (verified) by fariasultana, 23 days ago: "MiniMind Max2 - Efficient MoE Language Model"
File              Size        Last commit                                     Updated
__init__.py       251 Bytes   MiniMind Max2 - Efficient MoE Language Model    23 days ago
dataset.py        3.86 kB     MiniMind Max2 - Efficient MoE Language Model    23 days ago
distillation.py   11.6 kB     MiniMind Max2 - Efficient MoE Language Model    23 days ago
trainer.py        9.82 kB     MiniMind Max2 - Efficient MoE Language Model    23 days ago