KMasaki/8expert_2granularity_0shared_top2_0.52b-Distill
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
mixtral
Generated from Trainer
open-r1
trl
sft
conversational
text-generation-inference
main / 8expert_2granularity_0shared_top2_0.52b-Distill / tokenizer.json
Commit History
Training in progress, step 100
5719910
verified
KMasaki committed on Apr 7, 2025