ItsMaxNorm/DeepSeek-R1-Fast-llada-5B-GRPO
llada
custom_code
main
DeepSeek-R1-Fast-llada-5B-GRPO/tokenizer.json
Commit History
Training in progress, epoch 1
2cf2eda
verified
ItsMaxNorm committed on Aug 10, 2025