Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ItsMaxNorm
/
DeepSeek-R1-Fast-llada-5B-GRPO
like
0
llada
custom_code
Model card
Files
Files and versions
xet
Community
main
DeepSeek-R1-Fast-llada-5B-GRPO
9.82 MB
1 contributor
History:
3 commits
ItsMaxNorm
Training in progress, epoch 1
f26127b
verified
6 months ago
.gitattributes
1.52 kB
initial commit
6 months ago
chat_template.jinja
2.15 kB
Training in progress, epoch 1
6 months ago
config.json
1.85 kB
Training in progress, epoch 1
6 months ago
special_tokens_map.json
747 Bytes
Training in progress, epoch 1
6 months ago
tokenizer.json
9.75 MB
Training in progress, epoch 1
6 months ago
tokenizer_config.json
51.3 kB
Training in progress, epoch 1
6 months ago
training_args.bin
9.78 kB
xet
Training in progress, epoch 1
6 months ago