Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
quyanh
/
OpenRS-GRPO
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
knoveleng/open-s1
knoveleng/open-deepscaler
qwen2
conversational
text-generation-inference
arxiv:
2503.16219
License:
mit
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
main
OpenRS-GRPO
3.57 GB
2 contributors
History:
37 commits
quyanh
remove markdown command
5bb0884
verified
10 months ago
assets
upload assets
11 months ago
.gitattributes
1.8 kB
upload assets
11 months ago
README.md
2.71 kB
remove markdown command
10 months ago
config.json
768 Bytes
Training in progress, step 2
11 months ago
generation_config.json
181 Bytes
Add checkpoint-300
11 months ago
latest
14 Bytes
Add checkpoint-300
11 months ago
model.safetensors
3.55 GB
xet
Training in progress, step 450
11 months ago
scheduler.pt
1.06 kB
xet
Add checkpoint-300
11 months ago
special_tokens_map.json
485 Bytes
Training in progress, step 2
11 months ago
tokenizer.json
11.4 MB
xet
Training in progress, step 2
11 months ago
tokenizer_config.json
6.77 kB
Training in progress, step 2
11 months ago
trainer_state.json
137 kB
Add checkpoint-300
11 months ago
training_args.bin
8.25 kB
xet
Training in progress, step 50
11 months ago
zero_to_fp32.py
29.2 kB
Add checkpoint-300
11 months ago