Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
knoveleng
/
OpenRS-GRPO
like
5
Follow
Knovel Engineering
36
Text Generation
Safetensors
knoveleng/open-rs
knoveleng/open-s1
knoveleng/open-deepscaler
qwen2
conversational
arxiv:
2503.16219
License:
mit
Model card
Files
Files and versions
xet
Community
1
refs/pr/1
OpenRS-GRPO
3.57 GB
1 contributor
History:
25 commits
nielsr
HF Staff
Add library_name to metadata
7f5a5b1
verified
11 months ago
assets
update README
11 months ago
.gitattributes
1.74 kB
update README
11 months ago
LICENSE
1.08 kB
update README and LICENSE
11 months ago
README.md
2.71 kB
Add library_name to metadata
11 months ago
config.json
768 Bytes
Upload model for experiment 1, step 100
11 months ago
generation_config.json
181 Bytes
Upload model for experiment 3, step 50
11 months ago
latest
14 Bytes
Upload model for experiment 3, step 50
11 months ago
model.safetensors
3.55 GB
xet
Upload model for experiment 2, step 300
11 months ago
scheduler.pt
1.06 kB
xet
Upload model for experiment 3, step 50
11 months ago
special_tokens_map.json
485 Bytes
Upload model for experiment 1, step 100
11 months ago
tokenizer.json
11.4 MB
xet
Upload model for experiment 1, step 100
11 months ago
tokenizer_config.json
6.77 kB
Upload model for experiment 2, step 50
11 months ago
trainer_state.json
137 kB
Upload model for experiment 3, step 50
11 months ago
training_args.bin
8.18 kB
xet
Upload model for experiment 2, step 300
11 months ago
zero_to_fp32.py
29.2 kB
Upload model for experiment 3, step 50
11 months ago