SpiceRL/DRA-DR.GRPO
Safetensors
qwen2
arxiv: 2505.09655
License: cc-by-4.0
Commit History (branch: main)
2e2054a  Update README.md (verified; xiwenc1 committed on Jun 16)
6ab7d4e  Update README.md (verified; xiwenc1 committed on Jun 16)
29200d9  Upload ./tokenizer_config.json with huggingface_hub (verified; BooBooWu committed on May 24)
40266bc  Upload ./special_tokens_map.json with huggingface_hub (verified; BooBooWu committed on May 24)
1de959a  Upload ./config.json with huggingface_hub (verified; BooBooWu committed on May 24)
07c10e9  Upload ./tokenizer.json with huggingface_hub (verified; BooBooWu committed on May 24)
b78a9a2  Upload ./model.safetensors with huggingface_hub (verified; BooBooWu committed on May 24)
d21a453  initial commit (verified; BooBooWu committed on May 24)