Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SpiceRL
/
DRA-GRPO
like
0
Follow
SpiceRL
3
Safetensors
qwen2
arxiv:
2505.09655
License:
cc-by-4.0
Model card
Files
Files and versions
xet
Community
main
DRA-GRPO
Commit History
Update README.md
ac317cb
verified
xiwenc1
commited on
Jun 16
Upload ./training_args.bin with huggingface_hub
5295be8
verified
BooBooWu
commited on
May 24
Upload ./tokenizer_config.json with huggingface_hub
9206182
verified
BooBooWu
commited on
May 24
Upload ./special_tokens_map.json with huggingface_hub
332ab06
verified
BooBooWu
commited on
May 24
Upload ./config.json with huggingface_hub
c2b11f2
verified
BooBooWu
commited on
May 24
Upload ./tokenizer.json with huggingface_hub
3d2a7cc
verified
BooBooWu
commited on
May 24
Upload ./model.safetensors with huggingface_hub
6a8581c
verified
BooBooWu
commited on
May 24
initial commit
06abcd7
verified
BooBooWu
commited on
May 24