SpiceRL/DRA-GRPO
Tags: Safetensors, qwen2
arXiv: 2505.09655
License: cc-by-4.0
Branch: main
Repository size: 3.57 GB
2 contributors
History: 8 commits
Latest commit: ac317cb, "Update README.md" by xiwenc1 (verified), 7 months ago
File                      Size             Last commit                                            Updated
.gitattributes            1.57 kB          Upload ./tokenizer.json with huggingface_hub           7 months ago
README.md                 260 Bytes        Update README.md                                       7 months ago
config.json               768 Bytes        Upload ./config.json with huggingface_hub              7 months ago
model.safetensors         3.55 GB (xet)    Upload ./model.safetensors with huggingface_hub        7 months ago
special_tokens_map.json   485 Bytes        Upload ./special_tokens_map.json with huggingface_hub  7 months ago
tokenizer.json            11.4 MB (xet)    Upload ./tokenizer.json with huggingface_hub           7 months ago
tokenizer_config.json     6.77 kB          Upload ./tokenizer_config.json with huggingface_hub    7 months ago
training_args.bin         8.18 kB (xet)    Upload ./training_args.bin with huggingface_hub        7 months ago
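The files listed above (config.json, model.safetensors, and the tokenizer files) match the layout that the transformers library usually loads directly from a repository ID. The following is a minimal sketch of such a load, not a method documented by the repository itself. It assumes that transformers and PyTorch are installed and that the checkpoint is a causal language model, which is inferred only from the qwen2 tag. The helper names load_model_and_tokenizer and generate_text, and the EXPECTED_FILES list, are hypothetical and exist only for illustration.

```python
# Repository ID taken from the page header.
REPO_ID = "SpiceRL/DRA-GRPO"

# Files from the listing that a standard transformers load would read.
# training_args.bin is left out: it looks like a training-time artifact
# and is assumed not to be needed for inference.
EXPECTED_FILES = [
    "config.json",
    "model.safetensors",
    "tokenizer.json",
    "tokenizer_config.json",
    "special_tokens_map.json",
]


def load_model_and_tokenizer(repo_id: str = REPO_ID):
    """Download and load the checkpoint (the weights are about 3.55 GB).

    Assumption: the checkpoint is a causal language model, so
    AutoModelForCausalLM is the right loader. The listing does not confirm this.
    """
    # Imported lazily so this module can be imported without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return model, tokenizer


def generate_text(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for `prompt` using the loaded checkpoint."""
    model, tokenizer = load_model_and_tokenizer()
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Calling generate_text("What is 2 + 3?") would trigger the download on first use and cache the files locally. Nothing is downloaded when the block is merely imported.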