Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
WangYe007
/
RL
like
0
Model card
Files
Files and versions
xet
Community
main
RL
/
model
/
EasyR1
/
examples
26.4 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
WangYe007
Upload folder using huggingface_hub
d65b589
verified
10 days ago
baselines
Upload folder using huggingface_hub
10 days ago
format_prompt
Upload folder using huggingface_hub
10 days ago
README.md
Safe
7.08 kB
Upload folder using huggingface_hub
10 days ago
config_d_grpo.yaml
2.95 kB
Upload folder using huggingface_hub
10 days ago
config_ema_grpo.yaml
2.33 kB
Upload folder using huggingface_hub
10 days ago
config_ema_grpo_64.yaml
2.97 kB
Upload folder using huggingface_hub
10 days ago
config_grpo.yaml
2.33 kB
Upload folder using huggingface_hub
10 days ago
qwen2_5_7b_math_grpo.sh
211 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_32b_geo3k_grpo.sh
524 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_3b_geo3k_grpo.sh
436 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_7b_geo3k_dapo.sh
579 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_7b_geo3k_grpo.sh
392 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_7b_geo3k_reinforce.sh
545 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_7b_geo3k_swanlab.sh
435 Bytes
Upload folder using huggingface_hub
10 days ago
qwen2_5_vl_7b_multi_image.sh
610 Bytes
Upload folder using huggingface_hub
10 days ago
qwen3_14b_dapo17k_dapo.sh
1.69 kB
Upload folder using huggingface_hub
10 days ago
qwen3_4b_math_grpo.sh
285 Bytes
Upload folder using huggingface_hub
10 days ago
qwen3_vl_30b_geo3k_grpo.sh
524 Bytes
Upload folder using huggingface_hub
10 days ago
runtime_env.yaml
294 Bytes
Upload folder using huggingface_hub
10 days ago