mendeza-umd
/
Qwen2-0.5B-GRPO-script

Model card Files Files and versions
xet
Metrics Training metrics Community