mlxha/Qwen-2.5-3B-grpo-code

Text Generation

Generated from Trainer

text-generation-inference

Qwen-2.5-3B-grpo-code / tokenizer.json

Commit History

Training in progress, step 25

1a12ca7
verified

mlxha committed on Apr 16, 2025