maxbsoft/gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-1-new Text Generation • 1.0B • Updated Feb 9 • 2
maxbsoft/gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-2-2 Text Generation • 1.0B • Updated Jan 26 • 1
maxbsoft/gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-3 Text Generation • 1.0B • Updated Jan 26 • 1
maxbsoft/gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-1 Text Generation • 1.0B • Updated Jan 23 • 4 •