reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-440
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-420
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-400
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-380
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-360
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-340
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-320
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-300
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-280
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-260
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-240
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-220
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-200
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-180
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-160
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-140
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-120
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-100
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-80
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-60
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-40
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-newdata0919-adaptive-iter-20
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-360
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-340
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-320
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-300
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-280
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-260
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-240
Text Generation
• 2B • Updated
reinforce-flow/qwen2.5math-1.5b-adaptive-iter-220
Text Generation
• 2B • Updated