mesbahuddin1989
/

SmolLM2-135M-Instruct-GRPO

Text Generation

Generated from Trainer

SmolLM2-135M-Instruct_GRPO

text-generation-inference

Model card Files Files and versions

SmolLM2-135M-Instruct-GRPO / tokenizer.json

Commit History

End of training

8563d7f
verified

mesbahuddin1989 commited on Feb 14, 2025