benitomartin/grumpy-chef-lfm2.5-1.2B-bf16 Reinforcement Learning • 1B • Updated 6 days ago • 13