view article Article Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies prithivMLmods • Feb 17, 2025 • 29
docling-project/SmolDocling-256M-preview Image-Text-to-Text • Updated Sep 17, 2025 • 30.9k • 1.61k
Qwen/Qwen2.5-Coder-1.5B-Instruct Text Generation • 2B • Updated Jan 12, 2025 • 325k • • 121