SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published Feb 4, 2025
Finetune Stable Diffusion Models with DDPO via TRL • Article • metric-space, sayakpaul, kashif, lvwerra • Sep 29, 2023