SmolLM2-360M-Think-R18 / tokenizer.json
DuoNeural Think Instillation R18 — dead-prompt filtered GRPO, +0.030 over post-SFT
e86cf4d verified
3.52 MB