Uploaded finetuned model
- Developed by: koutch
- License: apache-2.0
- Finetuned from model : unsloth/SmolLM3-3B
This smollm3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 4
Model tree for koutch/short_paper_smol_0.json_train_grpo_v1_dev
Base model
HuggingFaceTB/SmolLM3-3B-Base Finetuned
HuggingFaceTB/SmolLM3-3B Finetuned
unsloth/SmolLM3-3B