๐Ÿง  SpermLLM โ€” Distilled Reasoning Model

Parameters Teacher Method Format License

SpermLLM is a compact distilled reasoning model based on Qwen3-0.6B-Instruct, designed to improve performance in math, coding, and structured reasoning while remaining lightweight and efficient.

Training Method

The model was fine-tuned on a mixture of curated instruction datasets and further distilled from larger teacher models (Mix of GPT-OSS-120B and Kimi K2.5)

Training Overview

  • Base Model: Qwen3 0.6B Instruct
  • Training Method: SFT (Supervised Finetuning) + Distillation

Notes

SpermLLM is an experimental model, We plan on making this larger and better! Currently no benchmarks but benchmarks will be soon!

Downloads last month
9
Safetensors
Model size
0.8B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for SpermAI/SpermAI-S1-Qwen3-0.6B

Quantizations
1 model