๐Ÿง  SpermLLM โ€” Distilled Reasoning Model

Parameters Teacher Method Format License

SpermLLM is a compact distilled reasoning model based on Qwen3-0.6B-Instruct, designed to improve performance in math, coding, and structured reasoning while remaining lightweight and efficient.

Training Method

The model was fine-tuned on a mixture of curated instruction datasets and further distilled from larger teacher models (Mix of GPT-OSS-120B and Kimi K2.5)

Training Overview

  • Base Model: Qwen3 0.6B Instruct
  • Training Method: SFT (Supervised Finetuning) + Distillation

Notes

SpermLLM is an experimental model, We plan on making this larger and better! Currently no benchmarks but benchmarks will be soon!

Downloads last month
152
GGUF
Model size
0.8B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support