Qwen3-0.6B (Modified)

A fork of Qwen/Qwen3-0.6B, modified for use as a training target model with PrimeIntellect-ai/verifiers.

Changes

  • Extracted chat_template from tokenizer_config.json into a separate chat_template.jinja file (latest transformers format)
  • Reversed thinking tag logic to enable thinking mode by default (enable_thinking=True)

Original Model

For model architecture, performance, and usage details, refer to Qwen/Qwen3-0.6B.

Downloads last month
75
Safetensors
Model size
0.8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for minpeter/Qwen3-0.6B-Instruct

Finetuned
Qwen/Qwen3-0.6B
Finetuned
(625)
this model