Tags: Safetensors · GGUF · English · qwen3 · qlora · reasoning · unsloth · conversational

Qwen3-4B Instruct No-Think

DEPRECATED: Please use VladHong/Qwen3-4B-Instruct-NoThink-V2.1

Finetuned from Qwen/Qwen3-4B-Instruct-2507 using QLoRA with Unsloth. Trained to respond directly, without chain-of-thought: `<think>` blocks were stripped from the training data.

Training Data

  • TeichAI/gemini-3-pro-preview-high-reasoning-250x
  • TeichAI/claude-haiku-4.5-high-reasoning-1700x
  • TeichAI/gemini-3-pro-preview-high-reasoning-1000x

About 18k rows remain after deduplication; `<think>` blocks were stripped before training.
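The preprocessing described above can be sketched as follows. This is a minimal illustration, not the actual training script; the function names and the exact-match deduplication strategy are assumptions:

```python
import re

# Matches a <think>...</think> reasoning block plus any trailing whitespace.
THINK_RE = re.compile(r"<think>.*?</think>\s*", flags=re.DOTALL)

def strip_think(text: str) -> str:
    """Remove <think>...</think> blocks, keeping only the final answer."""
    return THINK_RE.sub("", text).strip()

def dedup(rows: list[str]) -> list[str]:
    """Drop exact-duplicate rows while preserving order."""
    seen: set[str] = set()
    out: list[str] = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            out.append(row)
    return out

rows = [
    "<think>Let me reason...</think>The answer is 4.",
    "<think>Let me reason...</think>The answer is 4.",
    "Plain reply with no reasoning.",
]
cleaned = dedup([strip_think(r) for r in rows])
# cleaned == ["The answer is 4.", "Plain reply with no reasoning."]
```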

Training Details

  • Method: QLoRA (4-bit NF4) + Unsloth
  • LoRA rank: 16
  • Epochs: 1
  • Hardware: RTX 4050 Laptop 6GB

Files

  • *.gguf — IQ4_XS quantized, use in LM Studio / Ollama / llama.cpp
  • lora-adapter/ — Raw LoRA weights for merging with base model

Usage (Ollama)

ollama run VladHong/Qwen3-4B-Instruct-NoThink
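Beyond the CLI, Ollama also serves a local HTTP API. A minimal sketch of calling the model programmatically, assuming a default local install on port 11434 (the helper function is hypothetical, not part of this repo):

```python
import json
import urllib.request

def build_chat_request(model: str, prompt: str) -> dict:
    """Build a request body for Ollama's /api/chat endpoint (non-streaming)."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }

payload = build_chat_request("VladHong/Qwen3-4B-Instruct-NoThink", "What is 2+2?")

# With a local Ollama server running, send it like so:
# req = urllib.request.Request(
#     "http://localhost:11434/api/chat",
#     data=json.dumps(payload).encode(),
#     headers={"Content-Type": "application/json"},
# )
# reply = json.loads(urllib.request.urlopen(req).read())["message"]["content"]
```

Since the model was trained without `<think>` blocks, the reply should contain the answer directly, with no reasoning preamble to strip.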

License Note

Base model is Apache 2.0. Training data includes AI-generated content — review upstream dataset terms before commercial use.
