Qwen3-4B : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

For text only LLMs: ./llama.cpp/llama-cli -hf michalzarnecki/Qwen3-4B --jinja
For multimodal models: ./llama.cpp/llama-mtmd-cli -hf michalzarnecki/Qwen3-4B --jinja

Available Model files:

An Ollama Modelfile is included for easy deployment. This was trained 2x faster with Unsloth

Safetensors

Model size

4B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support