Qwen3 4B

Forkjoin.ai conversion of Qwen/Qwen3-4B to GGUF format for edge deployment.

Model Details

Source Model: Qwen/Qwen3-4B
Format: GGUF
Converted by: Forkjoin.ai

Usage

With llama.cpp

./llama-cli -m qwen3-4b-gguf.gguf -p "Your prompt here" -n 256
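
The command above can be wrapped in a small helper that fails early when the model file is missing. This is a minimal sketch, not part of llama.cpp: the function name `run_qwen3` and the `LLAMA_CLI` override variable are hypothetical, and it assumes `llama-cli` sits in the current directory as in the command above.

```shell
#!/bin/sh
# Sketch: run a GGUF model with llama-cli after checking the file exists.
# run_qwen3 and LLAMA_CLI are illustrative names, not llama.cpp features.

run_qwen3() {
    model="$1"
    prompt="$2"
    n_predict="${3:-256}"   # default matches the -n 256 used above

    if [ ! -f "$model" ]; then
        echo "model file not found: $model" >&2
        return 1
    fi

    # -m: model path, -p: prompt, -n: number of tokens to generate
    "${LLAMA_CLI:-./llama-cli}" -m "$model" -p "$prompt" -n "$n_predict"
}
```

For example, `run_qwen3 qwen3-4b-gguf.gguf "Your prompt here"` would behave like the plain command, but print an error and return non-zero if the file path is wrong.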

With Ollama

Create a Modelfile:

FROM ./qwen3-4b-gguf.gguf

Then create and run the model:

ollama create qwen3-4b-gguf -f Modelfile
ollama run qwen3-4b-gguf
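
A Modelfile can also carry default generation settings through `PARAMETER` lines. The sketch below generates one from the shell. The helper name `write_modelfile` is hypothetical, and the `temperature` and `num_ctx` values are illustrative assumptions, not settings recommended by Forkjoin.ai or the upstream model.

```shell
#!/bin/sh
# Sketch: write a Modelfile that points at the GGUF file and sets two
# optional parameters. The values below are placeholders to adjust.

write_modelfile() {
    gguf_path="$1"
    out="${2:-Modelfile}"

    cat > "$out" <<EOF
FROM $gguf_path
PARAMETER temperature 0.7
PARAMETER num_ctx 4096
EOF
}
```

Calling `write_modelfile ./qwen3-4b-gguf.gguf` produces a `Modelfile` in the current directory, which can then be passed to `ollama create qwen3-4b-gguf -f Modelfile` as shown above.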

About Forkjoin.ai

Forkjoin.ai runs AI models at the edge: in the browser and on-device, with zero cloud cost. These converted models power real-time inference, speech recognition, and natural language capabilities.

All conversions are optimized for edge deployment within browser and mobile memory constraints.
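
To gauge whether a model fits a given memory budget, a rough weights-only estimate is parameters times bits per weight, divided by 8 to get bytes. The sketch below assumes the nominal figures listed for this model (4B parameters, 8-bit). The helper name `estimate_weights_gib` is hypothetical, and the result ignores the KV cache, runtime overhead, and the extra scale data that quantized formats store, so actual usage will be somewhat higher.

```shell
#!/bin/sh
# Sketch: weights-only memory estimate in GiB.
#   $1 = parameter count in billions, $2 = bits per weight
# This is back-of-the-envelope arithmetic, not a measurement.

estimate_weights_gib() {
    params_billions="$1"
    bits_per_weight="$2"

    # params * bits / 8 gives bytes; 1073741824 bytes = 1 GiB
    awk -v p="$params_billions" -v b="$bits_per_weight" \
        'BEGIN { printf "%.1f\n", p * 1e9 * b / 8 / 1073741824 }'
}
```

With these assumptions, `estimate_weights_gib 4 8` prints `3.7`, meaning roughly 3.7 GiB for the weights alone.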

License

Apache 2.0 (follows upstream model license)

GGUF Metadata

Model size: 4B params
Architecture: qwen3
Hardware compatibility: 8-bit

Model tree for forkjoin-ai/qwen3-4b-gguf

Base model: Qwen/Qwen3-4B-Base
Finetuned: Qwen/Qwen3-4B
Quantized: this model