Qwen3 4B
Forkjoin.ai conversion of Qwen/Qwen3-4B to GGUF format for edge deployment.
Model Details
- Source Model: Qwen/Qwen3-4B
- Format: GGUF
- Converted by: Forkjoin.ai
Usage
With llama.cpp
./llama-cli -m qwen3-4b-gguf.gguf -p "Your prompt here" -n 256
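The command above assumes both the binary and the model file sit in the current directory. A minimal sketch of a wrapper that checks those assumptions first is shown below; the `run_prompt` function and the `MODEL` and `LLAMA_CLI` variables are hypothetical names introduced here for illustration and are not part of llama.cpp.

```shell
#!/bin/sh
# Hypothetical defaults: adjust both paths to match your setup.
MODEL="${MODEL:-./qwen3-4b-gguf.gguf}"
LLAMA_CLI="${LLAMA_CLI:-./llama-cli}"

# run_prompt PROMPT [MAX_TOKENS]
# Checks that the model file and the binary exist, then runs the same
# llama-cli invocation shown above (-m model, -p prompt, -n token limit).
run_prompt() {
    if [ ! -f "$MODEL" ]; then
        echo "model file not found: $MODEL" >&2
        return 1
    fi
    if [ ! -x "$LLAMA_CLI" ]; then
        echo "llama-cli not found or not executable: $LLAMA_CLI" >&2
        return 1
    fi
    "$LLAMA_CLI" -m "$MODEL" -p "$1" -n "${2:-256}"
}
```

After sourcing the script, `run_prompt "Your prompt here" 256` behaves like the direct command, but reports a missing file or binary up front.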
With Ollama
Create a Modelfile:
FROM ./qwen3-4b-gguf.gguf
Then build and run the model:
ollama create qwen3-4b-gguf -f Modelfile
ollama run qwen3-4b-gguf
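The Ollama steps above can be combined into one script. This is a sketch under stated assumptions: `MODEL_FILE` and `MODEL_NAME` are hypothetical variables introduced here, and the script skips the Ollama calls when the `ollama` binary or the GGUF file is missing.

```shell
#!/bin/sh
# Hypothetical defaults: adjust to your file location and preferred model name.
MODEL_FILE="${MODEL_FILE:-./qwen3-4b-gguf.gguf}"
MODEL_NAME="${MODEL_NAME:-qwen3-4b-gguf}"

# Write the one-line Modelfile pointing at the local GGUF file.
printf 'FROM %s\n' "$MODEL_FILE" > Modelfile

# Only call Ollama when both the binary and the model file are present.
if command -v ollama >/dev/null 2>&1 && [ -f "$MODEL_FILE" ]; then
    ollama create "$MODEL_NAME" -f Modelfile
    ollama run "$MODEL_NAME" "Your prompt here"
else
    echo "skipping: ollama or $MODEL_FILE not available" >&2
fi
```

Passing the prompt as an argument to `ollama run` produces a single response and exits; omit it to start an interactive session instead.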
About Forkjoin.ai
Forkjoin.ai runs AI models at the edge: in the browser and on-device, with no cloud inference cost. These converted models power real-time inference, speech recognition, and natural language capabilities.
All conversions are optimized for edge deployment within browser and mobile memory constraints.
License
Apache 2.0 (follows upstream model license)